Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, embodiments of the present invention will be described in detail below with reference to the accompanying drawings. However, it will be appreciated by those of ordinary skill in the art that numerous technical details are set forth in order to provide a better understanding of the present application in various embodiments of the present invention. However, the technical solution claimed in the present application can be implemented without these technical details and various changes and modifications based on the following embodiments.
The first embodiment of the invention relates to an audio recording method, which is applied to an intelligent recording device, wherein the intelligent recording device can be any device with a recording function, such as a sound box, a recording pen, a recording rod, a plane recording plate and the like. The specific process is shown in fig. 1, and comprises the following steps:
step 101, the intelligent recording device establishes communication connection with the N mobile terminals in advance. Wherein N is a natural number greater than or equal to 1.
For example, if 20 users in a speech need to record the speech, before the speech starts, the 20 users may establish a communication connection between a mobile terminal (e.g., a mobile phone) and an intelligent recording device (e.g., an intelligent speaker) through bluetooth or the like. Preferably, in a preset time before a speech or a conference begins, the user can remind the user of establishing communication connection with the intelligent recording device through the loudspeaker of the intelligent recording device, so that the situation that the audio content cannot be marked by dotting due to the fact that the audio content is not connected with the intelligent recording device in advance in time is avoided. Of course, if the user establishes a communication connection with the intelligent recording device after the audio information is recorded due to reasons such as late arrival, it is also feasible, and it is only impossible to perform dotting marking on the audio content before establishing a communication connection with the intelligent recording device.
It is worth mentioning that after the intelligent recording device establishes communication connection with the mobile terminal in advance, the intelligent recording device records the acquired audio information, and the mobile terminal of the user is not required to perform recording operation on the audio information. Therefore, after the recording of the intelligent recording equipment is finished, the recorded audio file is directly sent to the mobile terminal in communication connection with the intelligent recording equipment, and power consumption of the mobile terminal is saved.
And 102, recording the acquired audio information, and judging whether a dotting marking instruction sent by any one of the N mobile terminals is received. If the dotting marking instruction sent by any one of the N mobile terminals is judged to be received, entering step 103; otherwise, step 105 is entered.
Here, a dotting instruction sent by 2 users a and b to audio information (such as lecture content) is taken as an example for explanation: when the teacher is heard to speak for the 5 th minute (for example, the first question is the first question), the first does not understand, so that the first sends a dotting and marking instruction through the mobile phone, and when the intelligent recording device receives the dotting and marking instruction in the recording process of the acquired audio information, the audio information is dotted and marked at the current time point (the dotting and marking are performed in the 5 th minute), namely step 103. Similarly, when the teacher speaks the 30 th minute (for example, the sixth topic at this time), the second does not understand, so that the second sends the dotting instruction through the tablet pc, and after the intelligent recording device receives the dotting instruction, the audio information is dotted at the current time point (the dotting is performed in the 30 th minute), that is, step 103. If no mobile terminal sends the dotting marking instruction in the whole process of recording the collected audio information, the step 105 is entered.
It should be noted that the above is only an example, and the number of the mobile terminals for sending the dotting instruction and the application scenario are not limited in this embodiment, for example, a place in which the speech content is interested may be dotted, a knowledge point that the user considers important may be dotted, and the like.
And 103, dotting and marking the audio information at the current time point.
That is, when a dotting instruction sent by any one of the N mobile terminals is received, the audio information is dotted.
It should be noted that, when the audio information is marked at the current time point, if the content of the mark sent by the mobile terminal is received, for example: when the user pays attention to memorization, question, discussion and the like, the mark content is added at the position where the audio information is doted and marked at the current time point, so that the user can easily think of the reason for dotting and marking at the position in the later playback process. Preferably, after the intelligent recording device establishes a communication connection with the mobile terminal in advance, preset keywords may be provided in the mobile terminal, for example: the user can directly click the required keywords for marking when sending the dotting marks through the mobile terminal according to the requirements, and the dotting efficiency of the user is favorably improved.
It is worth mentioning that, as in the prior art, the dotting marking performed by the embodiment has no time delay, for example, if the third minute of the audio file is dotted, the server sends the accurate dotting time point to the intelligent recording device, and the intelligent recording device performs the dotting in the fifth minute.
And 104, generating an audio file with the marking information when the recording is finished.
Specifically, the generated audio file with the tag information is an audio file carrying dotting and tagging instructions issued by all listeners. As illustrated by way of example in step 102, an audio file with the tag information for the 5 th minute (e.g., the first topic in this case) and the tag information for the 30 th minute (e.g., the sixth topic in this case) is generated. By doing so, not only be favorable to the audiences oneself to record and master the understanding of audio content, still be favorable to the audiences to know other audiences and mark the information to the marking of beating of this audio content, especially be favorable to the audio file that the sound producer has mark information through the formation and have a macroscopic understanding according to the mark information of audiences, can carry out the analysis to the speech content of oneself according to the audio file that has mark information, thereby can adjust the key point of next speech, meeting content order etc. for example the mr can know the condition of mastering of the audiences to knowledge point, thereby carry out key lecture to the more knowledge point of doubting.
And 105, directly generating an audio file when the recording is finished.
That is to say, if a dotting instruction sent by any one of the mobile terminals is not received in the process of recording the acquired audio information by the intelligent recording device, the recorded content is directly generated into an audio file.
And step 106, sending the audio file to the N mobile terminals.
Namely, if the audio file with the tag information is the audio file with the tag information, the audio file with the tag information is sent to the N mobile terminals, and if the dotting and tagging instruction sent by any one mobile terminal is not received in the process of recording the acquired audio information by the intelligent recording equipment, the recorded content is directly generated into the audio file to be sent to the N mobile terminals.
Compared with the prior art, the embodiment establishes communication connection with N mobile terminals in advance through the intelligent recording equipment, wherein N is a natural number greater than or equal to 1; in the process of recording the acquired audio information by the intelligent recording equipment, if a dotting marking instruction sent by any one of the N mobile terminals is received, dotting and marking the audio information at the current time point, and generating an audio file with marked information when the recording is finished; and sending the audio file to the N mobile terminals. After the intelligent recording equipment establishes communication connection with the mobile terminal in advance, the intelligent recording equipment records the acquired audio information, and the mobile terminal of a user is not required to perform recording operation on the audio information. Therefore, after the recording of the intelligent recording equipment is finished, the recorded audio file is directly sent to the mobile terminal in communication connection with the intelligent recording equipment, and the power consumption of the mobile terminal is saved; if there is a dotting marking instruction sent by any mobile terminal, the generated file is an audio file with marking information, which is not only beneficial for the listeners to record and master the knowledge of the audio content themselves, but also beneficial for the listeners to master the dotting marking information of the audio content of other listeners, and is especially beneficial for the speakers to have a macroscopic knowledge according to the marking information of the listeners through the generated audio file with marking information, and to analyze the voice content themselves according to the audio file with marking information, so that the next speaking focus, meeting content sequence and the like can be adjusted, for example, the teacher can know the knowledge point master condition of the listeners, and thus, the more knowledge points are subjected to focused detailed teaching.
A second embodiment of the present invention relates to an audio recording method. The embodiment is further improved on the basis of the first embodiment, and the specific improvement is as follows: in the embodiment, after the intelligent recording device establishes communication connection with the N mobile terminals in advance, the device identification of the mobile terminal is further acquired; in the process of recording the collected audio information by the intelligent recording equipment, if a dotting marking instruction sent by any one of the N mobile terminals is received, dotting and marking the audio information at the current time point according to the equipment identification of the mobile terminal, generating an audio file with marking information corresponding to the equipment identification of the mobile terminal when recording is finished, and sending the audio file to the mobile terminal corresponding to the equipment identification. In the embodiment, the recorded audio file is subjected to dotting marking according to the equipment identifier of the user, the audio file with the marking information corresponding to the equipment identifier is generated, and the recorded audio file is only sent to the mobile terminal corresponding to the equipment identifier, namely, the audio file finally obtained by the user only has the dotting mark of the user, but is not attached with the dotting marks of other users, so that the user can quickly find the dotting mark of the user in the audio playback process, and the marked content is analyzed and mastered in a targeted manner. A specific flow in the present embodiment is shown in fig. 2, and includes:
step 201, the intelligent recording device establishes communication connection with the N mobile terminals in advance. N is a natural number greater than or equal to 1.
Since step 201 in this embodiment is substantially the same as step 101 in the first embodiment. It should be mentioned that the intelligent recording device in this embodiment may be an intelligent sound box with a microphone array, and those skilled in the art can understand that the microphone array can filter sound waves by using the difference between the phases of sound waves received by two microphones, so as to filter the environmental background sound to the maximum extent, and only the required sound waves are left. The intelligent sound box adopting the configuration in a noisy environment can enable recorded audio to be free of noise and clearer.
Step 202, acquiring the device identifier of the mobile terminal.
Specifically, the device identifier of the mobile terminal may be a device number of the mobile terminal when the mobile terminal leaves a factory, or may also be information such as a device identifier number and a nickname set for a user, as long as the intelligent recording device can distinguish identifiers of mobile terminals, which is not specifically limited herein.
And 203, recording the acquired audio information, and judging whether a dotting marking instruction sent by any one of the N mobile terminals is received. If the dotting marking instruction sent by any one of the N mobile terminals is judged to be received, entering step 204; otherwise, step 206 is entered.
Step 203 in the present embodiment is substantially the same as step 102 in the first embodiment. Note that, in the present embodiment, the dotting instruction may be generated by a dotting operation of the user; wherein the dotting and marking actions can be detected by one or a combination of the following: gyroscope, touch screen, gravity accelerometer.
For example, the dotting marking action when the user taps the mobile phone 2 may be set, and if the mobile phone is detected to be tapped 2, the dotting marking action of the user is considered to be detected. As long as the dotting and marking actions of the user are easy to implement. For example, an accelerometer or gyroscope may be used to detect a user shaking or tapping in a particular motion. Preferably, in this embodiment, even when the display interface of the mobile terminal is hidden due to the power saving mode (in a so-called off-screen state), the marking can be completed without waking up the display screen. Under the condition, the mobile terminal needs to be provided with an accelerometer or a gyroscope, and the current mobile phones, tablet computers and the like are provided with the device. The detection can also be carried out in a mode of combining the gyroscope and the touch screen, and the detection can avoid the mistaken touch of a user. In addition, many mobile terminals are provided with keys, so that a one-key pressing mode or a double-key pressing mode is feasible as the dotting marking action. And the mode that the double keys are pressed down simultaneously can also avoid the mistaken pressing of the user.
And step 204, according to the equipment identification of the mobile terminal, dotting and marking the audio information at the current time point.
For example, the user a is interested in the content of the speech in the 10 th minute, the user B is interested in the content of the speech in the 20 th minute, and the device identifiers of the user a and the user B are different, so the intelligent sound recording device performs the dotting marking on the audio information in the 10 th minute according to the dotting marking instruction sent by the mobile terminal of the user a; and carrying out dotting marking on the audio information in the 20 th minute according to a dotting marking instruction sent by the mobile terminal of the user B. The difference from the first embodiment is that in the first embodiment, the marking is performed as long as the dotting instruction is received, and the device identifier of the mobile terminal that transmitted the dotting instruction is not distinguished.
And step 205, when the recording is finished, generating an audio file with the mark information corresponding to the equipment identification of the mobile terminal.
This is still illustrated here by way of example in step 204: that is, when the recording is finished, two audio files are generated according to the difference of the dotting instructions sent by the user a and the user B, wherein one audio file is generated by dotting and marking the audio information at the 10 th minute according to the dotting instruction sent by the mobile terminal of the user a, and the other audio file is generated by dotting and marking the audio information at the 20 th minute according to the dotting instruction sent by the mobile terminal of the user B.
In addition, after the audio file with the mark information corresponding to the equipment identifier of the mobile terminal is generated, if the audio file request message which is sent by the mobile terminal and carries the target equipment identifier is received, the audio file corresponding to the target equipment identifier is sent to the mobile terminal.
Specifically, the user can obtain the audio file after the audio file is marked by dotting by the appointed user through inputting the device identification number. For example, the following steps: the first user and the second user are learning partners, so that the first user can input the equipment identification number of the second user at the mobile terminal of the first user to obtain the marking information of the second user on the recorded audio file, and the second user can also obtain the marking information of the first user on the recorded audio file, thereby being beneficial to pertinently guiding important and difficult points between the first user and the second user in the later learning process, namely being beneficial to promoting communication and learning among a plurality of users through the audio file. It is worth mentioning that the premise of obtaining the audio file after the audio file is dotted and marked by other users is to know the target device identifier, that is, the audio file after the audio file is dotted and marked can be obtained only after the user approves and informs the device identifier, so that the privacy of the audio file after the audio file is dotted and marked by the user can be protected.
And step 206, directly generating an audio file when the recording is finished.
Since step 206 in this embodiment is substantially the same as step 105 in the first embodiment, it is not repeated here.
And step 207, sending the audio file to the mobile terminal corresponding to the equipment identifier.
That is, when the recording is finished, two audio files are generated according to the difference of the dotting instructions sent by the user a and the user B, the audio file generated by dotting and marking the audio information at the 10 th minute according to the dotting instruction sent by the mobile terminal of the user a is sent to the mobile terminal of the user a, and the audio file generated by dotting and marking the audio information at the 20 th minute according to the dotting instruction sent by the mobile terminal of the user B is sent to the mobile terminal of the user B. The A user and the B user receive different audio files with the mark information because the dotting places are different.
Compared with the prior art, the audio recording method provided by the embodiment further acquires the equipment identifier of the mobile terminal after the intelligent recording equipment establishes communication connection with the N mobile terminals in advance; in the process of recording the collected audio information by the intelligent recording equipment, if a dotting marking instruction sent by any one of the N mobile terminals is received, dotting and marking the audio information at the current time point according to the equipment identification of the mobile terminal, generating an audio file with marking information corresponding to the equipment identification of the mobile terminal when recording is finished, and sending the audio file to the mobile terminal corresponding to the equipment identification. In the embodiment, the recorded audio file is subjected to dotting marking according to the equipment identifier of the user, the audio file with the marking information corresponding to the equipment identifier is generated, and the recorded audio file is only sent to the mobile terminal corresponding to the equipment identifier, namely, the audio file finally obtained by the user only has the dotting mark of the user, but is not attached with the dotting marks of other users, so that the user can quickly find the dotting mark of the user in the audio playback process, and the marked content is analyzed and mastered in a targeted manner.
A third embodiment of the present invention relates to an audio recording method. The embodiment is further improved on the basis of the first embodiment, and the specific improvement is as follows: in the embodiment, before the intelligent recording equipment records the collected audio information, the audio information pre-recorded by the user is received, the voice print identification is carried out on the pre-recorded audio information, and the voice print identification result is used as a voice print object; performing voiceprint recognition on the collected audio information, and judging whether a result of the voiceprint recognition on the collected audio information is matched with a voiceprint object; and when the voiceprint recognition result of the acquired audio information is matched with the voiceprint object, recording the acquired audio information. By doing so, be favorable to the pertinence to the speech information that specific vocal person sent record, avoided recording other vocal person's speech information, not only be favorable to filtering unimportant speech information that other vocal persons sent, still be favorable to making the audio file of recording clearer. The specific process is shown in fig. 3, and includes:
step 301, the intelligent sound recording device establishes communication connection with the N mobile terminals in advance. N is a natural number greater than or equal to 1.
And step 302, receiving audio information pre-entered by a user, performing voiceprint recognition on the pre-entered audio information, and taking a voiceprint recognition result as a voiceprint object.
Specifically, the audio information pre-entered by the receiving user may be a sentence entered by the lecturer or the host of the conference on site, or a sentence played by the terminal device, which is not limited in this respect. The target audio information in the process of acquiring the audio information by the intelligent recording equipment can be obtained. Since voiceprint recognition belongs to the prior art, detailed description is omitted here.
And 303, performing voiceprint recognition on the acquired audio information, and judging whether a result of the voiceprint recognition on the acquired audio information is matched with a voiceprint object. If the result of voiceprint recognition on the acquired audio information is judged to be matched with the voiceprint object, the step 304 is carried out; otherwise, the process ends.
That is, if the result of the voiceprint recognition of the acquired audio information does not match the voiceprint object, the acquired audio information is not recorded, and only when the acquired audio information matches the voiceprint object, the recording is performed. For example, the audio information of the teacher in the king is input in advance, the voiceprint object is obtained by carrying out voiceprint recognition on the audio information, if the audio information of the classmate plum is acquired, the result of the voiceprint recognition obtained by the voiceprint recognition is not matched with the voiceprint object, and the audio information is not recorded.
And step 304, recording the acquired audio information, and judging whether a dotting marking instruction sent by any one of the N mobile terminals is received. If the dotting marking instruction sent by any one of the N mobile terminals is judged to be received, the step 305 is entered; otherwise, go to step 307.
Step 305, dotting mark is carried out on the audio information at the current time point.
Step 306, when the recording is finished, generating an audio file with the marking information.
It should be noted that before the recording is finished, whether a preset condition is met may also be detected; and if the preset condition is met, generating an audio file according to the dotting instruction received at present, and sending the audio file generated according to the dotting instruction received at present to a preset terminal. The preset conditions referred to herein may include: and receiving an ending instruction sent by the preset terminal, and/or reaching the recording time set by the preset terminal. That is, if a certain speaker leaves the conference 30 minutes after the conference and the conference is not yet finished, the audio file generated by the dotting instruction received 30 minutes, which is the part that the speaker has participated in, can be transmitted to the speaker. Or, if the total time of a lecture is 2 hours, the user can split the 2-hour audio file into two audio files with 1 hour of time, and only needs to set the recording time to be 1 hour, by using the method, if an audio file is generated every 1 hour, at this time, if the lecture actually speaks for 40 minutes in 1 hour, an audio file with 1 hour of time is generated, and an audio file with 40 minutes of time is generated. Of course, the above description is merely illustrative and not restrictive.
Step 307, when the recording is finished, directly generating an audio file.
And step 308, sending the audio file to the N mobile terminals.
Since steps 301, 304 to 308 in this embodiment are substantially the same as steps 101 to 106 in the first embodiment, it is intended that the intelligent sound recording apparatus establishes communication connection with N mobile terminals in advance, where N is a natural number greater than or equal to 1; in the process of recording the acquired audio information by the intelligent recording equipment, if a dotting marking instruction sent by any one of the N mobile terminals is received, dotting and marking the audio information at the current time point, and generating an audio file with marked information when the recording is finished; and sending the audio file to the N mobile terminals. And will not be described in detail herein.
Compared with the prior art, the audio recording method provided by the embodiment receives the audio information pre-recorded by the user before the intelligent recording equipment records the acquired audio information, performs voiceprint recognition on the pre-recorded audio information, and takes a voiceprint recognition result as a voiceprint object; performing voiceprint recognition on the collected audio information, and judging whether a result of the voiceprint recognition on the collected audio information is matched with a voiceprint object; and when the voiceprint recognition result of the acquired audio information is matched with the voiceprint object, recording the acquired audio information. By doing so, be favorable to the pertinence to the speech information that specific vocal person sent record, avoided recording other vocal person's speech information, not only be favorable to filtering unimportant speech information that other vocal persons sent, still be favorable to making the audio file of recording clearer.
The steps of the above methods are divided for clarity, and the implementation may be combined into one step or split some steps, and the steps are divided into multiple steps, so long as the same logical relationship is included, which are all within the protection scope of the present patent; it is within the scope of the patent to add insignificant modifications to the algorithms or processes or to introduce insignificant design changes to the core design without changing the algorithms or processes.
The fourth embodiment of the invention relates to an intelligent sound recording device, as shown in fig. 4, comprising at least one processor 401; and a memory 402 communicatively coupled to the at least one processor 401; the memory 402 stores instructions executable by the at least one processor 401, and the instructions are executable by the at least one processor 401 to enable the at least one processor 401 to perform the audio recording method as described above.
Where the memory 402 and the processor 401 are coupled by a bus, which may include any number of interconnected buses and bridges that couple one or more of the various circuits of the processor 401 and the memory 402 together. The bus may also connect various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. A bus interface provides an interface between the bus and the transceiver. The transceiver may be one element or a plurality of elements, such as a plurality of receivers and transmitters, providing a means for communicating with various other apparatus over a transmission medium. The data processed by the processor 401 may be transmitted over a wireless medium via an antenna, which may receive the data and transmit the data to the processor 401.
The processor 401 is responsible for managing the bus and general processing and may provide various functions including timing, peripheral interfaces, voltage regulation, power management, and other control functions. And memory 402 may be used to store data used by processor 401 in performing operations.
A fifth embodiment of the present invention relates to a computer-readable storage medium storing a computer program. The computer program realizes the above-described method embodiments when executed by a processor.
That is, as can be understood by those skilled in the art, all or part of the steps in the method for implementing the embodiments described above may be implemented by a program instructing related hardware, where the program is stored in a storage medium and includes several instructions to enable a device (which may be a single chip, a chip, or the like) or a processor (processor) to execute all or part of the steps of the method described in the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
It will be understood by those of ordinary skill in the art that the foregoing embodiments are specific examples for carrying out the invention, and that various changes in form and details may be made therein without departing from the spirit and scope of the invention in practice.