CN106921802B - Audio data playing method and device - Google Patents

Audio data playing method and device Download PDF

Info

Publication number
CN106921802B
CN106921802B CN201710161455.5A CN201710161455A CN106921802B CN 106921802 B CN106921802 B CN 106921802B CN 201710161455 A CN201710161455 A CN 201710161455A CN 106921802 B CN106921802 B CN 106921802B
Authority
CN
China
Prior art keywords
playing
target audio
speed
operation instruction
speech rate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710161455.5A
Other languages
Chinese (zh)
Other versions
CN106921802A (en
Inventor
孙策
乔立君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Neusoft Corp
Original Assignee
Neusoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Neusoft Corp filed Critical Neusoft Corp
Priority to CN201710161455.5A priority Critical patent/CN106921802B/en
Publication of CN106921802A publication Critical patent/CN106921802A/en
Application granted granted Critical
Publication of CN106921802B publication Critical patent/CN106921802B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8106Monomedia components thereof involving special audio data, e.g. different tracks for different languages

Abstract

The invention discloses a method and a device for playing audio data, relates to the technical field of information processing, and mainly aims to solve the problem that in the prior art, when a user fast forwards played audio in an application program, the content of a target audio can be directly skipped in the fast forwarding process, so that the user cannot know the audio content in the fast forwarding process. The technical scheme of the invention comprises the following steps: acquiring an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed; adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level; and after the adjusted target audio is played, restoring the normal playing speed of the target audio.

Description

Audio data playing method and device
Technical Field
The present invention relates to the field of information processing technologies, and in particular, to a method and an apparatus for playing audio data.
Background
Along with the explosive growth and rapid popularization of mobile terminal users, especially the wide use and popularization of smart phones, the smart phones become indispensable portable digital assistants in daily life of people.
At present, more and more applications are collected in the smart phone, for example, a listening application is installed in the smart phone, and the listening application plays a target audio file in a form of voice. The book listening application program is provided with a fast forward function, for example, the reading duration of a target audio file in the book listening application program is 05:00:20, and a user fast forwards the reading duration to 10:00:03 by dragging a progress bar, so that the book listening application program starts to play voice from the time of 10:00:03 of the target audio file, but audio content in a time interval of 05:00:21-10:00:02 is not played. Although the listening book application allows the user to fast forward the voice playing according to the requirement, the content of the target audio is directly skipped over during the fast forward process, so that the user cannot know the audio content during the fast forward process.
Disclosure of Invention
In view of this, the present invention provides a method and an apparatus for playing audio data, which mainly aim to solve the problem in the prior art that when a user fast forwards a played audio in an application, the content of a target audio is directly skipped over during fast forwarding, so that the user cannot know the audio content during fast forwarding.
According to an aspect of the present invention, the present invention provides a method for playing audio data, including:
acquiring an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed;
adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level;
and after the adjusted target audio is played, restoring the normal playing speed of the target audio.
According to another aspect of the present invention, there is provided an audio data playback apparatus, comprising:
the first obtaining unit is used for obtaining an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed;
the adjusting unit is used for adjusting the playing speed of speech of the target audio in the interval to be adjusted according to the speed of speech grade acquired by the first acquiring unit;
and the restoring unit is used for restoring the normal playing speed of the target audio after the target audio adjusted by the adjusting unit is played.
By means of the technical scheme, the method and the device for playing the audio data, provided by the invention, obtain an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed; adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level; after the adjusted target audio is played, recovering the normal playing speed of the target audio; compared with the prior art, when the target audio is adjusted, the method and the device realize the adjustment by adjusting the playing speech speed in the interval to be adjusted instead of skipping the audio content in the fast forward or slow down process, so that a user can not only realize the fast forward or slow down of the target audio, but also know the brief description of the fast forward or slow down content.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
fig. 1 is a flowchart illustrating a method for playing audio data according to an embodiment of the present invention;
FIG. 2 is a diagram illustrating displacement directions generated by an operation command according to an embodiment of the present invention;
FIG. 3 is a schematic diagram illustrating an operation of a user in a touch screen of a terminal according to an embodiment of the present invention;
FIG. 4 is a schematic diagram illustrating another operation of a user in a touch screen of a terminal according to an embodiment of the present invention;
fig. 5 is a block diagram illustrating a playing apparatus for audio data according to an embodiment of the present invention;
fig. 6 is a block diagram illustrating another audio data playing apparatus according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
An embodiment of the present invention provides a method for playing audio data, where the method is applied to a client, and as shown in fig. 1, the method includes:
101. the client acquires an operation instruction for adjusting the playing speed of the target audio.
It should be noted that the client according to the embodiment of the present invention may include, but is not limited to, the following contents, for example: the system comprises a book listening client, a video client, a music client and the like, wherein the clients need to be installed in terminal equipment, and the terminal equipment comprises a smart phone, a tablet computer, a personal computer and the like. The following description will be given by taking the client as a listening book client and the terminal device as a mobile phone, but it should be clear that the description is not intended to limit the specific types of the client and the terminal device.
When a user listens to audio through a book listening Application (APP) installed in a touch screen mobile phone and performs fast forward or slow down of the audio through operation on the touch screen, the user operates through a preset operation gesture on the user plane to realize fast play or slow down play of target audio. From the machine perspective, the client receives an operation instruction for adjusting the playing speed of the target audio sent to the client by a user through the touch screen, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed.
It should be noted that, in the embodiment of the present invention, when performing fast-forward playing or slow-down playing of a target audio, a to-be-adjusted interval included in an operation instruction may be only a part of the target audio, or may also be all the target audio, where the part is determined according to an actual requirement of a user, and a specific embodiment of the present invention is not limited. Whether the section to be adjusted of the adjustment target audio is partially adjusted or fully adjusted, the section to be adjusted comprises an adjustment starting point and an adjustment end point, and is a continuous section; for example, assuming that the total duration of the target audio is 60:00, when the target audio is played to 25:00, the speech rate is accelerated to play, and when the target audio is played to 31:00, the normal playing of the target audio is resumed, then the interval to be adjusted is: 25:00-31:00, wherein 25:00 of the target audio time length is an adjustment starting point, and 31:00 is an adjustment end point.
It should be noted that, when the to-be-adjusted interval in the operation instruction is acquired, an important point is to convert the operation of the user in the touch screen into the to-be-adjusted interval. Taking an operation instruction as a sliding operation as an example, when the user slides horizontally in the touch screen for a distance of M centimeters and turns into a section to be adjusted, the corresponding fast forward/slow playback time length is N minutes, and the like, the above description does not limit the type of the operation instruction and the operation limitation for adjusting the fast forward/slow playback of audio playing, but gives a description of the conversion relationship between the operation instruction and the section to be adjusted.
In the specific implementation process, the operation instruction and the speech rate level need to be configured in advance. The operation instruction may include, but is not limited to, a drag operation, a rotation operation, a multi-touch operation, a click operation, and the like; different operation instructions are different in the mode of converting the operation instructions into the intervals to be adjusted, and under the normal condition, the larger the relative displacement of the operation instructions in the touch screen is, the larger the corresponding interval to be adjusted is, the smaller the relative displacement of the operation instructions in the touch screen is, and the smaller the corresponding interval to be adjusted is; the speech rate level also comprises a plurality of levels, such as a first level, a second level and a third level; or, the lower the level value is, the lower the corresponding playing speed is; the larger the grade value is, the larger the playing speed is. Specifically, the embodiment of the present invention does not specifically limit the preset operation finger, the interval to be adjusted, and the speed level.
102. And the client adjusts the playing speed of speech of the target audio in the interval to be adjusted according to the speed of speech grade.
The audio playback speech rate is the number of words or the speed of the words played in a unit time, and for example, it is assumed that the number of broadcast words in normal audio playback is 300. The types of target audio are different, and the corresponding speech rates are also different.
In the embodiment of the invention, when the client plays the target audio quickly or slowly, the client is related to the speech speed grade information in the operation instruction. In a specific implementation process, the speech rate level may be a specific numerical value corresponding to a speech rate level, or may be a numerical range corresponding to a speech rate level. If the operation command is a drag operation command, the speech rate level can be determined by the speed of the drag, for example, if the drag speed is S1, the corresponding speech rate level is low, and if the drag speed is S2, the corresponding speech rate level is medium. If the operation instruction is a rotation operation instruction, the speech rate grade can be determined through the number of turns of rotation, and if the number of turns of rotation to the right is 2 turns, the quick-playing speech rate grade is a middle grade; if the left turn is 1 turn, the slow playback rate level is low. Specifically, the embodiment of the present invention does not limit the speed level.
And the client acquires a corresponding audio file from a path corresponding to the target audio, and adjusts the playing speed of the target audio in the interval to be adjusted according to the speed level. For example, assume that the normal playing speech rate of the target audio is 340 words/minute, and the playing time of the target audio is 00: in the interval to be adjusted of 00-15:00, when a user needs to slowly play the target audio, the client determines that the speech rate grade is high (220 characters/minute), and performs slow adjustment on the target audio in the interval to be adjusted according to the speech rate grade. The above description is only exemplary, and the embodiments of the present invention are not limited thereto.
103. And after the adjusted target audio is played, the client recovers the normal playing speed of the target audio.
Following the example of step 102, when the speech rate level is determined to be high (220 words/min), then the playback is slowed based on the adjusted target audio, i.e., the target audio is played at the playback speech rate of 220 words per minute from the adjustment start point in the interval to be adjusted of the target audio. It should be noted that, when the slow play of the target audio is performed, the embodiment of the present invention does not perform reverse play on the content that is originally played in the forward order, but refers to the slow play speed. When the target audio in the interval to be adjusted is played completely, the normal playing of the target audio is resumed, and when the user has a demand for fast forward or slow playback, the method shown in fig. 1 can be executed in a circulating manner until the target audio is played completely.
The audio data playing method provided by the embodiment of the invention obtains an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed; adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level; after the adjusted target audio is played, recovering the normal playing speed of the target audio; compared with the prior art, when the target audio is adjusted, the method and the device realize the adjustment by adjusting the playing speed in the interval to be adjusted instead of skipping the audio content in the fast forwarding or slow down process, so that the user can not only realize the fast forwarding or slow down of the target audio, but also know the brief description of the fast forwarding or slow down content.
At present, with the continuous development of terminal devices, terminals are more and more abundant, the embodiment of the present invention will be described by taking a terminal with a specific touch function and a terminal with a physical keyboard as examples, and for different operation objects, different identification methods need to be adopted when a client obtains an operation instruction for adjusting the playing speech rate of a target audio, and specifically include:
(1) and if the terminal is provided with a touch screen, acquiring an operation instruction for adjusting the playing speed of the target audio.
Firstly, a client displays a playing interface of a target audio based on the touch screen; monitoring the touch screen, and verifying whether an instruction triggered based on the touch screen is the operating instruction; and if the received instruction is an operation instruction, acquiring the operation instruction.
After the operation instruction is obtained, the displacement size and the displacement direction for triggering the operation instruction are obtained, wherein the displacement size is used for determining the length of the interval to be adjusted, and the displacement direction is used for indicating that the starting point of the interval to be determined is determined after the time length corresponding to the interval to be determined is determined.
It should be noted that, when determining the interval to be adjusted, the following manners may be adopted, but not limited to: calling a preset interface function, and determining the displacement size generated by the operation instruction, wherein the determined displacement size is a directed line segment from an initial position corresponding to the operation instruction to a final position corresponding to the operation instruction, and the length of the directed line segment is a numerical value, so that the displacement size needs to be converted into a time length related to the target audio. In the embodiment of the invention, when the interval to be adjusted is determined, firstly, the time length corresponding to the displacement size can be calculated according to the positive correlation between the displacement size and the length of the target audio progress bar; the length of the progress bar is used for representing the duration of the target audio; secondly, acquiring the current playing time of the target audio; and finally, determining the interval to be adjusted according to the current playing time of the target audio and the duration corresponding to the displacement.
In practical applications, the playing time lengths corresponding to different target audios may be different, but the length of the progress bar is kept constant, when the playing time length of the target audio is shorter, the percentage corresponding to the progress bar per minute is larger, and when the playing time length of the target audio is longer, the percentage corresponding to the progress bar per minute may be smaller. According to the embodiment of the invention, through the positive correlation between the displacement and the length of the target audio progress bar, the problem that the same interval to be adjusted occurs in different target audio durations on the premise of the same displacement in actual operation is ensured.
Before executing an operation instruction for adjusting the playing speed of the target audio, configuring the preset operation instruction, presetting a touch rule to map the operation instruction, and determining that the operation instruction triggered by a user is the operation instruction for fast playing or slow playing the target audio when the operation instruction received by the client is consistent with the preset operation instruction; and before the playing speed of the target audio is adjusted according to the speed grade, configuring the corresponding relation between the speed grade and the playing speed.
For example, as shown in table 1, table 1 shows a mapping relationship list between a preset operation instruction and a speech rate level according to an embodiment of the present invention, where the table illustrates a relationship between the preset operation instruction and the speech rate level when a target audio is played slowly or played quickly. Table 2 shows a mapping list of the speech rate levels and the number of words played, where the number of words broadcast per minute in table 2 is only an exemplary example, and is not intended to limit the normal speech rate of the target audio and the number of words after adjusting the speech rate. It should be noted that tables 1 and 2 are only exemplary examples, and the description form between the preset operation instruction, the speech rate level, and the number of words played in the embodiment of the present invention is not limited.
TABLE 1
Figure BDA0001248668990000071
TABLE 2
Normal speech rate Speech rate level 1 Speech rate level 2 Speech rate level 3 Speech rate level 3
300 words/minute 200 words/minute 260 words/minute 350 word/minute 400 words/minute
It is emphasized again that tables 1 and 2 are merely exemplary distances, and the distances of the speech rate levels shown in the tables are not meant to be specific limitations of the embodiments of the present invention, and in practical applications, there may be more speech rate levels, or fewer speech rate levels, and there may also be more or less speech rates of the target audio than 300 words/minute, and in particular, the embodiments of the present invention are not limited.
The embodiment of the present invention aims to fast forward or slow down a target audio by increasing or decreasing the speech rate of the target audio, and further understand the summary of the content during fast forward or slow down of the target audio, so the following will describe in detail the playing of the target video in a section to be adjusted. As shown in fig. 2, fig. 2 is a schematic diagram illustrating a displacement direction generated by an operation instruction according to an embodiment of the present invention; taking an operation instruction as an example of dragging, point a is a starting point of a user operation, and in the first case: when the user performs the dragging from a to B (L3), the displacement direction generated by the corresponding operation instruction is rightward, in this case, the target audio in the interval to be adjusted is played according to the speech rate level with the second time point (the current playing time of the target audio) as the playing starting point. In the second case: when the user performs the drag operations a to E (L4), the generated displacement direction is not parallel to the progress bar, in which case the default is the user's right drag operation, i.e. as shown in fig. 2, a vertical line of the AB line segment is made, and the displacement direction pointing to the right side of the vertical line is determined as the displacement direction of the operation instruction being the right side.
Similarly, in the third case, when the user performs the dragging from a to C (L1), the line segment AC is located on the left side of the vertical line, and the direction of the displacement from a to C is determined to be left, in this case, after playing the audio of the plaque, the target audio in the interval to be adjusted is played according to the speech rate level with a first time point as a playing starting point, where the first time point is a time point obtained by subtracting a duration corresponding to the displacement from the current playing time. For convenience of understanding, it is assumed that, for example, the interval to be adjusted, which is determined by the above method according to the time length corresponding to the displacement size, is 10min at the point a (the current target audio playing time is 50:00), and then, when the target audio playing is performed, the target audio is played from 40:00 of the target audio in combination with the speech rate level of the playing speech rate determined by the above method. The above are merely exemplary, and specific numerical values are not limited.
The above embodiment is described by taking an example in which the operation instruction is triggered by a drag method, and the following description will be given by taking an example in which the operation instruction is triggered by a rotation operation. For example, as shown in fig. 3, fig. 3 shows an operation schematic diagram of a user in a terminal touch screen according to an embodiment of the present invention, when a playing interface of a target audio is displayed in a current screen, a client monitors an operation instruction, the client obtains a rotation direction included in the operation instruction by 2 turns, and the rotation direction is rightward, so that a speech rate level 2 for adjusting a playing speech rate of the target audio in a to-be-adjusted interval L can be determined, and the playing of the target audio can be executed from a current playing time.
The two-point touch operation is also briefly described. Illustratively, as shown in fig. 4, a user operates the touch screen by two fingers, wherein one finger is located at point a, point a is fixed, and from the point a adjacent to the other finger, the other finger slides to point B in parallel to the right, and the client acquires that the displacement between point a and point B corresponding to the operation instruction is L1, so that the interval to be adjusted T1 corresponding to L1 can be determined, and the sliding speed is V1, so that the target audio in the interval to be adjusted T1 is played according to the speech rate level corresponding to V1 from the current playing time point.
In practical applications, when the preset operation instruction is a shake terminal operation, the preset condition of shake needs to be limited, which is used for distinguishing accidental shake from shake motion intended by the user, or can also be used for distinguishing shake motion for other operation functions from shake motion for data acquisition in the embodiment. The preset operation instruction condition may be set by the user through shaking of the mobile phone in the setting stage, for example, the preset condition may be set to "shake once", "shake twice" or "shake three times" or the like.
In practical application, when the terminal shakes, the terminal can monitor shaking actions through the motion sensor, the motion sensor can be but is not limited to a gravity acceleration sensor, and when the mobile phone shakes, the sensor can detect the generation of acceleration and can obtain the direction of the acceleration. When the preset conditions are set, the mobile phone can set the preset conditions according to shaking data (whether acceleration is generated, the magnitude and direction of the acceleration and the like) returned by the sensor. In a feasible scheme of this embodiment, since the gravitational acceleration sensor can detect not only whether the gravitational acceleration is generated, but also the magnitude of the gravitational acceleration, the mobile phone can also distinguish the intensity of the shaking of the user according to the obtained specific shaking data, thereby enriching the number of shaking instructions. Generally, the gravity acceleration generated by a user shaking the mobile phone lightly is smaller than that generated by shaking the mobile phone vigorously, when a preset operation instruction is set, the mobile phone can divide the magnitude of the gravity acceleration into intervals, the user is allowed to set the shaking action of the preset operation instruction to be stronger, and the shaking action of the other function is set to be weaker, so that two different operations are distinguished. Along with the continuous increase of the functions of the mobile phone, limited shaking operation types cannot meet the increasing functions, in the scheme, the mobile phone can develop a new method in the dimension of the gravity acceleration (the strength of shaking in the user using layer) and combines the existing shaking operation to expand the quantity of the shaking operation in multiples, so that the operation mode of the mobile phone is enriched.
In practical application, when a user selects a terminal or an application program, consideration of selection also depends on the use experience and convenience of the user, and the application program can be operated by presetting an operation instruction (an operation instruction triggered by a gesture instruction) in the embodiment, so that the diversity of operation can be enhanced, and the use experience of the user is enhanced.
(2) And the specific physical keyboard of the terminal acquires an operation instruction for adjusting the playing speed of the target audio.
When the target audio is played in the terminal, monitoring a physical keyboard in the terminal, and receiving an operation instruction for adjusting the playing speed triggered based on the physical key; matching the operation instruction with a preset operation instruction; and if the operation instruction is consistent with the preset operation instruction, acquiring the operation instruction.
Before executing the operation instruction for adjusting the playing speech rate of the target audio, one or more physical keys may be preset as the triggered operation instruction, and taking preset one physical key as an example, a power key, a confirmation key or an external key may be preset as the preset operation instruction. The speech rate grade corresponding to the preset operation instruction can also be determined by the duration of triggering the physical key. For example, the duration of triggering the physical key may be set to 2 seconds corresponding to the speech rate level 1, the duration of triggering the physical key may be set to 4 seconds corresponding to the speech rate level 2, the duration of triggering the physical key may be set to 5 seconds corresponding to the speech rate level 3, and the like.
In addition, the client may also obtain the operation instruction based on various sensors built in or external to the terminal, for example, a microphone, and the like.
In an implementation manner of this embodiment, the terminal may collect an acoustic signal sent by the user through the microphone, and when the user sends a preset voice instruction (for example, some keywords), the terminal obtains the operation instruction and sends the operation instruction to the client. The preset voice command can be set by the user according to the preference of the user, and for example, the user can set the voice command to be "fast forward one level", "slow down one level", and the like. The present embodiment does not limit the carrying form and the obtaining manner of the operation instruction.
In another embodiment of the present invention, the client may further monitor a light sensor built in the terminal, and a light sensor is generally disposed on the front of the terminal near the earpiece, and is used to detect the intensity of ambient light outside the terminal, so as to adjust the screen brightness. In this embodiment, the mobile phone may recognize the operation gesture of the user by using the light sensor, and when the user slides across the terminal screen with a palm or covers the mobile phone screen, the terminal recognizes that the external light intensity is smaller than the preset brightness threshold, so as to obtain the operation instruction, and sends the operation instruction to the client.
Further, as an implementation of the method shown in fig. 1, another embodiment of the present invention further provides an apparatus for playing audio data. The embodiment of the apparatus corresponds to the embodiment of the method, and for convenience of reading, details in the embodiment of the apparatus are not repeated one by one, but it should be clear that the apparatus in the embodiment can correspondingly implement all the contents in the embodiment of the method.
An embodiment of the present invention further provides a device for playing audio data, as shown in fig. 5, including:
the first obtaining unit 21 is configured to obtain an operation instruction for adjusting a playing speech rate of a target audio, where the operation instruction includes a to-be-adjusted interval of the target audio and a speech rate level of the playing speech rate;
the adjusting unit 22 is configured to adjust the playing speech rate of the target audio in the interval to be adjusted according to the speech rate level acquired by the first acquiring unit 21;
a restoring unit 23, configured to restore the normal playing speed of the target audio after the target audio adjusted by the adjusting unit 22 is played.
Further, as shown in fig. 6, the apparatus further includes:
a first determining unit 24, configured to determine, after the first obtaining unit 21 obtains an operation instruction for adjusting a playing speed of a target audio, a displacement size generated by triggering the operation instruction;
a calculating unit 25, configured to calculate a duration corresponding to the displacement according to the positive correlation between the displacement determined by the first determining unit 24 and the length of the target audio progress bar; the length of the progress bar is used for representing the duration of the target audio;
a second obtaining unit 26, configured to obtain a current playing time of the target audio;
a second determining unit 27, configured to determine the interval to be adjusted according to the duration corresponding to the current playing time and the displacement size acquired by the second acquiring unit 26.
Further, as shown in fig. 6, the apparatus further includes:
a third obtaining unit 28, configured to obtain a displacement direction generated by the operation instruction before the restoring unit 23 restores the normal playing speech speed of the target audio after the target audio to be adjusted is played;
a first playing unit 29, configured to play the target audio in the interval to be adjusted according to the speech rate level by using a first time point as a playing start point when the displacement direction acquired by the third acquiring unit 28 is leftward, where the first time point is a time point obtained by subtracting a duration corresponding to the displacement from the current playing time;
a second playing unit 210, configured to play the target audio in the interval to be adjusted according to the speech rate level with a second time point as a playing start point when the displacement direction acquired by the third acquiring unit 28 is rightward, where the second time point is the current playing time.
The operation instructions comprise dragging, rotating operation and multi-point touch operation instructions;
if the operation instruction is triggered by dragging operation, determining the speech rate grade of the playing speech rate according to the dragging speed;
if the operation instruction is triggered by rotation operation, determining the speech rate grade of the playing speech rate according to the rotation speed or the rotation number;
and if the operation instruction is triggered by multi-point touch operation, determining the speech rate grade of the playing speech rate according to the sliding speed of one touch point.
Further, as shown in fig. 6, if the terminal has a touch screen, the first obtaining unit 21 includes:
the display module 211 is configured to display a playing interface of the target audio based on the touch screen;
a monitoring module 212, configured to monitor the touch screen;
the verification module is used for verifying whether the instruction triggered based on the touch screen is the operation instruction;
the obtaining module 213 is configured to obtain the operation instruction when the verification module verifies that the instruction triggered by the touch screen is the operation instruction.
The audio data playing device provided by the embodiment of the invention obtains an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and the speed grade of the playing speed; adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level; after the adjusted target audio is played, recovering the normal playing speed of the target audio; compared with the prior art, when the target audio is adjusted, the method and the device realize the adjustment by adjusting the playing speed in the interval to be adjusted instead of skipping the audio content in the fast forwarding or slow down process, so that the user can not only realize the fast forwarding or slow down of the target audio, but also know the brief description of the fast forwarding or slow down content.
In the foregoing embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
It will be appreciated that the relevant features of the method and apparatus described above are referred to one another. In addition, "first", "second", and the like in the above embodiments are for distinguishing the embodiments, and do not represent merits of the embodiments.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components in the method and apparatus for playing back audio data according to embodiments of the present invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

Claims (10)

1. A method for playing audio data, comprising:
acquiring an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and a speed grade of the playing speed, and the speed grade is determined by the dragging speed or the speed grade is determined by the number of rotating circles;
adjusting the playing speed of the target audio in the interval to be adjusted according to the speed level;
acquiring a displacement direction generated by the operation instruction;
if the displacement direction is leftward, playing the target audio in the interval to be adjusted according to the speech rate level by taking a first time point as a playing starting point, wherein the first time point is obtained by subtracting a time length corresponding to the displacement generated by triggering the operation instruction from the current playing time of the target audio;
and after the adjusted target audio is played, restoring the normal playing speed of the target audio.
2. The method according to claim 1, wherein after obtaining the operation instruction for adjusting the playing speech rate of the target audio, the method further comprises:
determining the displacement size generated by triggering the operation instruction;
calculating the time length corresponding to the displacement according to the positive correlation between the displacement and the length of the target audio progress bar; the length of the progress bar is used for representing the duration of the target audio;
acquiring the current playing time of the target audio;
and determining the interval to be adjusted according to the duration corresponding to the current playing time and the displacement.
3. The method according to claim 2, wherein after the target audio to be adjusted is played, before resuming the normal playing speech rate of the target audio, the method further comprises:
and if the displacement direction is rightward, playing the target audio in the interval to be adjusted according to the speech speed grade by taking a second time point as a playing starting point, wherein the second time point is the current playing time.
4. The method according to claim 3, wherein the operation instruction comprises a drag, rotation operation, multi-touch operation instruction;
if the operation instruction is triggered by dragging operation, determining the speech rate grade of the playing speech rate according to the dragging speed;
if the operation instruction is triggered by rotation operation, determining the speech rate grade of the playing speech rate according to the rotation speed or the rotation number;
and if the operation instruction is triggered by multi-point touch operation, determining the speech rate grade of the playing speech rate according to the sliding speed of one touch point.
5. The method according to any one of claims 1 to 4, wherein if the terminal has a touch screen, the obtaining of the operation instruction for adjusting the playing speed of the target audio comprises:
displaying a playing interface of the target audio based on the touch screen;
monitoring the touch screen and verifying whether an instruction triggered based on the touch screen is the operating instruction;
and if so, acquiring the operation instruction.
6. An apparatus for playing audio data, comprising:
the first obtaining unit is used for obtaining an operation instruction for adjusting the playing speed of the target audio, wherein the operation instruction comprises a to-be-adjusted interval of the target audio and a speed grade of the playing speed, and the speed grade is determined by the dragging speed or the speed grade is determined by the number of rotating turns;
the adjusting unit is used for adjusting the playing speed of speech of the target audio in the interval to be adjusted according to the speed of speech grade acquired by the first acquiring unit;
a third obtaining unit, configured to obtain a displacement direction generated by the operation instruction;
a first playing unit, configured to play the target audio within the interval to be adjusted according to the speech rate level by using a first time point as a playing start point when the displacement direction acquired by the third acquiring unit is leftward, where the first time point is a time point obtained by subtracting a time length corresponding to a displacement generated by triggering the operation instruction from a current playing time of the target audio;
and the restoring unit is used for restoring the normal playing speed of the target audio after the target audio adjusted by the adjusting unit is played.
7. The apparatus of claim 6, further comprising:
the first determining unit is used for determining the displacement generated by triggering the operation instruction after the first acquiring unit acquires the operation instruction for adjusting the playing speed of the target audio;
the calculating unit is used for calculating the time length corresponding to the displacement according to the positive correlation relationship between the displacement determined by the first determining unit and the length of the target audio progress bar; the length of the progress bar is used for representing the duration of the target audio;
the second acquisition unit is used for acquiring the current playing time of the target audio;
and the second determining unit is used for determining the interval to be adjusted according to the duration corresponding to the current playing time and the displacement acquired by the second acquiring unit.
8. The apparatus of claim 7, further comprising:
and the second playing unit is used for playing the target audio in the interval to be adjusted according to the speech speed grade by taking a second time point as a playing starting point when the displacement direction acquired by the third acquiring unit is towards the right, wherein the second time point is the current playing time.
9. The apparatus according to claim 8, wherein the operation command comprises a drag, rotation, multi-touch operation command;
if the operation instruction is triggered by dragging operation, determining the speech rate grade of the playing speech rate according to the dragging speed;
if the operation instruction is triggered by rotation operation, determining the speech rate grade of the playing speech rate according to the rotation speed or the rotation number;
and if the operation instruction is triggered by multi-point touch operation, determining the speech rate grade of the playing speech rate according to the sliding speed of one touch point.
10. The apparatus according to any one of claims 6-9, wherein if the terminal has a touch screen, the first obtaining unit comprises:
the display module is used for displaying a playing interface of the target audio based on the touch screen;
the monitoring module is used for monitoring the touch screen;
the verification module is used for verifying whether the instruction triggered based on the touch screen is the operation instruction;
and the obtaining module is used for obtaining the operating instruction when the verifying module verifies that the instruction triggered by the touch screen is the operating instruction.
CN201710161455.5A 2017-03-17 2017-03-17 Audio data playing method and device Active CN106921802B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710161455.5A CN106921802B (en) 2017-03-17 2017-03-17 Audio data playing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710161455.5A CN106921802B (en) 2017-03-17 2017-03-17 Audio data playing method and device

Publications (2)

Publication Number Publication Date
CN106921802A CN106921802A (en) 2017-07-04
CN106921802B true CN106921802B (en) 2021-01-22

Family

ID=59461223

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710161455.5A Active CN106921802B (en) 2017-03-17 2017-03-17 Audio data playing method and device

Country Status (1)

Country Link
CN (1) CN106921802B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107992195A (en) * 2017-12-07 2018-05-04 百度在线网络技术(北京)有限公司 A kind of processing method of the content of courses, device, server and storage medium
CN112333336B (en) * 2020-10-26 2022-05-17 维沃移动通信(深圳)有限公司 Audio editing method and device, electronic equipment and storage medium
CN113821188A (en) * 2021-08-25 2021-12-21 深圳市声扬科技有限公司 Method and device for adjusting audio playing speed, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770795A (en) * 2009-01-05 2010-07-07 联想(北京)有限公司 Computing device and video playing control method
CN102681687A (en) * 2011-03-16 2012-09-19 新奥特(北京)视频技术有限公司 Operation method and system for quickly browsing video
CN103116467A (en) * 2013-03-07 2013-05-22 东蓝数码股份有限公司 Video progress and volume control method based on multi-point touch control
CN103558969A (en) * 2013-10-28 2014-02-05 华为技术有限公司 Method and device for adjusting play

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105205083A (en) * 2014-06-27 2015-12-30 国际商业机器公司 Method and equipment for browsing content by means of key points in progress bar

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770795A (en) * 2009-01-05 2010-07-07 联想(北京)有限公司 Computing device and video playing control method
CN102681687A (en) * 2011-03-16 2012-09-19 新奥特(北京)视频技术有限公司 Operation method and system for quickly browsing video
CN103116467A (en) * 2013-03-07 2013-05-22 东蓝数码股份有限公司 Video progress and volume control method based on multi-point touch control
CN103558969A (en) * 2013-10-28 2014-02-05 华为技术有限公司 Method and device for adjusting play

Also Published As

Publication number Publication date
CN106921802A (en) 2017-07-04

Similar Documents

Publication Publication Date Title
KR102054633B1 (en) Devices, methods, and graphical user interfaces for wireless pairing with peripherals and displaying status information about the peripherals
US20230127228A1 (en) Identifying applications on which content is available
CN110275664B (en) Apparatus, method and graphical user interface for providing audiovisual feedback
US10642574B2 (en) Device, method, and graphical user interface for outputting captions
CN109905754B (en) Virtual gift receiving method and device and storage equipment
CN107613131B (en) Application program disturbance-free method, mobile terminal and computer-readable storage medium
US9641471B2 (en) Electronic device, and method and computer-readable recording medium for displaying message in electronic device
CN108632658B (en) Bullet screen display method and terminal
CN105979312B (en) Information sharing method and device
KR20160026317A (en) Method and apparatus for voice recording
US20220391060A1 (en) Methods for displaying and providing multimedia resources
CN110740262A (en) Background music adding method and device and electronic equipment
US20190318169A1 (en) Method for Generating Video Thumbnail on Electronic Device, and Electronic Device
CN108616771B (en) Video playing method and mobile terminal
KR102090948B1 (en) Apparatus saving conversation and method thereof
CN110830368B (en) Instant messaging message sending method and electronic equipment
CN106921802B (en) Audio data playing method and device
EP4350535A2 (en) Content item module arrangements
CN105871696B (en) Information sending and receiving method and mobile terminal
CN112214112A (en) Parameter adjusting method and device
CN107728877B (en) Application recommendation method and mobile terminal
CN110798327B (en) Message processing method, device and storage medium
CN110109730B (en) Apparatus, method and graphical user interface for providing audiovisual feedback
CN108710521B (en) Note generation method and terminal equipment
WO2016206642A1 (en) Method and apparatus for generating control data of robot

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant