CN110636369A

CN110636369A - Multimedia file playing method and mobile terminal

Info

Publication number: CN110636369A
Application number: CN201910929797.6A
Authority: CN
Inventors: 罗征武
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2019-09-27
Filing date: 2019-09-27
Publication date: 2019-12-31

Abstract

The invention discloses a multimedia file playing method and a mobile terminal. The method comprises the following steps: performing sentence segmentation on the audio in the multimedia file according to a first preset condition; displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and, when a selection operation of the user on a mark is received, playing the multimedia file at the position corresponding to the mark. Because the user can select a mark when changing the playing progress of the multimedia file, and each mark corresponds to the starting position or the ending position of a sentence in the audio, the user can position the playing progress accurately, which reduces the complexity of the user's operation and effectively improves the user experience.

Description

Multimedia file playing method and mobile terminal

Technical Field

The invention relates to the technical field of audio processing, in particular to a multimedia file playing method and a mobile terminal.

Background

Various multimedia applications, such as audio players and video players, may be installed on existing mobile terminals. These applications can play multimedia files and allow the user to change the playing progress during playback, for example by replaying a passage or jumping to another position.

Generally, the user changes the playing progress of a multimedia file by dragging the progress bar. In practice, however, dragging the progress bar is imprecise, and the user may need to drag it several times to reach the desired playing position, which degrades the user experience.

Disclosure of Invention

The embodiment of the invention provides a playing method of a multimedia file and a mobile terminal, and aims to solve the problem in the prior art that, when a user changes the playing progress of a multimedia file, it is difficult to accurately locate the position to be played.

In order to solve the technical problem, the invention is realized as follows:

in a first aspect, a method for playing a multimedia file is provided, where the method includes:

performing sentence segmentation on the audio in the multimedia file according to a first preset condition;

displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;

and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.

In a second aspect, a terminal device is provided, which includes:

the sentence segmentation module is used for carrying out sentence segmentation on the audio in the multimedia file according to a first preset condition;

the display module displays at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;

and the playing module plays the multimedia file at the position corresponding to the mark when receiving the selection operation of the user on the mark.

In a third aspect, a terminal device is provided, the terminal device comprising a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the method according to the first aspect.

In a fourth aspect, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the method according to the first aspect.

In the embodiment of the invention, the audio in the multimedia file can be segmented into sentences according to a first preset condition, and at least one mark is displayed according to the segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio. The user can therefore select a mark when changing the playing progress, and the mobile terminal can play the multimedia file at the position corresponding to the mark after receiving the selection operation of the user. Because each mark corresponds to the starting position or the ending position of a sentence in the audio, the user can position the playing progress accurately, which reduces the complexity of the user's operation and effectively improves the user experience.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and not to limit the invention. In the drawings:

FIG. 1 is a flow chart illustrating a method for playing a multimedia file according to an embodiment of the present invention;

FIG. 2 is a diagram illustrating a method for playing a multimedia file according to an embodiment of the present invention;

FIG. 3 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention;

FIG. 4 is a schematic diagram of a hardware structure of a mobile terminal implementing various embodiments of the present invention.

Detailed Description

Generally, when a user plays a multimedia file using a multimedia application installed in a mobile terminal, the user may change the playing progress of the multimedia file by dragging a progress bar, for example to replay a passage or to jump to another position.

For example, when the mobile terminal plays an audio file for listening practice, the lower part of the playing interface may show a pause/play button, a button for switching from the currently played audio file to the previous audio file, a button for switching from the currently played audio file to the next audio file, and a play progress bar showing the current playing progress. When the user wants to replay a certain section of the audio, the user can repeatedly drag the progress bar displayed in the playing interface until the current playing progress reaches the desired position, and can then concentrate on listening to that section of audio.

However, in practical applications, dragging the progress bar is imprecise, and the user needs to drag the progress bar repeatedly to reach the desired progress position. This wastes time and increases the complexity of the user's operation, thereby degrading the user experience. In addition, if the user does not know the playing progress corresponding to the multimedia content that the user wants to play, the user cannot adjust the playing progress by dragging the progress bar.

In order to solve the above technical problem, an embodiment of the present invention provides a method for playing a multimedia file and a mobile terminal, where the method includes: performing sentence segmentation on the audio in the multimedia file according to a first preset condition; displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.

Therefore, when the user changes the playing progress, the user can select a mark, and after the mobile terminal receives the selection operation of the user, the multimedia file can be played at the position corresponding to the mark. Because each mark corresponds to the starting position or the ending position of a sentence in the audio, the user can position the playing progress accurately, which reduces the complexity of the user's operation and effectively improves the user experience.

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The technical solutions provided by the embodiments of the present invention are described in detail below with reference to the accompanying drawings.

Fig. 1 is a flowchart illustrating a method for playing a multimedia file according to an embodiment of the present invention. The method is as follows.

S102: Perform sentence segmentation on the audio in the multimedia file according to a first preset condition.

In S102, the mobile terminal may automatically perform sentence segmentation on the audio in the multimedia file according to a first preset condition in the process of playing the multimedia file.

Optionally, in the process of playing the multimedia file, the sentence segmentation of the audio in the multimedia file may instead be triggered by the user. The specific implementation is as follows:

When a user wants to change the playing progress of the multimedia file while playing it with the multimedia application, the user can perform a preset operation in the playing interface of the multimedia file, where the preset operation is used for starting a sentence segmentation mode.

In this embodiment, the user may perform the preset operation in at least three ways:

First way: during playback of the multimedia file, a pause/play button and a button for starting the sentence segmentation mode can be displayed in the playing interface of the multimedia file, and when the user clicks the pause/play button and then the button for starting the sentence segmentation mode, the user can be regarded as having performed the preset operation.

Second way: during playback of the multimedia file, a button for starting the sentence segmentation mode can be displayed in the playing interface of the multimedia file, and when the user clicks this button, the user can be regarded as having performed the preset operation.

Optionally, the button for starting the sentence segmentation mode may also be displayed in an option interface of the multimedia application, which is not specifically limited herein.

Third way: the user can, in advance, use the setting interface of the multimedia application to configure the pause button as the button for starting the sentence segmentation mode, so that when the user clicks the pause button during playback of the multimedia file, the user can be regarded as having performed the preset operation.

After the user performs the preset operation, the mobile terminal receives it. At this point, the mobile terminal can automatically pause the currently played multimedia file and start the sentence segmentation mode, and once the sentence segmentation mode is started, it performs sentence segmentation on the audio in the multimedia file.
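The three ways of performing the preset operation can be illustrated with a minimal sketch. The class name `PlayInterface`, the trigger names, and the button identifiers `"pause"` and `"segment"` are hypothetical and do not come from the embodiment; the sketch only shows one possible way to decide when a tap counts as the preset operation and to pause playback when the sentence segmentation mode starts.

```python
class PlayInterface:
    """Toy model of a playing interface (all names are illustrative assumptions)."""

    TRIGGERS = ("pause_then_button", "button", "pause_as_button")

    def __init__(self, trigger="button"):
        if trigger not in self.TRIGGERS:
            raise ValueError(f"unknown trigger: {trigger}")
        self.trigger = trigger
        self.playing = True
        self.segmentation_mode = False
        self._last_tap = None

    def tap(self, button):
        """Handle a tap; return True if it completes the preset operation."""
        if self.trigger == "pause_then_button":
            # First way: pause/play button, then the segmentation button.
            fired = button == "segment" and self._last_tap == "pause"
        elif self.trigger == "button":
            # Second way: the segmentation button alone.
            fired = button == "segment"
        else:
            # Third way: the pause button was configured as the segmentation button.
            fired = button == "pause"

        if button == "pause" and not fired:
            self.playing = not self.playing  # ordinary pause/play toggle
        self._last_tap = button

        if fired:
            # Pause the current file and start the sentence segmentation mode.
            self.playing = False
            self.segmentation_mode = True
        return fired
```

In this sketch, the actual segmentation would be started wherever `segmentation_mode` becomes true; that step is left out because it depends on the chosen first preset condition.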

In this embodiment, when performing sentence segmentation, the mobile terminal may by default segment the audio in the entire multimedia file, or may by default segment only the audio within a preset progress range. In addition, the sentence segmentation range of the audio in the multimedia file may also be preset by the user; for example, the user may preset that sentence segmentation is performed on the audio whose playing progress is between 1 minute and 2 minutes. This is not specifically limited in this embodiment.

It should be noted that when the mobile terminal performs sentence segmentation on the audio within a preset progress range (which may be a range preset by the user or a default range of the mobile terminal), the audio corresponding to the preset progress range may be a section of audio before the current playing progress, or a section of audio after the current playing progress.
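As a rough illustration of how the segmentation range could be chosen, the sketch below returns a (start, end) range in seconds. The function name, its parameters, and the priority order (user-preset range first, then a span around the current progress, then the whole file) are assumptions made for the example, not details given by the embodiment.

```python
def segmentation_range(total_s, current_s=0.0, span_s=None,
                       direction="before", user_range=None):
    """Return the (start, end) range, in seconds, to be segmented.

    user_range: optional (start, end) preset by the user, e.g. (60, 120).
    span_s:     optional length of a default range next to the current progress.
    direction:  "before" or "after" the current playing progress.
    """
    if user_range is not None:
        start, end = user_range
        return (max(0, start), min(total_s, end))  # clamp to the file length
    if span_s is None:
        return (0, total_s)  # default: the entire multimedia file
    if direction == "before":
        return (max(0, current_s - span_s), current_s)
    if direction == "after":
        return (current_s, min(total_s, current_s + span_s))
    raise ValueError(f"unknown direction: {direction}")
```

For a 300-second file, `segmentation_range(300, user_range=(60, 120))` corresponds to the example of segmenting the audio between 1 minute and 2 minutes.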

In this embodiment, the mobile terminal may perform sentence segmentation on the audio in the multimedia file according to the first preset condition in the following three ways:

First implementation:

In the first implementation, the first preset condition may include at least one of the following: the pause duration corresponding to a pause position of the audio during playing is greater than or equal to a preset duration; the part of speech of the first word after the pause position is a noun. The preset duration can be determined according to the actual situation. For example, the preset duration can be set to 2 s. Alternatively, the interval between each pair of adjacent phonemes in the audio can be measured to obtain the average phoneme interval of the whole audio. Because the interval between phonemes within the same sentence is short, while the interval between the last phoneme of one sentence and the first phoneme of the next sentence is longer, the average phoneme interval, or the average phoneme interval multiplied by a certain multiple, can be used as the preset duration.

When the audio is segmented into sentences according to the first preset condition, the mobile terminal can analyze the audio in the currently played multimedia file and determine the pause positions of the audio during playing. The audio may contain one or more pause positions; the following description takes one pause position as an example.

After the pause position is determined, it can be judged whether the pause duration corresponding to the pause position and the part of speech of the first word after the pause position meet the first preset condition. If so, the segmentation position of the audio can be determined according to the first preset condition, that is, the pause position meeting the first preset condition is determined as a segmentation position of the audio, and the audio is segmented at that position.

When the mobile terminal performs sentence segmentation on the audio based on the segmentation positions, one segmentation position may correspond to the start position or the end position of one sentence, and if the segmentation positions are multiple, the audio between two adjacent segmentation positions may correspond to one sentence.
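The first implementation can be sketched as follows, assuming that pause positions, pause durations, phoneme timings, and part-of-speech tags have already been produced by some upstream speech analysis that is outside the scope of this sketch. The `Pause` class, the `"NOUN"` tag, and the `use_duration` / `use_noun` switches are hypothetical names introduced only for illustration; requiring every enabled condition to hold is one possible reading of "at least one of the following".

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Pause:
    position: float          # seconds into the audio where the pause occurs
    duration: float          # length of the pause in seconds
    next_word_pos: str = ""  # assumed part-of-speech tag of the first word after the pause


def preset_duration_from_phonemes(phonemes, multiple=1.0):
    """Average gap between adjacent phonemes, scaled by `multiple`.

    `phonemes` is a list of (start, end) times in seconds, in playback order.
    """
    gaps = [nxt[0] - cur[1] for cur, nxt in zip(phonemes, phonemes[1:])]
    return mean(gaps) * multiple


def segmentation_positions(pauses, preset_duration,
                           use_duration=True, use_noun=False):
    """Return the pause positions that satisfy every enabled condition."""
    positions = []
    for pause in pauses:
        checks = []
        if use_duration:
            checks.append(pause.duration >= preset_duration)
        if use_noun:
            checks.append(pause.next_word_pos == "NOUN")
        if checks and all(checks):
            positions.append(pause.position)
    return positions


def sentences(positions):
    """Audio between two adjacent segmentation positions is one sentence."""
    return list(zip(positions, positions[1:]))
```

A fixed threshold such as 2 s can be passed directly as `preset_duration` instead of calling `preset_duration_from_phonemes`.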

Second implementation:

In the second implementation, the first preset condition may include: the pause duration corresponding to a pause position of the audio during playing is greater than or equal to a preset duration, and the loudness of the audio after the pause position meets a second preset condition. The preset duration may be determined according to actual conditions, and the second preset condition may include at least one of the following: the audio loudness is greater than or equal to a preset loudness; the audio loudness gradually increases from an extremely low value to a peak value.

After the pause position is determined, it can be judged whether the pause duration corresponding to the pause position is greater than or equal to the preset duration and whether the loudness of the audio after the pause position meets the second preset condition. If so, the first preset condition is met, and the segmentation position of the audio can be determined accordingly, that is, the pause position meeting the first preset condition is determined as a segmentation position of the audio, and the audio is segmented at that position.
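A minimal sketch of the second implementation is given below. It assumes that the audio has already been reduced to a per-frame loudness envelope (a list of numbers, one per frame of `frame_s` seconds); the silence level used to detect pauses and the look-ahead `window` are illustrative parameters that the embodiment does not specify.

```python
def find_pauses(loudness, silence_level):
    """Return (start, end) frame indices for each run of frames below silence_level.

    `end` is the index of the first frame after the pause. Trailing silence is
    ignored because no audio follows it.
    """
    pauses, start = [], None
    for i, value in enumerate(loudness):
        if value < silence_level:
            if start is None:
                start = i
        elif start is not None:
            pauses.append((start, i))
            start = None
    return pauses


def split_positions(loudness, frame_s, silence_level,
                    preset_duration, preset_loudness, window=5):
    """Pause positions (seconds) that meet the duration and loudness conditions."""
    positions = []
    for start, end in find_pauses(loudness, silence_level):
        if (end - start) * frame_s < preset_duration:
            continue  # pause too short
        # Condition A: loudness after the pause reaches the preset loudness.
        loud_enough = max(loudness[end:end + window]) >= preset_loudness
        # Condition B: loudness rises without dipping from the low value at the
        # end of the pause up to the peak inside the look-ahead window.
        seg = loudness[end - 1:end - 1 + window]
        peak = seg.index(max(seg))
        rising = peak > 0 and all(seg[k] <= seg[k + 1] for k in range(peak))
        if loud_enough or rising:
            positions.append(start * frame_s)
    return positions
```

Because the second preset condition only requires at least one of the two loudness checks, the sketch accepts a pause when either check passes.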

Third implementation:

In the third implementation, before performing sentence segmentation on the audio according to the first preset condition, the mobile terminal may analyze the audio in the currently played multimedia file and determine the extremely low values of the audio loudness in the multimedia file, where there may be one or more extremely low values.

After the extremely low values are determined, they can be used as the first preset condition, and the audio is segmented into sentences based on them. When the mobile terminal performs sentence segmentation on the audio based on the extremely low values, one extremely low value can correspond to the starting position or the ending position of one sentence, and if there are multiple extremely low values, the audio between two adjacent extremely low values can correspond to one sentence.
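The third implementation can be sketched as follows, under the assumption that an "extremely low value" is a local minimum of a per-frame loudness envelope. This interpretation, the function names, and the frame length `frame_s` are assumptions of the example; flat valleys (several equal frames in a row) are not treated as minima in this simple version.

```python
def local_minima(loudness):
    """Indices of frames that are strictly quieter than both neighbours."""
    return [i for i in range(1, len(loudness) - 1)
            if loudness[i] < loudness[i - 1] and loudness[i] < loudness[i + 1]]


def sentences_from_minima(loudness, frame_s):
    """Each pair of adjacent minima bounds one sentence, as (start, end) seconds."""
    times = [i * frame_s for i in local_minima(loudness)]
    return list(zip(times, times[1:]))
```

In practice the loudness envelope would usually be smoothed first, since raw audio tends to contain many small local minima inside a single sentence.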

S104: Display at least one mark according to the sentence segmentation result, wherein one mark corresponds to the starting position or the ending position of one sentence in the audio.

In S104, after performing sentence segmentation on the audio in the multimedia file, the mobile terminal may display at least one mark according to the result of the sentence segmentation, so that the user can change the playing progress based on the mark. Each mark may correspond to the starting position or the ending position of one sentence in the audio.

When displaying the at least one mark, the mobile terminal may display it in the play progress bar of the multimedia file. The mark displayed in the progress bar may be a dot, an arrow, or another symbol, which is not limited herein.

In addition, the mobile terminal may display the at least one mark in the playing interface of the multimedia file in the form of a list. In that case, the mark may be text, such as "previous sentence" or "next sentence", or other text, which is not specifically limited herein.
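The two display forms can be sketched as simple mappings from mark times to what is drawn. The function names, the pixel-based progress bar, and the mm:ss text used for list entries are assumptions chosen for illustration; the embodiment leaves the concrete symbol and text open.

```python
def bar_offsets(mark_times, total_s, bar_width_px):
    """Map each mark time (seconds) to a horizontal pixel offset on the progress bar."""
    return [round(t / total_s * bar_width_px) for t in mark_times]


def list_entries(mark_times):
    """Render each mark as an mm:ss string for display in a list."""
    return [f"{int(t) // 60:02d}:{int(t) % 60:02d}" for t in mark_times]
```

For a 200-second file drawn on a 400-pixel bar, a mark at 120 s lands at offset 240, which is where a dot or arrow would be drawn.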

S106: When a selection operation of the user on a mark is received, play the multimedia file at the position corresponding to the mark.

In S106, after the mobile terminal displays at least one mark, the user may select a certain mark according to the actual requirement of the user, and when the mobile terminal receives the selection operation of the user, the mobile terminal may jump the playing progress of the multimedia file to a progress position corresponding to the mark selected by the user, and play the multimedia file at the progress position, thereby changing the playing progress of the multimedia file.
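Step S106 amounts to a seek followed by playback. The sketch below uses a hypothetical in-memory `Player` class rather than any real media API, so the names `play_from` and `on_mark_selected` are illustrative only.

```python
class Player:
    """Minimal stand-in for a media player (illustrative, not a real API)."""

    def __init__(self, total_s):
        self.total_s = total_s
        self.position_s = 0.0
        self.playing = False

    def play_from(self, position_s):
        # Clamp the target to the file and resume playback there.
        self.position_s = min(max(position_s, 0.0), self.total_s)
        self.playing = True


def on_mark_selected(player, mark_times, index):
    """Jump the playing progress to the mark the user selected."""
    player.play_from(mark_times[index])
```

On a real terminal, `play_from` would wrap whatever seek and start calls the platform's media player provides.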

For easy understanding of the playing method of the multimedia file, refer to fig. 2. Fig. 2 is a schematic diagram illustrating a method for playing a multimedia file according to an embodiment of the present invention.

In fig. 2, in the process of playing the multimedia file, the mobile terminal may display a button for starting a sentence segmentation mode, a pause/play button, a sentence segmentation mode button, a play progress bar, an option button, and the like in a play interface of the multimedia file. When a user wants to change the playing progress of the multimedia file, the user can directly click the button for starting the sentence segmentation mode shown in fig. 2, and when the user clicks the button, the user can be regarded as having performed the preset operation for starting the sentence segmentation mode.

When receiving the preset operation of the user, the mobile terminal can automatically pause the currently played multimedia file and start the sentence segmentation mode, and once the sentence segmentation mode is started, it performs sentence segmentation on the audio in the multimedia file. For the specific implementation of the sentence segmentation, reference may be made to the relevant content recorded in S102 in the embodiment shown in fig. 1, which is not repeated here.

After the mobile terminal performs the sentence segmentation, a plurality of dots can be displayed in the play progress bar of the multimedia file according to the sentence segmentation result, wherein one dot can correspond to the starting position or the ending position of one sentence, and the audio between two adjacent dots corresponds to one sentence. In addition, a corresponding label can be shown below each dot. As shown in fig. 2, "previous 1 sentence" may be labeled below the first dot before the current playing progress, "previous 2 sentences" may be labeled below the second dot before the current playing progress, and "next 1 sentence" may be labeled below the first dot after the current playing progress.
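The labeling of dots relative to the current playing progress, as in fig. 2, can be sketched as follows. The label wording ("previous N", "next N") and the function name are assumptions of the example; only the counting outward from the current progress reflects the description.

```python
def label_marks(mark_times, current_s):
    """Label each mark by its distance, in sentences, from the current progress.

    Returns a dict mapping mark time (seconds) to a label string.
    """
    before = sorted(t for t in mark_times if t < current_s)
    after = sorted(t for t in mark_times if t > current_s)
    labels = {}
    # Count backwards from the dot closest to the current progress.
    for n, t in enumerate(reversed(before), start=1):
        labels[t] = f"previous {n}"
    # Count forwards from the dot closest to the current progress.
    for n, t in enumerate(after, start=1):
        labels[t] = f"next {n}"
    return labels
```

A mark that coincides exactly with the current progress receives no label in this sketch.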

After the mobile terminal displays the plurality of dots, a user can select to click a certain dot on the playing progress bar according to the actual requirement of the user, when the mobile terminal receives the selection operation of the user on the certain dot, the playing progress of the multimedia file can be jumped to the position of the dot, the multimedia file is played from the position of the dot, and the change of the playing progress of the multimedia file is achieved.

In another implementation, in the process of playing the multimedia file, the mobile terminal shown in fig. 2 may also automatically perform sentence segmentation on the audio in the multimedia file and display the marks shown in fig. 2 according to the segmentation result, so that the user does not need to manually operate the buttons shown in fig. 2 to trigger the sentence segmentation. For the manner in which the mobile terminal automatically performs the sentence segmentation, reference may be made to the content described in the embodiment shown in fig. 1, which is not repeated here.

It should be noted that, in the embodiment of the present invention, before the sentence segmentation, the audio may be divided into a plurality of sub-audio segments, and the sentence segmentation operation is then performed on one or more selected sub-audio segments. In this way, the user can select, according to the different time periods of the audio file, either the more important or the less important sub-audio segments for sentence segmentation.
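One way to picture this pre-division is sketched below. The equal-length division, the function names, and the `segment_fn` callback (which stands for whichever sentence segmentation routine is applied to a time range) are assumptions made for the example; the embodiment does not say how the sub-audio segments are formed.

```python
def divide(total_s, parts):
    """Split [0, total_s] into `parts` equal sub-audio segments (start, end)."""
    step = total_s / parts
    return [(i * step, (i + 1) * step) for i in range(parts)]


def segment_selected(segments, selected, segment_fn):
    """Run sentence segmentation only on the selected sub-audio segments.

    segment_fn(start, end) is assumed to return the segmentation positions,
    in seconds, found inside that time range.
    """
    positions = []
    for index in selected:
        start, end = segments[index]
        positions.extend(segment_fn(start, end))
    return positions
```

Restricting the work to selected segments keeps the number of marks small and avoids analyzing parts of the file the user does not care about.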

Fig. 3 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention. The mobile terminal includes: sentence segmentation module 31, display module 32 and play module 33, wherein:

the sentence segmentation module 31 is used for performing sentence segmentation on the audio in the multimedia file according to a first preset condition;

the display module 32 is used for displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;

and the playing module 33, when receiving the selection operation of the user on the mark, plays the multimedia file at the position corresponding to the mark.

Optionally, the sentence segmentation module 31 performs sentence segmentation on the audio in the multimedia file according to a first preset condition, including:

determining the segmentation positions of the audio according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one sentence;

performing sentence segmentation on the audio in the multimedia file based on the segmentation positions;

wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and/or the part of speech of the first word after the pause position is a noun;

or the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and the loudness of the audio after the pause position meets a second preset condition, wherein the second preset condition comprises at least one of the following: the loudness of the audio is greater than or equal to the preset loudness; the loudness of the audio gradually increases from an extremely low value to a peak value.

Optionally, before performing sentence segmentation on the audio in the multimedia file according to the first preset condition, the sentence segmentation module 31 is further used for:

detecting the extremely low values of the audio loudness when the audio is played;

the sentence segmentation module 31 performing sentence segmentation on the audio in the multimedia file according to the first preset condition specifically includes:

performing sentence segmentation on the audio according to the extremely low values, wherein one extremely low value corresponds to the starting position or the ending position of one sentence.

Optionally, the display module 32 displays at least one mark, including at least one of:

displaying the at least one mark in a playing progress bar of the multimedia file;

and displaying the at least one mark in a playing interface of the multimedia file in a list form.
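The cooperation of the three modules can be pictured with the following sketch, in which each module is represented by a plain callable. The class name `MobileTerminal`, the method names, and the callable signatures are hypothetical; the sketch only mirrors the division of work among modules 31, 32, and 33.

```python
from typing import Callable, List, Sequence


class MobileTerminal:
    """Wires together segmentation, display, and playing (illustrative only)."""

    def __init__(self,
                 segment: Callable[[Sequence[float]], List[float]],
                 display: Callable[[List[float]], None],
                 play: Callable[[float], None]):
        self.segment = segment   # stands for sentence segmentation module 31
        self.display = display   # stands for display module 32
        self.play = play         # stands for playing module 33
        self.marks: List[float] = []

    def prepare(self, audio: Sequence[float]) -> None:
        """Segment the audio and show one mark per segmentation position."""
        self.marks = self.segment(audio)
        self.display(self.marks)

    def select(self, index: int) -> None:
        """Play from the position of the mark the user selected."""
        self.play(self.marks[index])
```

Passing the modules in as callables makes it easy to swap one segmentation strategy for another without touching the display or playing logic.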

The mobile terminal provided by the embodiment of the present invention can implement each process implemented by the mobile terminal in the method embodiment of fig. 1, which is not described here again in order to avoid repetition. In the embodiment of the invention, the audio in the multimedia file can be segmented into sentences according to a first preset condition, and at least one mark is displayed according to the segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio. The user can therefore select a mark when changing the playing progress, and the mobile terminal can play the multimedia file at the position corresponding to the mark after receiving the selection operation of the user. Because each mark corresponds to the starting position or the ending position of a sentence in the audio, the user can position the playing progress accurately, which reduces the complexity of the user's operation and effectively improves the user experience.

Fig. 4 is a schematic diagram of a hardware configuration of a mobile terminal implementing various embodiments of the present invention.

The mobile terminal 400 includes, but is not limited to: radio frequency unit 401, network module 402, audio output unit 403, input unit 404, sensor 405, display unit 406, user input unit 407, interface unit 408, memory 409, processor 410, and power supply 411. Those skilled in the art will appreciate that the mobile terminal structure shown in fig. 4 does not limit the mobile terminal, and that a mobile terminal may include more or fewer components than shown, may combine some components, or may use a different arrangement of components. In the embodiment of the present invention, the mobile terminal includes, but is not limited to, a mobile phone, a tablet computer, a notebook computer, a palm computer, a vehicle-mounted terminal, a wearable device, a pedometer, and the like.

The processor 410 is configured to: perform sentence segmentation on the audio in the multimedia file according to a first preset condition; display at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and, when a selection operation of the user on a mark is received, play the multimedia file at the position corresponding to the mark.

It should be understood that, in the embodiment of the present invention, the radio frequency unit 401 may be used for receiving and sending signals during message sending and receiving or during a call. Specifically, it receives downlink data from a base station and passes the data to the processor 410 for processing; in addition, it transmits uplink data to the base station. Typically, the radio frequency unit 401 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. Further, the radio frequency unit 401 can also communicate with a network and other devices through a wireless communication system.

The mobile terminal provides the user with wireless broadband internet access through the network module 402, such as helping the user send and receive e-mails, browse web pages, and access streaming media.

The audio output unit 403 may convert audio data received by the radio frequency unit 401 or the network module 402, or stored in the memory 409, into an audio signal and output it as sound. The audio output unit 403 may also provide audio output related to a specific function performed by the mobile terminal 400 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 403 includes a speaker, a buzzer, a receiver, and the like.

The input unit 404 is used to receive audio or video signals. The input unit 404 may include a Graphics Processing Unit (GPU) 4041 and a microphone 4042. The graphics processor 4041 processes image data of still pictures or video obtained by an image capturing apparatus (such as a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 406. The image frames processed by the graphics processor 4041 may be stored in the memory 409 (or other storage medium) or transmitted via the radio frequency unit 401 or the network module 402. The microphone 4042 may receive sound and process it into audio data. In the phone call mode, the processed audio data may be converted into a format that can be transmitted to a mobile communication base station via the radio frequency unit 401, and then output.

The mobile terminal 400 also includes at least one sensor 405, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor that can adjust the brightness of the display panel 4061 according to the brightness of ambient light, and a proximity sensor that can turn off the display panel 4061 and/or the backlight when the mobile terminal 400 is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally three axes), detect the magnitude and direction of gravity when stationary, and can be used to identify the posture of the mobile terminal (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), and vibration identification related functions (such as pedometer, tapping); the sensors 405 may also include a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, etc., which will not be described in detail herein.

The display unit 406 is used to display information input by the user or information provided to the user. The display unit 406 may include a display panel 4061, and the display panel 4061 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.

The user input unit 407 may be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the mobile terminal. Specifically, the user input unit 407 includes a touch panel 4071 and other input devices 4072. The touch panel 4071, also referred to as a touch screen, may collect touch operations performed by a user on or near it (e.g., operations performed on or near the touch panel 4071 using a finger, a stylus, or any other suitable object or attachment). The touch panel 4071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the touch position of the user, detects the signal produced by the touch operation, and transmits the signal to the touch controller. The touch controller receives the touch information from the touch detection device, converts it into touch point coordinates, sends the coordinates to the processor 410, and receives and executes commands from the processor 410. The touch panel 4071 can be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave. The other input devices 4072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a track ball, a mouse, and a joystick, which are not described herein again.

Further, the touch panel 4071 can be overlaid on the display panel 4061, and when the touch panel 4071 detects a touch operation thereon or nearby, the touch operation is transmitted to the processor 410 to determine the type of the touch event, and then the processor 410 provides a corresponding visual output on the display panel 4061 according to the type of the touch event. Although in fig. 4, the touch panel 4071 and the display panel 4061 are two separate components to implement the input and output functions of the mobile terminal, in some embodiments, the touch panel 4071 and the display panel 4061 may be integrated to implement the input and output functions of the mobile terminal, which is not limited herein.
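The step in which the processor determines the type of a touch event from the reported touch point coordinates can be illustrated with the following minimal sketch. The event names, the `TouchPoint` type, the function `classify_touch`, and the threshold values are all hypothetical; the embodiment does not specify how touch events are classified.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class TouchPoint:
    t: float  # time of the sample, in seconds
    x: float  # horizontal coordinate, in pixels
    y: float  # vertical coordinate, in pixels


def classify_touch(points: Sequence[TouchPoint],
                   long_press_s: float = 0.5,
                   move_px: float = 20.0) -> str:
    """Classify one touch (finger down to finger up) from its coordinate samples."""
    if not points:
        raise ValueError("at least one touch point is required")
    first, last = points[0], points[-1]
    # Straight-line distance between where the touch began and where it ended.
    distance = math.hypot(last.x - first.x, last.y - first.y)
    duration = last.t - first.t
    if distance >= move_px:
        return "swipe"
    if duration >= long_press_s:
        return "long_press"
    return "tap"
```

Under these assumptions, a short touch that stays in place is reported as a tap, and the processor could then map each event type to a corresponding visual output on the display panel.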

The interface unit 408 is an interface through which an external device is connected to the mobile terminal 400. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 408 may be used to receive input (e.g., data information, power, etc.) from external devices and transmit the received input to one or more elements within the mobile terminal 400 or may be used to transmit data between the mobile terminal 400 and external devices.

The memory 409 may be used to store software programs as well as various data. The memory 409 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the mobile terminal, and the like. Further, the memory 409 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device.

The processor 410 is the control center of the mobile terminal. It connects the various parts of the entire mobile terminal using various interfaces and lines, and performs the functions of the mobile terminal and processes data by running or executing software programs and/or modules stored in the memory 409 and calling data stored in the memory 409, thereby monitoring the mobile terminal as a whole. The processor 410 may include one or more processing units; preferably, the processor 410 may integrate an application processor, which mainly handles the operating system, user interfaces, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 410.

The mobile terminal 400 may further include a power supply 411 (e.g., a battery) for supplying power to various components, and preferably, the power supply 411 may be logically connected to the processor 410 through a power management system, so as to implement functions of managing charging, discharging, and power consumption through the power management system.

In addition, the mobile terminal 400 includes some functional modules that are not shown, and thus, are not described in detail herein.

Preferably, an embodiment of the present invention further provides a mobile terminal, which includes a processor 410, a memory 409, and a computer program that is stored in the memory 409 and can be run on the processor 410. When executed by the processor 410, the computer program implements each process of the above-mentioned multimedia file playing method embodiment and can achieve the same technical effect; to avoid repetition, details are not repeated here.
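Purely as an illustration of what such a computer program might do, the following sketch segments audio into sentences, produces one mark per sentence start, and returns the playback position for a selected mark. It assumes, for the sake of the example, that the first preset condition is a pause of at least a given length. That assumption, the per-frame energy input, the function names `sentence_start_marks` and `select_mark`, and all threshold values are hypothetical and are not taken from the embodiments.

```python
from typing import List, Sequence


def sentence_start_marks(energies: Sequence[float],
                         frame_ms: int = 20,
                         silence_level: float = 0.01,
                         min_pause_ms: int = 300) -> List[int]:
    """Return the start positions (in ms) of sentences in the audio.

    energies holds one energy value per audio frame. A new sentence is assumed
    to start at the first non-silent frame after a pause of at least min_pause_ms.
    """
    # Number of consecutive silent frames that counts as a pause (ceiling division).
    min_pause_frames = -(-min_pause_ms // frame_ms)
    marks: List[int] = []
    # Treat the beginning of the file as if it were preceded by a pause.
    silent_run = min_pause_frames
    for i, energy in enumerate(energies):
        if energy < silence_level:
            silent_run += 1
        else:
            if silent_run >= min_pause_frames:
                marks.append(i * frame_ms)
            silent_run = 0
    return marks


def select_mark(marks_ms: Sequence[int], index: int) -> int:
    """Return the playback position (in ms) for the mark the user selected."""
    if not 0 <= index < len(marks_ms):
        raise IndexError("no such mark")
    return marks_ms[index]
```

A player built on this sketch would display one mark per returned position and, when the user selects a mark, seek to the value returned by `select_mark` before resuming playback.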

An embodiment of the present invention further provides a computer-readable storage medium on which a computer program is stored. When executed by a processor, the computer program implements each process of the above-mentioned multimedia file playing method embodiment and can achieve the same technical effect; to avoid repetition, details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed, or elements inherent to such process, method, article, or apparatus. The term "comprising" is used to specify the presence of stated features, integers, steps, operations, elements, or components.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A method for playing a multimedia file, comprising:

2. The method of claim 1, wherein the sentence segmentation of the audio in the multimedia file according to the first preset condition comprises:

3. The method of claim 1, wherein the sentence segmentation of the audio in the multimedia file according to the first preset condition comprises:

4. The method of claim 1, further comprising, before the step of performing sentence segmentation on the audio in the multimedia file according to the first preset condition:

the sentence segmentation of the audio in the multimedia file according to the first preset condition specifically includes:

5. The method of claim 1, wherein said displaying at least one mark comprises at least one of:

6. A mobile terminal, comprising:

7. The mobile terminal of claim 6, wherein the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and comprises:

8. The mobile terminal of claim 6, wherein the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and comprises:

9. The mobile terminal of claim 6, wherein the sentence segmentation module, before the step of sentence segmentation of the audio in the multimedia file according to the first preset condition, further comprises:

the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and specifically includes:

10. The mobile terminal of claim 6, wherein the display module displaying at least one mark comprises at least one of: