CN110636369A - Multimedia file playing method and mobile terminal - Google Patents

Multimedia file playing method and mobile terminal Download PDF

Info

Publication number
CN110636369A
CN110636369A CN201910929797.6A CN201910929797A CN110636369A CN 110636369 A CN110636369 A CN 110636369A CN 201910929797 A CN201910929797 A CN 201910929797A CN 110636369 A CN110636369 A CN 110636369A
Authority
CN
China
Prior art keywords
audio
multimedia file
segmentation
sentence
preset condition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910929797.6A
Other languages
Chinese (zh)
Inventor
罗征武
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd filed Critical Vivo Mobile Communication Co Ltd
Priority to CN201910929797.6A priority Critical patent/CN110636369A/en
Publication of CN110636369A publication Critical patent/CN110636369A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a multimedia file playing method and a mobile terminal, wherein the method comprises the following steps: performing sentence segmentation on the audio in the multimedia file according to a first preset condition; displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.

Description

Multimedia file playing method and mobile terminal
Technical Field
The invention relates to the technical field of audio processing, in particular to a multimedia file playing method and a mobile terminal.
Background
In the existing mobile terminal, various multimedia applications, such as an audio player, a video player, etc., may be installed, and these multimedia applications may be used to play multimedia files, and may allow a user to change the playing progress of the multimedia files during the playing of the multimedia files, such as playing back or jumping to play the multimedia files.
Generally, when the user changes the playing progress of the multimedia file, the user can drag the progress bar to implement the process. However, in practical applications, the precision of the method for dragging the progress bar is not high, and the user may need to drag the progress bar for multiple times to position the playing progress to the desired playing progress position, so that the user experience is affected.
Disclosure of Invention
The embodiment of the invention provides a playing method of a multimedia file and a mobile terminal, and aims to solve the problem that in the prior art, when a user changes the playing progress of the multimedia file, the progress to be played is difficult to accurately position.
In order to solve the technical problem, the invention is realized as follows:
in a first aspect, a method for playing a multimedia file is provided, where the method includes:
performing sentence segmentation on the audio in the multimedia file according to a first preset condition;
displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;
and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.
In a second aspect, a terminal device is provided, which includes:
the sentence segmentation module is used for carrying out sentence segmentation on the audio in the multimedia file according to a first preset condition;
the display module displays at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;
and the playing module plays the multimedia file at the position corresponding to the mark when receiving the selection operation of the user on the mark.
In a third aspect, a terminal device is provided, the terminal device comprising a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the method according to the first aspect.
In a fourth aspect, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the method according to the first aspect.
In the embodiment of the invention, the audio in the multimedia file can be segmented by sentences according to a first preset condition, and at least one mark is displayed according to the segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio, so that a user can select the mark when changing the playing progress, and the mobile terminal can play the multimedia file at the position corresponding to the mark after receiving the selection operation of the user. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and not to limit the invention. In the drawings:
FIG. 1 is a flow chart illustrating a method for playing a multimedia file according to an embodiment of the present invention;
FIG. 2 is a diagram illustrating a method for playing a multimedia file according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a hardware structure of a mobile terminal implementing various embodiments of the present invention.
Detailed Description
Generally, when a user plays a multimedia file using a multimedia application installed in a mobile terminal, the playing progress of the multimedia file may be changed by dragging a progress bar, such as playing the multimedia file back or jumping to play.
For example, when the mobile terminal plays an audio file for hearing learning, a pause/play button for pausing or playing, a button for switching the currently played audio file to the previous audio file, a button for switching the currently played audio file to the next audio file, and a play progress bar for showing the current play progress may be shown below the play interface. When a user wants to change the playing progress to play back a certain section of audio, the user can repeatedly drag the progress bar displayed in the playing interface to adjust the current playing progress to the position of the progress to be played, and then the key hearing learning of the section of audio is realized.
However, in practical applications, the precision of the method for dragging the progress bar is not high, and the user needs to repeatedly drag the progress bar to position the playing progress to the desired progress position, which not only wastes time, but also increases the complexity of user operations, thereby affecting the user experience. In addition, if the user does not know the playing progress corresponding to the multimedia content that the user wants to play, the user cannot adjust the playing progress by dragging the progress bar.
In order to solve the above technical problem, an embodiment of the present invention provides a method for playing a multimedia file and a mobile terminal, where the method includes: performing sentence segmentation on the audio in the multimedia file according to a first preset condition; displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.
Therefore, when the user changes the playing progress, the mark can be selected, and after the mobile terminal receives the selection operation of the user, the multimedia file can be played at the position corresponding to the mark. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The technical solutions provided by the embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Fig. 1 is a flowchart illustrating a method for playing a multimedia file according to an embodiment of the present invention. The method is as follows.
S102: and carrying out sentence segmentation on the audio in the multimedia file according to a first preset condition.
In S102, the mobile terminal may automatically perform sentence segmentation on the audio in the multimedia file according to a first preset condition in the process of playing the multimedia file.
Optionally, in the process of playing the multimedia file, the mobile terminal may further trigger the sentence segmentation of the audio in the multimedia file by the user. The specific implementation mode is as follows:
when a user wants to change the playing progress of the multimedia file in the process of playing the multimedia file by using the multimedia application, the user can execute preset operation in a playing interface of the multimedia file, and the preset operation is used for starting a sentence segmentation mode.
In this embodiment, when the user performs the preset operation, at least three ways may be implemented, including:
the first method comprises the following steps: in the process of playing the multimedia file, a pause/play button and a button for starting the sentence segmentation mode can be displayed in a playing interface of the multimedia file, and when a user successively clicks the pause/play button and the button for starting the sentence segmentation mode, the user can be regarded as executing preset operation;
and the second method comprises the following steps: in the process of playing the multimedia file, a button for starting the sentence segmentation mode can be displayed in a playing interface of the multimedia file, and when a user clicks the button of the sentence segmentation mode, the user can be regarded as executing preset operation.
Optionally, the button of the sentence segmentation mode may also be displayed in an option interface of the multimedia application, which is not specifically limited herein.
And the third is that: the user can set the pause button in the mobile terminal in advance based on the setting interface of the multimedia application, and the pause button is used as a button of the sentence segmentation mode, so that when the user clicks the pause button in the playing process of the multimedia file, the user can be regarded as executing preset operation.
After the user executes the preset operation, the mobile terminal can receive the preset operation, at the moment, the mobile terminal can automatically pause the currently played multimedia file, and start the sentence segmentation mode, and when the sentence segmentation mode is started, perform sentence segmentation on the audio in the multimedia file.
In this embodiment, when performing sentence segmentation, the mobile terminal may default to perform sentence segmentation on the audio in the entire multimedia file, or may default to perform sentence segmentation within a preset progress range, and in addition, the sentence segmentation range of the audio in the multimedia file may also be preset by the user, for example, the user may preset to perform sentence segmentation on the audio whose play progress is within 1 minute to 2 minutes, which is not specifically limited in this embodiment.
It should be noted that when the mobile terminal performs statement segmentation on the audio within a preset progress range (which may be a range preset by a user or a default range of the mobile terminal), the audio corresponding to the preset progress range may be a section of audio before the current playing progress, or a section of audio after the current playing progress.
In this embodiment, when the mobile terminal performs statement segmentation on the audio in the multimedia file according to the first preset condition, specific implementation manners may include the following three types:
the first implementation mode comprises the following steps:
in a first implementation manner, the first preset condition may include at least one of the following: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration; the part of speech of the first vocabulary after the pause position is a noun, wherein the preset duration can be determined according to practical situations, for example, the preset duration can be set to 2s, or the average phoneme interval time of the whole audio can be obtained by counting the interval time between each adjacent phoneme in the audio, because the interval time between phonemes is shorter in the same sentence, and the interval time between the last phoneme of the previous sentence and the first phoneme of the next sentence is longer between sentences, the average phoneme interval time can be used as the preset duration, or the average phoneme interval time is multiplied by a certain multiple to be used as the preset duration.
When the audio is subjected to sentence division according to the first preset condition, the mobile terminal can detect the audio in the currently played multimedia file and determine the pause position of the audio during playing. One or more pause positions of the audio may be provided, and the following description may use one pause position as an example.
After the pause position is determined, whether the pause duration corresponding to the pause position and the part of speech of the first vocabulary after the pause position meet a first preset condition or not can be judged, if so, the segmentation position of the audio can be determined according to the first preset condition, namely, the pause position meeting the first preset condition is determined as the segmentation position for segmenting the audio, and the audio is segmented according to the segmentation position.
When the mobile terminal performs sentence segmentation on the audio based on the segmentation positions, one segmentation position may correspond to the start position or the end position of one sentence, and if the segmentation positions are multiple, the audio between two adjacent segmentation positions may correspond to one sentence.
The second implementation mode comprises the following steps:
in a second implementation manner, the first preset condition may include: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and the loudness of the audio after the pause position meets a second preset condition. The preset duration may be determined according to actual conditions, and the second preset condition may include at least one of the audio loudness being greater than or equal to the preset loudness and the audio loudness gradually increasing from an extremely low value to a peak value.
When the audio is subjected to sentence division according to the first preset condition, the mobile terminal can detect the audio in the currently played multimedia file and determine the pause position of the audio during playing. One or more pause positions of the audio may be provided, and the following description may use one pause position as an example.
After the pause position is determined, whether the pause duration corresponding to the pause position and the loudness of the audio after the pause position meet a second preset condition or not can be judged, if so, the segmentation position of the audio can be determined according to a first preset condition, namely, the pause position meeting the first preset condition is determined as the segmentation position for segmenting the audio, and the audio is segmented according to the segmentation position.
When the mobile terminal performs sentence segmentation on the audio based on the segmentation positions, one segmentation position may correspond to the start position or the end position of one sentence, and if the segmentation positions are multiple, the audio between two adjacent segmentation positions may correspond to one sentence.
The third implementation mode comprises the following steps:
in a third implementation manner, before performing statement segmentation on the audio according to a first preset condition, the mobile terminal may detect the audio in the currently played multimedia file, and determine an extremely low value corresponding to the loudness of the audio in the multimedia file, where the number of the extremely low values may be one or multiple.
After the extremely low value is determined, when the audio is subjected to sentence segmentation according to the first preset condition, the extremely low value can be used as the first preset condition, and the audio is subjected to sentence segmentation based on the extremely low value. When the mobile terminal performs statement segmentation on the audio based on the extremely low values, one extremely low value can correspond to the starting position or the ending position of one statement, and if the number of the extremely low values is multiple, the audio between two adjacent extremely low values can correspond to one statement.
S104: and displaying at least one mark according to the sentence segmentation result, wherein one mark corresponds to the starting position or the ending position of one sentence in the audio.
In S104, after performing sentence segmentation on the audio in the multimedia file, the mobile terminal may display at least one mark according to a result of the sentence segmentation, so that the user changes the playing progress based on the mark. Wherein a marker may correspond to the start or end position of a sentence in audio.
The mobile terminal may display at least one mark in a play progress bar of the multimedia file when displaying the at least one mark. The mark displayed in the progress bar may be a dot, an arrow, or other symbols, which is not limited herein.
In addition, when the mobile terminal displays at least one mark, the at least one mark can be displayed in a play interface of the multimedia file in a list form. When the mobile terminal displays at least one mark in the form of a list, the mark may be a text, such as a previous sentence, a next sentence, or the like, and may also be another text, which is not limited specifically herein.
S106: and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.
In S106, after the mobile terminal displays at least one mark, the user may select a certain mark according to the actual requirement of the user, and when the mobile terminal receives the selection operation of the user, the mobile terminal may jump the playing progress of the multimedia file to a progress position corresponding to the mark selected by the user, and play the multimedia file at the progress position, thereby changing the playing progress of the multimedia file.
For easy understanding of the playing method of the multimedia file, refer to fig. 2. Fig. 2 is a schematic diagram illustrating a method for playing a multimedia file according to an embodiment of the present invention.
In fig. 2, in the process of playing the multimedia file, the mobile terminal may display a button for starting a sentence segmentation mode, a pause/play button, a sentence segmentation mode button, a play progress bar, an option button, and the like in a play interface of the multimedia file. When a user wants to change the playing progress of the multimedia, the user can directly click the button for opening the sentence segmentation mode shown in fig. 2, and when the user clicks the button, the user can be regarded as the user executing a preset operation to open the sentence segmentation module.
The mobile terminal can automatically pause the currently played multimedia file and start a sentence segmentation mode when receiving the preset operation of a user, and performs sentence segmentation on the audio frequency in the multimedia file when starting the sentence segmentation mode. When performing statement segmentation, the specific implementation manner may refer to relevant contents recorded in S102 in the embodiment shown in fig. 1, and a description thereof is not repeated here.
After the sentence segmentation is performed by the mobile terminal, a plurality of dots can be displayed in the play progress bar of the multimedia file according to the sentence segmentation result, wherein one dot can correspond to the starting position or the ending position of one sentence, and the audio frequency between two adjacent dots corresponds to one sentence. In addition, corresponding labeling can be performed below each dot. As shown in fig. 2, the first 1 sentence may be marked below a first dot before the current playing progress, the first 2 sentences may be marked below a second dot before the current playing progress, and the last 1 sentence may be marked below the first dot after the current playing progress.
After the mobile terminal displays the plurality of dots, a user can select to click a certain dot on the playing progress bar according to the actual requirement of the user, when the mobile terminal receives the selection operation of the user on the certain dot, the playing progress of the multimedia file can be jumped to the position of the dot, the multimedia file is played from the position of the dot, and the change of the playing progress of the multimedia file is achieved.
In another implementation manner, in the process of playing the multimedia file, the mobile terminal shown in fig. 2 may also automatically perform sentence segmentation on the audio in the multimedia file, and display the mark shown in fig. 2 according to the segmentation result, and perform sentence segmentation on the audio in the multimedia file after the user does not need to manually operate each button shown in fig. 2. The implementation manner of the mobile terminal automatically performing the sentence segmentation may refer to the content described in the embodiment shown in fig. 1, and will not be described repeatedly here.
It should be noted that, in the embodiment of the present invention, before the voice segmentation, the audio may be divided to obtain a plurality of sub-audio bands, and then the voice segmentation operation is performed on one or more selected sub-audio bands, so that a user may select a more important sub-audio band for voice segmentation or a less important sub-audio band for voice segmentation according to different time periods of the audio file.
In the embodiment of the invention, the audio in the multimedia file can be segmented by sentences according to a first preset condition, and at least one mark is displayed according to the segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio, so that a user can select the mark when changing the playing progress, and the mobile terminal can play the multimedia file at the position corresponding to the mark after receiving the selection operation of the user. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.
Fig. 3 is a schematic structural diagram of a mobile terminal according to an embodiment of the present invention. The mobile terminal includes: sentence segmentation module 31, display module 32 and play module 33, wherein:
the sentence segmentation module 31 is used for performing sentence segmentation on the audio in the multimedia file according to a first preset condition;
the display module 32 is used for displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;
and the playing module 33, when receiving the selection operation of the user on the mark, plays the multimedia file at the position corresponding to the mark.
Optionally, the sentence segmentation module 31 performs sentence segmentation on the audio in the multimedia file according to a first preset condition, including:
determining the segmentation positions of the audio according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one statement;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and/or the part of speech of the first vocabulary after the pause position is a noun.
Optionally, the sentence segmentation module 31 performs sentence segmentation on the audio in the multimedia file according to a first preset condition, including:
determining the pause position of the audio during playing according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one sentence;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, the loudness of the audio after the pause position meets a second preset condition, and the second preset condition comprises at least one of the loudness of the audio being greater than or equal to the preset loudness and the loudness of the audio gradually increasing from an extremely low value to a peak value.
Optionally, before the step of performing sentence segmentation on the audio in the multimedia file according to the first preset condition, the sentence segmentation module 31 further includes:
detecting an extremely low value corresponding to the audio loudness when the audio is played;
the sentence segmentation module 31 performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and specifically includes:
and performing statement segmentation on the audio according to the extremely low values, wherein one extremely low value corresponds to the starting position or the ending position of one statement.
Optionally, the display module 32 displays at least one mark, including at least one of:
displaying the at least one mark in a playing progress bar of the multimedia file;
and displaying the at least one mark in a playing interface of the multimedia file in a list form.
The mobile terminal provided by the embodiment of the present invention can implement each process implemented by the mobile terminal in the method embodiment of fig. 1, and is not described herein again in order to avoid repetition. In the embodiment of the invention, the audio in the multimedia file can be segmented by sentences according to a first preset condition, and at least one mark is displayed according to the segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio, so that a user can select the mark when changing the playing progress, and the mobile terminal can play the multimedia file at the position corresponding to the mark after receiving the selection operation of the user. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.
Figure 4 is a schematic diagram of a hardware configuration of a mobile terminal implementing various embodiments of the present invention,
the mobile terminal 400 includes, but is not limited to: radio frequency unit 401, network module 402, audio output unit 403, input unit 404, sensor 405, display unit 406, user input unit 407, interface unit 408, memory 409, processor 410, and power supply 411. Those skilled in the art will appreciate that the mobile terminal architecture shown in fig. 4 is not intended to be limiting of mobile terminals, and that a mobile terminal may include more or fewer components than shown, or some components may be combined, or a different arrangement of components. In the embodiment of the present invention, the mobile terminal includes, but is not limited to, a mobile phone, a tablet computer, a notebook computer, a palm computer, a vehicle-mounted terminal, a wearable device, a pedometer, and the like.
The processor 410 performs statement segmentation on the audio in the multimedia file according to a first preset condition; displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio; and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.
Therefore, when the user changes the playing progress, the mark can be selected, and after the mobile terminal receives the selection operation of the user, the multimedia file can be played at the position corresponding to the mark. Because the user can select the mark when changing the playing progress of the multimedia file, and the mark corresponds to the starting position or the ending position of the statement in the audio, the user can accurately position the playing progress, thereby effectively improving the use experience of the user while reducing the complexity of the user operation.
It should be understood that, in the embodiment of the present invention, the radio frequency unit 401 may be used for receiving and sending signals during a message sending and receiving process or a call process, and specifically, receives downlink data from a base station and then processes the received downlink data to the processor 410; in addition, the uplink data is transmitted to the base station. Typically, radio unit 401 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. Further, the radio unit 401 can also communicate with a network and other devices through a wireless communication system.
The mobile terminal provides the user with wireless broadband internet access through the network module 402, such as helping the user send and receive e-mails, browse web pages, and access streaming media.
The audio output unit 403 may convert audio data received by the radio frequency unit 401 or the network module 402 or stored in the memory 409 into an audio signal and output as sound. Also, the audio output unit 403 may also provide audio output related to a specific function performed by the mobile terminal 400 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 403 includes a speaker, a buzzer, a receiver, and the like.
The input unit 404 is used to receive audio or video signals. The input Unit 404 may include a Graphics Processing Unit (GPU) 4041 and a microphone 4042, and the Graphics processor 4041 processes image data of a still picture or video obtained by an image capturing apparatus (such as a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 406. The image frames processed by the graphic processor 4041 may be stored in the memory 409 (or other storage medium) or transmitted via the radio frequency unit 401 or the network module 402. The microphone 4042 may receive sound, and may be capable of processing such sound into audio data. The processed audio data may be converted into a format output transmittable to a mobile communication base station via the radio frequency unit 401 in case of the phone call mode.
The mobile terminal 400 also includes at least one sensor 405, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor that can adjust the brightness of the display panel 4061 according to the brightness of ambient light, and a proximity sensor that can turn off the display panel 4061 and/or the backlight when the mobile terminal 400 is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally three axes), detect the magnitude and direction of gravity when stationary, and can be used to identify the posture of the mobile terminal (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), and vibration identification related functions (such as pedometer, tapping); the sensors 405 may also include a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, etc., which will not be described in detail herein.
The display unit 406 is used to display information input by the user or information provided to the user. The Display unit 406 may include a Display panel 4061, and the Display panel 4061 may be configured in the form of a Liquid Crystal Display (LCD), an organic light-Emitting Diode (OLED), or the like.
The user input unit 407 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the mobile terminal. Specifically, the user input unit 407 includes a touch panel 4071 and other input devices 4072. Touch panel 4071, also referred to as a touch screen, may collect touch operations by a user on or near it (e.g., operations by a user on or near touch panel 4071 using a finger, a stylus, or any suitable object or attachment). The touch panel 4071 may include two parts, a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 410, receives a command from the processor 410, and executes the command. In addition, the touch panel 4071 can be implemented by using various types such as a resistive type, a capacitive type, an infrared ray, and a surface acoustic wave. In addition to the touch panel 4071, the user input unit 407 may include other input devices 4072. Specifically, the other input devices 4072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a track ball, a mouse, and a joystick, which are not described herein again.
Further, the touch panel 4071 can be overlaid on the display panel 4061, and when the touch panel 4071 detects a touch operation thereon or nearby, the touch operation is transmitted to the processor 410 to determine the type of the touch event, and then the processor 410 provides a corresponding visual output on the display panel 4061 according to the type of the touch event. Although in fig. 4, the touch panel 4071 and the display panel 4061 are two separate components to implement the input and output functions of the mobile terminal, in some embodiments, the touch panel 4071 and the display panel 4061 may be integrated to implement the input and output functions of the mobile terminal, which is not limited herein.
The interface unit 408 is an interface through which an external device is connected to the mobile terminal 400. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 408 may be used to receive input (e.g., data information, power, etc.) from external devices and transmit the received input to one or more elements within the mobile terminal 400 or may be used to transmit data between the mobile terminal 400 and external devices.
The memory 409 may be used to store software programs as well as various data. The memory 409 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 409 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
The processor 410 is a control center of the mobile terminal, connects various parts of the entire mobile terminal using various interfaces and lines, and performs various functions of the mobile terminal and processes data by operating or executing software programs and/or modules stored in the memory 409 and calling data stored in the memory 409, thereby integrally monitoring the mobile terminal. Processor 410 may include one or more processing units; preferably, the processor 410 may integrate an application processor, which mainly handles operating systems, user interfaces, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 410.
The mobile terminal 400 may further include a power supply 411 (e.g., a battery) for supplying power to various components, and preferably, the power supply 411 may be logically connected to the processor 410 through a power management system, so as to implement functions of managing charging, discharging, and power consumption through the power management system.
In addition, the mobile terminal 400 includes some functional modules that are not shown, and thus, are not described in detail herein.
Preferably, an embodiment of the present invention further provides a mobile terminal, which includes a processor 410, a memory 409, and a computer program that is stored in the memory 409 and can be run on the processor 410, and when being executed by the processor 410, the computer program implements each process of the above-mentioned multimedia file playing method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements each process of the above-mentioned multimedia file playing method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. The term "comprising" is used to specify the presence of stated features, integers, steps, operations, elements, components, operations.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A method for playing a multimedia file, comprising:
performing sentence segmentation on the audio in the multimedia file according to a first preset condition;
displaying at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;
and when the selection operation of the user on the mark is received, playing the multimedia file at the position corresponding to the mark.
2. The method of claim 1, wherein the sentence slicing of the audio in the multimedia file according to the first preset condition comprises:
determining the segmentation positions of the audio according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one statement;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and/or the part of speech of the first vocabulary after the pause position is a noun.
3. The method of claim 1, wherein the sentence slicing of the audio in the multimedia file according to the first preset condition comprises:
determining the segmentation positions of the audio according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one statement;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, the loudness of the audio after the pause position meets a second preset condition, and the second preset condition comprises at least one of the loudness of the audio being greater than or equal to the preset loudness and the loudness of the audio gradually increasing from an extremely low value to a peak value.
4. The method of claim 1, wherein before the step of sentence slicing audio in the multimedia file according to the first preset condition, further comprising:
detecting an extremely low value corresponding to the audio loudness when the audio is played;
the sentence segmentation of the audio in the multimedia file according to the first preset condition specifically includes:
and performing statement segmentation on the audio according to the extremely low values, wherein one extremely low value corresponds to the starting position or the ending position of one statement.
5. The method of claim 1, wherein said displaying at least one indicia comprises at least one of:
displaying the at least one mark in a playing progress bar of the multimedia file;
and displaying the at least one mark in a playing interface of the multimedia file in a list form.
6. A mobile terminal, comprising:
the sentence segmentation module is used for carrying out sentence segmentation on the audio in the multimedia file according to a first preset condition;
the display module displays at least one mark according to the sentence segmentation result, wherein each mark corresponds to the starting position or the ending position of one sentence in the audio;
and the playing module plays the multimedia file at the position corresponding to the mark when receiving the selection operation of the user on the mark.
7. The mobile terminal of claim 6, wherein the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and comprises:
determining the segmentation positions of the audio according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one statement;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, and/or the part of speech of the first vocabulary after the pause position is a noun.
8. The mobile terminal of claim 6, wherein the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and comprises:
determining the pause position of the audio during playing according to a first preset condition, wherein one segmentation position corresponds to the starting position or the ending position of one sentence;
performing sentence segmentation on the audio in the multimedia file based on the segmentation position;
wherein the first preset condition comprises: the pause duration corresponding to the pause position of the audio during playing is greater than or equal to the preset duration, the loudness of the audio after the pause position meets a second preset condition, and the second preset condition comprises at least one of the loudness of the audio being greater than or equal to the preset loudness and the loudness of the audio gradually increasing from an extremely low value to a peak value.
9. The mobile terminal of claim 6, wherein the sentence segmentation module, before the step of sentence segmentation of the audio in the multimedia file according to the first preset condition, further comprises:
detecting an extremely low value corresponding to the audio loudness when the audio is played;
the sentence segmentation module performs sentence segmentation on the audio in the multimedia file according to a first preset condition, and specifically includes:
and performing statement segmentation on the audio according to the extremely low values, wherein one extremely low value corresponds to the starting position or the ending position of one statement.
10. The mobile terminal of claim 6, wherein the display module, displaying at least one indicia, comprises at least one of:
displaying the at least one mark in a playing progress bar of the multimedia file;
and displaying the at least one mark in a playing interface of the multimedia file in a list form.
CN201910929797.6A 2019-09-27 2019-09-27 Multimedia file playing method and mobile terminal Pending CN110636369A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910929797.6A CN110636369A (en) 2019-09-27 2019-09-27 Multimedia file playing method and mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910929797.6A CN110636369A (en) 2019-09-27 2019-09-27 Multimedia file playing method and mobile terminal

Publications (1)

Publication Number Publication Date
CN110636369A true CN110636369A (en) 2019-12-31

Family

ID=68973224

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910929797.6A Pending CN110636369A (en) 2019-09-27 2019-09-27 Multimedia file playing method and mobile terminal

Country Status (1)

Country Link
CN (1) CN110636369A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111954012A (en) * 2020-08-12 2020-11-17 上海遥知信息技术有限公司 Multimedia resource file playing method and device, computer equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188259A1 (en) * 2009-05-27 2014-07-03 Hon Hai Precision Industry Co., Ltd. Audio playback positioning method and electronic device system utilizing the same
CN104078044A (en) * 2014-07-02 2014-10-01 深圳市中兴移动通信有限公司 Mobile terminal and sound recording search method and device of mobile terminal
CN105632484A (en) * 2016-02-19 2016-06-01 上海语知义信息技术有限公司 Voice synthesis database pause information automatic marking method and system
CN108074574A (en) * 2017-11-29 2018-05-25 维沃移动通信有限公司 Audio-frequency processing method, device and mobile terminal
CN108924610A (en) * 2018-07-20 2018-11-30 网易(杭州)网络有限公司 Multimedia file processing method, device, medium and calculating equipment
CN109994126A (en) * 2019-03-11 2019-07-09 北京三快在线科技有限公司 Audio message segmentation method, device, storage medium and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188259A1 (en) * 2009-05-27 2014-07-03 Hon Hai Precision Industry Co., Ltd. Audio playback positioning method and electronic device system utilizing the same
CN104078044A (en) * 2014-07-02 2014-10-01 深圳市中兴移动通信有限公司 Mobile terminal and sound recording search method and device of mobile terminal
CN105632484A (en) * 2016-02-19 2016-06-01 上海语知义信息技术有限公司 Voice synthesis database pause information automatic marking method and system
CN108074574A (en) * 2017-11-29 2018-05-25 维沃移动通信有限公司 Audio-frequency processing method, device and mobile terminal
CN108924610A (en) * 2018-07-20 2018-11-30 网易(杭州)网络有限公司 Multimedia file processing method, device, medium and calculating equipment
CN109994126A (en) * 2019-03-11 2019-07-09 北京三快在线科技有限公司 Audio message segmentation method, device, storage medium and electronic equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111954012A (en) * 2020-08-12 2020-11-17 上海遥知信息技术有限公司 Multimedia resource file playing method and device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109343759B (en) Screen-turning display control method and terminal
CN108415652B (en) Text processing method and mobile terminal
CN108737904B (en) Video data processing method and mobile terminal
CN109525874B (en) Screen capturing method and terminal equipment
CN110109593B (en) Screen capturing method and terminal equipment
CN111596818A (en) Message display method and electronic equipment
CN108391008B (en) Message reminding method and mobile terminal
CN109710349B (en) Screen capturing method and mobile terminal
CN109189303B (en) Text editing method and mobile terminal
CN111010608B (en) Video playing method and electronic equipment
CN110995919B (en) Message processing method and electronic equipment
CN109412932B (en) Screen capturing method and terminal
CN110221795B (en) Screen recording method and terminal
CN111026305A (en) Audio processing method and electronic equipment
CN110855549A (en) Message display method and terminal equipment
CN110062281B (en) Play progress adjusting method and terminal equipment thereof
CN111061446A (en) Display method and electronic equipment
CN108520760B (en) Voice signal processing method and terminal
CN108628534B (en) Character display method and mobile terminal
CN108270928B (en) Voice recognition method and mobile terminal
CN108307048B (en) Message output method and device and mobile terminal
CN108108338B (en) Lyric processing method, lyric display method, server and mobile terminal
CN112217713B (en) Method and device for displaying message
CN111273827B (en) Text processing method and electronic equipment
CN111049977B (en) Alarm clock reminding method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191231

RJ01 Rejection of invention patent application after publication