WO2020119455A1 - Method for repeating word or sentence during video playback, and electronic device - Google Patents

Publication number
WO2020119455A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
interface
user
word
electronic device
Prior art date
Application number
PCT/CN2019/121187
Other languages
French (fr)
Chinese (zh)
Inventor
王有俊
祁毅
郭志刚
胡惠淳
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2020119455A1

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00 Teaching not covered by other main groups of this subclass
    • G09B19/06 Foreign languages
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/442 Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462 Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/475 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data

Definitions

  • the present application relates to the field of electronic technology, and in particular to a method and an electronic device for repeating words or sentences during video playback.
  • the correlation between English video resources and English words is low, and it is not easy for users to learn English words or sentences through video resources during the English learning process.
  • the user can only repeat English words or sentences by dragging the progress bar, and the playback position reached by dragging is not precise enough. If the video content is instead edited so that words or sentences are repeated, the duration of the video itself is affected, and the user experience is poor.
  • the present application provides a video playback method and electronic device, which can realize the repetition of words or sentences during video playback, improve the user's English learning effect, and improve the user experience.
  • a method for playing a video includes: displaying a first interface, the first interface displaying a first video being played and subtitles of the first video, the subtitles of the first video including a first text unit and a second text unit; when a first segment of the first video corresponding to the first text unit is played, automatically playing the first segment repeatedly on the first interface; detecting a first operation of the user on the first interface; and in response to the first operation, displaying a second interface, the second interface displaying first information associated with the first text unit.
  • the text unit (for example, the first text unit and the second text unit) in the user learning process may be a single word, or may be a phrase, a sentence, or the like made up of multiple words, which is not limited in this application.
  • the first interface can correspond to many possible situations. For example, in the full-screen playback mode of the electronic device, the first interface refers to the video playback display area, which displays the subtitles and so on. Alternatively, when the electronic device is in the non-full-screen mode, the first interface may include, in addition to the video playback display area, other display areas, such as an area showing the analysis details of the text unit and other learning video resources associated with the text unit. This application does not limit this.
  • the first information may refer to the analysis details of the first text unit, such as its British and American pronunciation, Chinese interpretation, Chinese and English example sentences, and related learning videos.
  • the second interface is the interface displayed after the user clicks a text unit in the subtitles to show the analysis details of that text unit, for example, the video playback interface with the analysis window of the text unit popped up.
  • like the first interface, the second interface can correspond to many possible situations.
  • the second interface also includes the analysis pop-up window displayed after the user clicks the text unit in the subtitles.
  • the analysis window of the text unit may include the British and American pronunciation, Chinese interpretation, and Chinese and English example sentences of the text unit, and the like. This application does not limit this.
  • word analysis and analysis details may come from the English dictionary built into the system itself, or may be associated with other English online dictionaries, etc., which is not limited in this application.
  • users can also click on phrases in video subtitles. For example, some words mostly appear as part of a phrase, and when the user clicks such a word, the analysis may be presented for the phrase. For example, when the detailed content presented in the word analysis interface is associated with an English dictionary and the word mainly appears in the dictionary in the form of a phrase, the analysis or interpretation of the phrase may also pop up when the user clicks on the word while watching the video. This application does not limit this.
  • the above-mentioned method for learning words, phrases, or sentences during video playback lets the user learn English words while watching videos. The user can click to enter word learning at any time as needed, which simplifies the search operation of word learning, makes learning more convenient, and enhances the user experience.
  • the first segment is a video segment corresponding to the first text unit, or the first segment is a video segment corresponding to the entire sentence where the first text unit is located.
  • the video segment from the start time to the end time corresponding to the keyword input by the user is repeatedly played on the first interface.
  • alternatively, the entire sentence containing the keyword entered by the user is repeatedly played on the first interface, that is, the video segment from the start time to the end time of that sentence is repeated.
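  • as a minimal sketch of the two segment choices described above (word-level or sentence-level), the following Python snippet picks the clip to repeat from per-word timings. The `WordTiming` record, the `segment_for` function, and the sample timings are hypothetical names and values introduced here for illustration; the application does not prescribe a data layout.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class WordTiming:
    """One subtitle word with its start and end time in seconds (assumed layout)."""
    word: str
    start: float
    end: float


def segment_for(sentence: List[WordTiming], keyword: str,
                whole_sentence: bool) -> Tuple[float, float]:
    """Return the (start, end) of the clip to repeat for `keyword`.

    whole_sentence=False: only the span of the keyword itself.
    whole_sentence=True: the span of the entire sentence containing it.
    """
    match = next((w for w in sentence if w.word.lower() == keyword.lower()), None)
    if match is None:
        raise ValueError(f"{keyword!r} does not occur in this sentence")
    if whole_sentence:
        return sentence[0].start, sentence[-1].end
    return match.start, match.end


# Illustrative timings only.
sentence = [
    WordTiming("leave", 12.0, 12.4),
    WordTiming("a", 12.4, 12.5),
    WordTiming("message", 12.5, 13.1),
]
word_clip = segment_for(sentence, "message", whole_sentence=False)
sentence_clip = segment_for(sentence, "message", whole_sentence=True)
```

  • in this sketch the sentence-level clip simply runs from the start of the first word to the end of the last word of the sentence.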
  • the number of times the first segment of the first video is repeatedly played is preset by the system or set by the user.
  • the number of repeated playbacks may be set by the user in the background, or may be the system default. If the user does not set it, the number of repeated playbacks may be the system default of 3. This application does not limit this.
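  • the repeat count described above can be sketched as a list of play ranges that a player would step through. This is only an illustration: `playback_plan` is a hypothetical helper, and it assumes that the count is the total number of times the segment is heard (the first pass included), which the text does not specify.

```python
from typing import List, Optional, Tuple

DEFAULT_REPEATS = 3  # the system default mentioned in the text


def playback_plan(video_end: float, seg_start: float, seg_end: float,
                  user_repeats: Optional[int] = None) -> List[Tuple[float, float]]:
    """Return the (start, end) ranges to play, in order.

    Assumption: `repeats` counts every pass over the segment, including the
    first one that happens during normal playback.
    """
    repeats = DEFAULT_REPEATS if user_repeats is None else user_repeats
    if repeats < 1:
        raise ValueError("repeat count must be at least 1")
    if not 0 <= seg_start < seg_end <= video_end:
        raise ValueError("segment must lie inside the video")

    plan = [(0.0, seg_end)]                          # normal playback up to the segment end
    plan += [(seg_start, seg_end)] * (repeats - 1)   # seek back and replay
    if seg_end < video_end:
        plan.append((seg_end, video_end))            # carry on with the rest of the video
    return plan


default_plan = playback_plan(60.0, 10.0, 12.5)
```

  • the video file itself is untouched in this sketch; only the ranges handed to the player change, so the duration of the video is not affected.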
  • the method further includes: before displaying the first interface, displaying a third interface, the third interface displaying the first text unit input by the user, and the third interface including second information and a first video list associated with the first text unit, the first video list including the first video; detecting a second operation of the user on the third interface; and in response to the second operation, displaying the first interface.
  • the first video list further includes a second video, and the second operation is used to select the first video.
  • the method further includes: detecting a third operation of the user on the third interface; and in response to the third operation, displaying a fourth interface, the fourth interface including second information of the first text unit and a second video list, the second video list including at least one video, and the subtitles of each video in the second video list including the first text unit.
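  • a video list in which every video's subtitles include the text unit can be produced with a simple inverted index from subtitle words to videos. The sketch below is an assumption-laden illustration: `build_index`, `videos_containing`, the video identifiers, and the subtitle strings are all hypothetical, and only single-word text units are handled.

```python
import re
from collections import defaultdict
from typing import Dict, List, Set


def build_index(subtitles: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each lowercase subtitle word to the set of videos that contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for video_id, text in subtitles.items():
        for word in re.findall(r"[a-z']+", text.lower()):
            index[word].add(video_id)
    return index


def videos_containing(index: Dict[str, Set[str]], word: str) -> List[str]:
    """Return the videos whose subtitles include `word`, sorted by identifier."""
    return sorted(index.get(word.lower(), set()))


# Illustrative subtitle data only.
subtitles = {
    "movie_a": "Please leave a message.",
    "movie_b": "No new messages today.",
    "movie_c": "The message was clear.",
}
index = build_index(subtitles)
video_list = videos_containing(index, "message")
```

  • the match here is on exact word forms, so "messages" does not count as an occurrence of "message"; a real system might normalize word forms first.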
  • playback of the first video is paused.
  • when the user clicks to close the repeat setting control or the subtitle setting control, or closes the word analysis box of the key word to exit the learning mode of the word, the video can automatically continue to play without requiring the user to click the playback control again. Alternatively, the video stays paused, and the user can click the playback control on the video display interface to continue playing the video. This application does not limit this.
  • the method for repeating the words in the video provided above lets the user learn English words during ordinary viewing. The user can click to enter word learning at any time as needed, which simplifies the search operation of word learning, makes learning more convenient, and improves the user experience.
  • the present application also provides a word learning method that can provide users with centralized learning of multiple words included in a certain vocabulary set.
  • the subtitles of the learning video and the progress bar of the learning video are presented at different positions in the playback interface. If both were displayed in the same area of the playback interface, clicking a word contained in the subtitles might work poorly; for example, the user might trigger the progress bar by mistake while trying to click the word. This situation is particularly prominent when the display screen of the electronic device on which the user watches the learning video is small, or when the interface for playing the learning video is small. Therefore, the subtitles of the learning video and the progress bar of the learning video are displayed at different positions in the playback interface.
  • for example, the progress bar of the video playback is displayed at the top of the screen, and the subtitles are displayed at the bottom of the screen. They may also be displayed at other positions in the playback interface. Displaying them at different positions can improve the sensitivity of user operations and improve the user experience. This application does not limit the positions at which the video playback progress bar and the subtitles are displayed.
  • the display effect of the first text unit is different from the display effect of the second text unit.
  • the words learned by the user are highlighted in the subtitles.
  • for example, when the word "message" that the user wants to learn appears in the subtitles, "message" is displayed differently from the other words in the subtitles, to remind the user of the position of the word and to draw attention to its pronunciation.
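  • the different display effect for the learned word can be sketched as a text transformation applied to each subtitle line before rendering. The `highlight` function and the bracket markers below are hypothetical; a real player would apply a visual style rather than brackets.

```python
import re


def highlight(subtitle: str, word: str, left: str = "[", right: str = "]") -> str:
    """Wrap whole-word, case-insensitive occurrences of `word` in markers."""
    pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
    return pattern.sub(lambda m: f"{left}{m.group(0)}{right}", subtitle)


marked = highlight("Did you get my Message?", "message")
```

  • the word-boundary match keeps other words, such as "messages", in the normal style.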
  • the user when the user is relaxing and watching the movie, if he wants to learn a certain word in the subtitle, he can click the word in the subtitle through the above method to enter the learning mode of the word.
  • the user may need to learn some key words in a targeted manner, for example, the user needs to learn multiple words in a certain vocabulary set, and the vocabulary set may be English level 4 or 6 vocabulary or IELTS vocabulary.
  • the present application also provides a word learning method that lets users learn, in a centralized way, multiple words included in a certain vocabulary set. That is, for a video resource, all words of the video resource that belong to a vocabulary set can be extracted in advance. Before selecting a movie, the user can view all the key words included in each movie and click to select the key words to be learned, or the user can select the movie resource according to the number of key words, for example, select the movie that includes the most key words as the currently watched movie, and click to enter the movie learning mode.
  • users can watch an English video with its English subtitles while using the word index, the player's rewind capability, and so on, to repeat and read along with English words, which improves the user's English learning effect and the user experience.
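  • the pre-extraction of vocabulary-set words and the choice of the movie with the most key words can be sketched as a set intersection followed by a maximum. The function names, the tiny vocabulary set, and the subtitle strings are hypothetical and serve only as an illustration.

```python
import re
from typing import Dict, Set


def keywords_in(subtitle_text: str, vocabulary: Set[str]) -> Set[str]:
    """Return the vocabulary-set words that occur in the subtitle text."""
    words = set(re.findall(r"[a-z']+", subtitle_text.lower()))
    return words & vocabulary


def movie_with_most_keywords(movies: Dict[str, str], vocabulary: Set[str]) -> str:
    """Return the title whose subtitles contain the most distinct key words.

    On a tie, the title that comes first in `movies` is returned.
    """
    return max(movies, key=lambda title: len(keywords_in(movies[title], vocabulary)))


# Illustrative data only.
vocabulary = {"message", "abandon", "ability"}
movies = {
    "movie_a": "Leave a message.",
    "movie_b": "He has the ability to abandon the message.",
}
best = movie_with_most_keywords(movies, vocabulary)
```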
  • an electronic device including: one or more processors; a memory; multiple application programs; and one or more programs, wherein the one or more programs are stored in the memory, and when the one or more programs are executed by the processor, the electronic device is caused to perform the following steps: displaying a first interface, the first interface displaying the first video being played and the subtitles of the first video, the subtitles of the first video including a first text unit and a second text unit; when the first segment of the first video corresponding to the first text unit is played, automatically and repeatedly playing the first segment on the first interface; detecting a first operation of the user on the first interface; and in response to the first operation, displaying a second interface, the second interface displaying the first information associated with the first text unit.
  • the first segment is a video segment corresponding to the first text unit, or the first segment is a video segment corresponding to the entire sentence where the first text unit is located.
  • when the one or more programs are executed by the processor, the electronic device is caused to perform the following steps: before displaying the first interface, displaying a third interface, the third interface displaying the first text unit input by the user, and the third interface including second information associated with the first text unit and a first video list, the first video list including the first video; detecting a second operation of the user on the third interface; and in response to the second operation, displaying the first interface.
  • the first video list further includes a second video
  • the second operation is used to select the first video
  • when the one or more programs are executed by the processor, the electronic device is caused to perform the following steps: detecting a third operation of the user on the third interface; and in response to the third operation, displaying a fourth interface, the fourth interface including the second information of the first text unit and a second video list, the second video list including at least one video, and the subtitles of each video in the second video list including the first text unit.
  • playback of the first video is paused.
  • the number of times the first segment of the first video is repeatedly played is preset by the system or set by the user.
  • the display effect of the first text unit is different from the display effect of the second text unit.
  • the present application provides an apparatus, which is included in an electronic device, and the apparatus has a function of implementing the above aspect and the possible implementation manners of the above aspect.
  • the function can be realized by hardware, and can also be realized by hardware executing corresponding software.
  • the hardware or software includes one or more modules or units corresponding to the above functions. For example, display modules or units, detection modules or units, processing modules or units, etc.
  • the present application provides an electronic device, including: a touch display screen, wherein the touch display screen includes a touch-sensitive surface and a display; a camera; one or more processors; a memory; a plurality of application programs; and one or more computer programs.
  • the one or more computer programs are stored in the memory, and the one or more computer programs include instructions.
  • when the instructions are executed by the electronic device, the electronic device is caused to execute the video playback method in any possible implementation of any one of the above aspects.
  • the present application provides an electronic device, including one or more processors and one or more memories.
  • the one or more memories are coupled to one or more processors.
  • the one or more memories are used to store computer program code.
  • the computer program codes include computer instructions.
  • when the one or more processors execute the computer instructions, the electronic device is caused to perform the video playback method in any possible implementation of any of the above aspects.
  • the present application provides a computer storage medium, including computer instructions, which, when the computer instructions run on an electronic device, cause the electronic device to perform any possible video playback method of any one of the above aspects.
  • the present application provides a computer program product that, when the computer program product runs on an electronic device, causes the electronic device to perform any possible video playback method according to any one of the above aspects.
  • FIG. 1 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
  • FIG. 2 is a schematic diagram of a software structure of an electronic device provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of a user interface for realizing word repetition in a video provided by an embodiment of the present application.
  • FIG. 4 is a schematic diagram of another example of a user interface for learning words during movie watching provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of another example of a user interface for learning words during movie viewing provided by an embodiment of the present application.
  • FIG. 6 is a schematic diagram of an example of an HMM model provided by an embodiment of the present application.
  • FIG. 7 is an implementation flowchart of an example of an acoustic model generation and forced alignment process provided by this application.
  • FIG. 8 is a flowchart of an example of generating a word time series provided by an embodiment of the present application.
  • FIG. 9 is a schematic diagram of an example of a content association index provided by an embodiment of the present application.
  • FIG. 10 is a flowchart of an implementation of a word or sentence repetition process provided by an embodiment of the present application.
  • FIG. 11 is a schematic diagram of an implementation process of a method for implementing word or sentence repetition in a video provided by an embodiment of the present application.
  • FIG. 12 is a schematic flowchart of a video playback method provided by an embodiment of the present application.
  • FIG. 13 is a schematic diagram of an example of an electronic device provided by an embodiment of the present application.
  • the terms "first" and "second" are used for description purposes only, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features.
  • the features defined as "first" and "second" may explicitly or implicitly include one or more of the features.
  • the meaning of "plurality" is two or more.
  • the embodiment of the present application provides a method for repeating words or sentences in a video, which can be applied to an electronic device or to a separate application program that implements the method for repeating words or sentences of the present application.
  • the user can use the word index, the player's rewind capability, and so on, to repeat and read along with English words, which improves the user's English learning effect and the user experience.
  • the method for realizing the repetition of words or sentences in the video can be applied to electronic devices such as mobile phones, tablet computers, wearable devices, vehicle-mounted devices, augmented reality (AR)/virtual reality (VR) devices, notebook computers, ultra-mobile personal computers (UMPCs), netbooks, and personal digital assistants (PDAs).
  • the embodiments of the present application do not limit the specific types of electronic devices.
  • FIG. 1 shows a schematic structural diagram of an electronic device 100.
  • the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headphone jack 170D, a sensor module 180, a key 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and the like.
  • the sensor module 180 may include a pressure sensor 180A, a gyro sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
  • the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100.
  • the electronic device 100 may include more or fewer components than shown, or combine some components, or split some components, or arrange different components.
  • the illustrated components can be implemented in hardware, software, or a combination of software and hardware.
  • the processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU).
  • different processing units may be independent devices, or may be integrated in one or more processors.
  • the controller may be the nerve center and command center of the electronic device 100.
  • the controller can generate the operation control signal according to the instruction operation code and the timing signal to complete the control of fetching instructions and executing instructions.
  • the processor 110 may also be provided with a memory for storing instructions and data.
  • the memory in the processor 110 is a cache memory.
  • the memory may store instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs to use the instructions or data again, they can be called directly from the memory. This avoids repeated access and reduces the waiting time of the processor 110, thereby improving the efficiency of the system.
  • the processor 110 may include one or more interfaces.
  • the interfaces can include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface.
  • the I2C interface is a bidirectional synchronous serial bus, including a serial data line (SDA) and a serial clock line (SCL).
  • the processor 110 may include multiple sets of I2C buses.
  • the processor 110 may respectively couple the touch sensor 180K, the charger, the flash, the camera 193, etc. through different I2C bus interfaces.
  • the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to realize the touch function of the electronic device 100.
  • the I2S interface can be used for audio communication.
  • the processor 110 may include multiple sets of I2S buses.
  • the processor 110 may be coupled to the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170.
  • the audio module 170 can transfer audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering the call through the Bluetooth headset.
  • the PCM interface can also be used for audio communication, sampling, quantizing and encoding analog signals.
  • the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
  • the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface to realize the function of answering the phone call through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
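  • the sampling, quantizing, and encoding steps named above for PCM can be illustrated with a short Python sketch. It assumes a uniform quantizer, an input range of -1.0 to 1.0, and unsigned integer codes; `pcm_encode` is a hypothetical helper and does not describe the codec of the electronic device 100.

```python
from typing import Callable, List


def pcm_encode(signal: Callable[[float], float], sample_rate: int,
               duration: float, bits: int = 8) -> List[int]:
    """Sample `signal` at `sample_rate` Hz and encode each sample as an unsigned code.

    Sampling: evaluate the signal at evenly spaced instants.
    Quantizing: map the clipped value in [-1.0, 1.0] onto 2**bits levels.
    Encoding: represent each level by its integer code (0 .. 2**bits - 1).
    """
    levels = 2 ** bits
    codes = []
    for i in range(int(sample_rate * duration)):
        t = i / sample_rate
        x = max(-1.0, min(1.0, signal(t)))                      # clip to the assumed range
        code = min(levels - 1, int((x + 1.0) / 2.0 * levels))   # uniform quantizer
        codes.append(code)
    return codes


silence = pcm_encode(lambda t: 0.0, sample_rate=8, duration=1.0)
```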
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • the UART interface is generally used to connect the processor 110 and the wireless communication module 160.
  • the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
  • the MIPI interface can be used to connect the processor 110 to peripheral devices such as the display screen 194 and the camera 193.
  • the MIPI interface includes a camera serial interface (CSI), a display serial interface (DSI), and so on.
  • the processor 110 and the camera 193 communicate through a CSI interface to implement the shooting function of the electronic device 100.
  • the processor 110 and the display screen 194 communicate through the DSI interface to realize the display function of the electronic device 100.
  • the GPIO interface can be configured via software.
  • the GPIO interface can be configured as a control signal or a data signal.
  • the GPIO interface may be used to connect the processor 110 to the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like.
  • GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface 130 is an interface that conforms to the USB standard, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and so on.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transfer data between the electronic device 100 and peripheral devices. It can also be used to connect headphones and play audio through the headphones.
  • the interface can also be used to connect other electronic devices, such as AR devices.
  • the interface connection relationship between the modules illustrated in the embodiments of the present application is only a schematic description, and does not constitute a limitation on the structure of the electronic device 100.
  • the electronic device 100 may also use different interface connection methods in the foregoing embodiments, or a combination of multiple interface connection methods.
  • the charging management module 140 is used to receive charging input from the charger.
  • the charger may be a wireless charger or a wired charger.
  • the charging management module 140 may receive the charging input of the wired charger through the USB interface 130.
  • the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device 100. While the charging management module 140 charges the battery 142, it can also supply power to the electronic device through the power management module 141.
  • the power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110.
  • the power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, internal memory 121, external memory, display screen 194, camera 193, wireless communication module 160, and the like.
  • the power management module 141 can also be used to monitor battery capacity, battery cycle times, battery health status (leakage, impedance) and other parameters.
  • the power management module 141 may also be disposed in the processor 110.
  • the power management module 141 and the charging management module 140 may also be set in the same device.
  • the wireless communication function of the electronic device 100 can be realized by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, and the baseband processor.
  • Antenna 1 and antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in the electronic device 100 can be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization.
  • the antenna 1 can be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
  • the mobile communication module 150 can provide a wireless communication solution including 2G/3G/4G/5G and the like applied to the electronic device 100.
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), and so on.
  • the mobile communication module 150 can receive electromagnetic waves from the antenna 1 and filter, amplify, etc. the received electromagnetic waves, and transmit them to the modem processor for demodulation.
  • the mobile communication module 150 can also amplify the signal modulated by the modem processor and convert it into electromagnetic waves that are radiated through the antenna 1.
  • at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110.
  • at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be provided in the same device.
  • the modem processor may include a modulator and a demodulator.
  • the modulator is used to modulate the low-frequency baseband signal to be transmitted into a high-frequency signal.
  • the demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal.
  • the demodulator then transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
  • the low-frequency baseband signal is processed by the baseband processor and then passed to the application processor.
  • the application processor outputs a sound signal through an audio device (not limited to a speaker 170A, a receiver 170B, etc.), or displays an image or video through a display screen 194.
  • the modem processor may be an independent device.
  • the modem processor may be independent of the processor 110, and may be set in the same device as the mobile communication module 150 or other functional modules.
  • the wireless communication module 160 can provide wireless communication solutions applied to the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology.
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2, frequency-modulates and filters electromagnetic wave signals, and transmits the processed signals to the processor 110.
  • the wireless communication module 160 may also receive the signal to be transmitted from the processor 110, frequency-modulate it, amplify it, and convert it into electromagnetic waves that are radiated through the antenna 2.
  • the antenna 1 of the electronic device 100 and the mobile communication module 150 are coupled, and the antenna 2 and the wireless communication module 160 are coupled so that the electronic device 100 can communicate with the network and other devices through wireless communication technology.
  • the wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division synchronous code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technology, etc.
  • the GNSS may include a global positioning system (GPS), a global navigation satellite system (GLONASS), a beidou navigation satellite system (BDS), a quasi-zenith satellite system (QZSS), and/or satellite-based augmentation systems (SBAS).
  • the electronic device 100 realizes a display function through a GPU, a display screen 194, and an application processor.
  • the GPU is a microprocessor for image processing, connecting the display screen 194 and the application processor.
  • the GPU is used to perform mathematical and geometric calculations, and is used for graphics rendering.
  • the processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
  • the display screen 194 is used to display images, videos and the like.
  • the display screen 194 includes a display panel.
  • the display panel may use a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Mini-LED, Micro-LED, Micro-OLED, quantum dot light-emitting diodes (QLED), etc.
  • the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
  • the electronic device 100 can realize a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
  • the ISP processes the data fed back by the camera 193. For example, when taking a picture, the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, and the optical signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, which is converted into an image visible to the naked eye.
  • ISP can also optimize the image noise, brightness, and skin color. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
  • the ISP may be set in the camera 193.
  • the camera 193 is used to capture still images or video.
  • the object generates an optical image through the lens and projects it onto the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • the photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signal to the DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other image signals.
  • the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
  • the digital signal processor (DSP) is used to process digital signals. In addition to digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor performs a Fourier transform on the energy of that frequency point.
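  • the source does not say how the Fourier transform at a frequency point is computed. As a minimal sketch, assuming a single-bin discrete Fourier transform over a block of real-valued samples, the energy at one frequency point could be estimated as follows. The function name `bin_energy`, the block length, and the test tone are illustrative assumptions, not part of the application.

```python
import cmath
import math


def bin_energy(samples, k):
    """Return |X[k]|^2, the energy of DFT bin k for a block of real samples.

    Assumption: the "frequency point" corresponds to one DFT bin index k.
    """
    n = len(samples)
    acc = 0j
    for i, x in enumerate(samples):
        # Correlate the block with the complex exponential of bin k.
        acc += x * cmath.exp(-2j * math.pi * k * i / n)
    return abs(acc) ** 2


# Hypothetical test signal: a pure tone that falls exactly on bin 4
# of a 64-sample block.
N = 64
tone = [math.sin(2 * math.pi * 4 * i / N) for i in range(N)]
on_bin = bin_energy(tone, 4)   # energy at the tone's own frequency point
off_bin = bin_energy(tone, 9)  # energy at an unrelated frequency point
```

  • in this sketch the energy concentrates in the bin that matches the tone, which is the property a frequency-point selection step would rely on.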
  • Video codec is used to compress or decompress digital video.
  • the electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in various encoding formats, for example: moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
  • NPU is a neural-network (NN) computing processor.
  • the NPU can realize applications such as intelligent recognition of the electronic device 100, such as image recognition, face recognition, voice recognition, and text understanding.
  • the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.
  • the external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example, save music, video and other files in an external memory card.
  • the internal memory 121 may be used to store computer executable program code, where the executable program code includes instructions.
  • the processor 110 executes instructions stored in the internal memory 121 to execute various functional applications and data processing of the electronic device 100.
  • the internal memory 121 may include a storage program area and a storage data area.
  • the storage program area may store an operating system and application programs required by at least one function (such as a sound playback function, an image playback function, etc.).
  • the storage data area may store data (such as audio data, phone book, etc.) created during use of the electronic device 100 and the like.
  • the internal memory 121 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one disk storage device, a flash memory device, a universal flash storage (UFS), and so on.
  • the electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headphone interface 170D, and an application processor. For example, music playback, recording, etc.
  • the audio module 170 is used to convert digital audio information into analog audio signal output, and also used to convert analog audio input into digital audio signal.
  • the audio module 170 can also be used to encode and decode audio signals.
  • the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.
  • the speaker 170A, also called a "loudspeaker", is used to convert audio electrical signals into sound signals.
  • the user can listen to music or to a hands-free call through the speaker 170A of the electronic device 100.
  • the receiver 170B, also called an "earpiece", is used to convert audio electrical signals into sound signals.
  • the voice can be received by bringing the receiver 170B close to the ear.
  • the microphone 170C, also called a "mic" or "mike", is used to convert sound signals into electrical signals.
  • the user can speak with the mouth close to the microphone 170C to input a sound signal to the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C. In addition to collecting sound signals, it may also implement a noise reduction function. In other embodiments, the electronic device 100 may further include three, four, or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
  • the headset interface 170D is used to connect wired headsets.
  • the headset interface 170D may be a USB interface 130, a 3.5 mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • the pressure sensor 180A is used to sense the pressure signal and can convert the pressure signal into an electrical signal.
  • the pressure sensor 180A may be provided on the display screen 194.
  • the capacitive pressure sensor may be at least two parallel plates with conductive materials. When force is applied to the pressure sensor 180A, the capacitance between the electrodes changes.
  • the electronic device 100 determines the strength of the pressure according to the change in capacitance.
  • the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
  • the electronic device 100 may also calculate the touched position based on the detection signal of the pressure sensor 180A.
  • touch operations that act on the same touch position but have different touch operation intensities may correspond to different operation instructions. For example, when a touch operation with a touch operation intensity less than the first pressure threshold acts on the short message application icon, an instruction to view the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, an instruction to create a new short message is executed.
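  • the pressure-dependent dispatch described above can be sketched as follows. This is a minimal illustration only: the threshold value, the normalized intensity scale, and the instruction names are assumptions, since the source names a "first pressure threshold" without giving a value.

```python
# Hypothetical threshold on a normalized 0.0-1.0 intensity scale.
FIRST_PRESSURE_THRESHOLD = 0.5


def instruction_for_sms_icon(intensity, threshold=FIRST_PRESSURE_THRESHOLD):
    """Map the intensity of a touch on the short message icon to an instruction."""
    if intensity < threshold:
        # Intensity less than the first pressure threshold: view the short message.
        return "view_short_message"
    # Intensity greater than or equal to the threshold: create a new short message.
    return "create_short_message"
```

  • the same touch position therefore yields two different instructions, selected only by the measured intensity.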
  • the gyro sensor 180B may be used to determine the movement posture of the electronic device 100. In some embodiments, the angular velocity of the electronic device 100 around three axes (ie, x, y, and z axes) may be determined by the gyro sensor 180B.
  • the gyro sensor 180B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 180B detects the jitter angle of the electronic device 100, calculates the distance that the lens module needs to compensate based on the angle, and allows the lens to counteract the jitter of the electronic device 100 through reverse movement to achieve anti-shake.
  • the gyro sensor 180B can also be used for navigation and somatosensory game scenes.
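  • the source states that the compensation distance is calculated from the jitter angle but gives no formula. A minimal sketch, assuming the simple geometric model in which the image shift equals the focal length times the tangent of the jitter angle and the lens moves by the same amount in the opposite direction, is shown below. The function name and units are illustrative assumptions.

```python
import math


def lens_compensation_mm(jitter_angle_deg, focal_length_mm):
    """Return the signed lens displacement that counteracts a jitter angle.

    Assumed model (not from the source): image shift = f * tan(angle),
    and the lens module moves in the reverse direction by that amount.
    """
    shift = focal_length_mm * math.tan(math.radians(jitter_angle_deg))
    return -shift
```

  • a positive jitter angle produces a negative displacement, reflecting the reverse movement of the lens.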
  • the air pressure sensor 180C is used to measure air pressure.
  • the electronic device 100 calculates the altitude using the air pressure value measured by the air pressure sensor 180C to assist positioning and navigation.
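  • the source does not specify how altitude is derived from the air pressure value. One common approximation is the international barometric formula, sketched below under the assumption of a standard sea-level pressure of 101325 Pa. The function name and default reference pressure are illustrative, not taken from the application.

```python
# Standard sea-level pressure in pascals (assumed reference, not from the source).
SEA_LEVEL_PA = 101325.0


def altitude_m(pressure_pa, sea_level_pa=SEA_LEVEL_PA):
    """Estimate altitude in metres from measured air pressure.

    Uses the international barometric formula:
    h = 44330 * (1 - (p / p0) ** (1 / 5.255))
    """
    return 44330.0 * (1.0 - (pressure_pa / sea_level_pa) ** (1.0 / 5.255))
```

  • lower measured pressure maps to a higher altitude estimate, which can then assist positioning and navigation.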
  • the magnetic sensor 180D includes a Hall sensor.
  • the electronic device 100 can use the magnetic sensor 180D to detect the opening and closing of a flip holster.
  • when the electronic device 100 is a clamshell device, it may detect the opening and closing of the clamshell according to the magnetic sensor 180D.
  • features such as automatic unlocking when the flip cover opens can then be set according to the detected open or closed state of the holster or the clamshell.
  • the acceleration sensor 180E can detect the magnitude of acceleration of the electronic device 100 in various directions (generally three axes). When the electronic device 100 is stationary, the magnitude and direction of gravity can be detected. It can also be used to recognize the posture of electronic devices, and be used in applications such as horizontal and vertical screen switching and pedometers.
  • the distance sensor 180F is used to measure the distance.
  • the electronic device 100 can measure the distance by infrared or laser. In some embodiments, when shooting scenes, the electronic device 100 may use the distance sensor 180F to measure distance to achieve fast focusing.
  • the proximity light sensor 180G may include, for example, a light emitting diode (LED) and a light detector, such as a photodiode.
  • the light emitting diode may be an infrared light emitting diode.
  • the electronic device 100 emits infrared light outward through the light emitting diode.
  • the electronic device 100 uses a photodiode to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it may be determined that there is an object near the electronic device 100. When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100.
  • the electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power.
  • the proximity light sensor 180G can also be used in holster mode and pocket mode to automatically unlock and lock the screen.
  • the ambient light sensor 180L is used to sense the brightness of ambient light.
  • the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness.
  • the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
  • the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket to prevent accidental touch.
  • the fingerprint sensor 180H is used to collect fingerprints.
  • the electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, access to application locks, fingerprint-triggered photographing, and fingerprint-triggered call answering.
  • the temperature sensor 180J is used to detect the temperature.
  • the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 performs performance reduction of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection. In some other embodiments, when the temperature is below another threshold, the electronic device 100 heats the battery 142 to avoid the abnormal shutdown of the electronic device 100 due to the low temperature. In some other embodiments, when the temperature is below another threshold, the electronic device 100 performs boosting on the output voltage of the battery 142 to avoid abnormal shutdown due to low temperature.
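  • the temperature processing strategy above can be sketched as a simple threshold check. The threshold values and action names below are hypothetical, and combining the three behaviours in one function is an assumption: the source describes them as separate embodiments and gives no numeric thresholds.

```python
# Hypothetical thresholds in degrees Celsius (not from the source).
HIGH_TEMP_C = 45.0        # above this, reduce processor performance
LOW_TEMP_HEAT_C = 0.0     # below this, heat the battery
LOW_TEMP_BOOST_C = -10.0  # below this, also boost the battery output voltage


def thermal_actions(temp_c):
    """Return the list of actions for a reported temperature."""
    actions = []
    if temp_c > HIGH_TEMP_C:
        # Thermal protection: lower power consumption near the sensor.
        actions.append("reduce_processor_performance")
    if temp_c < LOW_TEMP_HEAT_C:
        # Avoid abnormal shutdown caused by low temperature.
        actions.append("heat_battery")
    if temp_c < LOW_TEMP_BOOST_C:
        # Avoid abnormal shutdown by boosting the battery output voltage.
        actions.append("boost_battery_output_voltage")
    return actions
```

  • a temperature inside the normal range returns an empty list, so no strategy is triggered.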
  • the touch sensor 180K is also known as a "touch panel".
  • the touch sensor 180K may be provided on the display screen 194, and the touch sensor 180K and the display screen 194 together constitute a touchscreen, also called a "touch-control screen".
  • the touch sensor 180K is used to detect a touch operation acting on or near it.
  • the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
  • the visual output related to the touch operation may be provided through the display screen 194.
  • the touch sensor 180K may also be disposed on the surface of the electronic device 100, which is different from the location where the display screen 194 is located.
  • the bone conduction sensor 180M can acquire vibration signals.
  • the bone conduction sensor 180M can acquire the vibration signal of the bone mass that vibrates when a person speaks.
  • the bone conduction sensor 180M can also contact the human pulse and receive a blood pressure pulsation signal.
  • the bone conduction sensor 180M may also be provided in the earphone and combined into a bone conduction earphone.
  • the audio module 170 may parse out a voice signal from the vibration signal, acquired by the bone conduction sensor 180M, of the bone mass that vibrates when a person speaks, to realize the voice function.
  • the application processor may parse heart rate information from the blood pressure pulsation signal acquired by the bone conduction sensor 180M to implement the heart rate detection function.
  • the key 190 includes a power-on key, a volume key, and the like.
  • the key 190 may be a mechanical key. It can also be a touch button.
  • the electronic device 100 can receive key input and generate key signal input related to user settings and function control of the electronic device 100.
  • the motor 191 may generate a vibration prompt.
  • the motor 191 can be used for vibration notification of incoming calls and can also be used for touch vibration feedback.
  • touch operations applied to different applications may correspond to different vibration feedback effects.
  • the motor 191 can also correspond to different vibration feedback effects for different application scenarios (for example: time reminder, receiving information, alarm clock, game, etc.).
  • Touch vibration feedback effect can also support customization.
  • the indicator 192 may be an indicator light, which may be used to indicate a charging state, a power change, and may also be used to indicate a message, a missed call, a notification, and the like.
  • the SIM card interface 195 is used to connect a SIM card.
  • the SIM card can be inserted into or removed from the SIM card interface 195 to achieve contact and separation with the electronic device 100.
  • the electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
  • the SIM card interface 195 can support Nano SIM cards, Micro SIM cards, SIM cards, etc.
  • multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the multiple cards may be the same or different.
  • the SIM card interface 195 can also be compatible with different types of SIM cards.
  • the SIM card interface 195 can also be compatible with external memory cards.
  • the electronic device 100 interacts with the network through the SIM card to realize functions such as call and data communication.
  • the electronic device 100 uses eSIM, that is, an embedded SIM card.
  • the eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100.
  • the software system of the electronic device 100 may adopt a layered architecture, event-driven architecture, micro-core architecture, micro-service architecture, or cloud architecture.
  • the embodiment of the present application takes an Android system with a layered architecture as an example to exemplarily explain the software structure of the electronic device 100.
  • the layered architecture divides the software into several layers, each of which has a clear role and division of labor.
  • the layers communicate with each other through a software interface.
  • the Android system is divided into four layers, from top to bottom are the application layer, the application framework layer, the Android runtime and the system library, and the kernel layer.
  • the application layer may include a series of application packages.
  • the application package may include applications such as camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, and short message.
  • the application framework layer provides an application programming interface (API) and a programming framework for applications at the application layer.
  • the application framework layer includes some predefined functions.
  • the application framework layer may include a window manager, a content provider, a view system, a phone manager, a resource manager, a notification manager, and so on.
  • the window manager is used to manage window programs.
  • the window manager can obtain the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
  • Content providers are used to store and retrieve data, and make these data accessible to applications.
  • the data may include videos, images, audio, calls made and received, browsing history and bookmarks, phonebooks, etc.
  • the view system includes visual controls, such as controls for displaying text and controls for displaying pictures.
  • the view system can be used to build applications.
  • the display interface can be composed of one or more views.
  • a display interface that includes an SMS notification icon may include a view that displays text and a view that displays pictures.
  • the phone manager is used to provide the communication function of the electronic device 100. For example, the management of the call state (including connection, hang up, etc.).
  • the resource manager provides various resources for the application, such as localized strings, icons, pictures, layout files, video files, and so on.
  • the notification manager enables applications to display notification information in the status bar. It can be used to convey notification-type messages, which can disappear automatically after a short stay without user interaction.
  • for example, the notification manager is used to notify the user that a download is complete, to give message reminders, etc.
  • a notification may also appear in the status bar at the top of the system in the form of a chart or scroll bar text, such as a notification of an application running in the background, or appear on the screen in the form of a dialog window.
  • for example, text information is prompted in the status bar, a prompt sound is emitted, the electronic device vibrates, or the indicator light flashes.
  • Android runtime includes core library and virtual machine. Android runtime is responsible for the scheduling and management of the Android system.
  • the core library contains two parts: one part is the functions that the Java language needs to call, and the other part is the core library of Android.
  • the application layer and the application framework layer run in the virtual machine.
  • the virtual machine executes the Java files of the application layer and the application framework layer as binary files.
  • the virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, security and exception management, and garbage collection.
  • the system library may include multiple functional modules, for example: a surface manager, a media library, a 3D graphics processing library (e.g., OpenGL ES), a 2D graphics engine (e.g., SGL), etc.
  • the surface manager is used to manage the display subsystem and provides a combination of 2D and 3D layers for multiple applications.
  • the media library supports a variety of commonly used audio, video format playback and recording, and still image files.
  • the media library can support multiple audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
  • the 3D graphics processing library is used to realize 3D graphics drawing, image rendering, synthesis, and layer processing.
  • the 2D graphics engine is a drawing engine for 2D drawing.
  • the kernel layer is the layer between hardware and software.
  • the kernel layer contains at least the display driver, camera driver, audio driver, and sensor driver.
  • the following embodiments take an electronic device having the structure shown in FIG. 1 and FIG. 2 as an example and, with reference to the accompanying drawings and application scenarios, specifically explain the method for repeating words in a video provided by the embodiments of the present application.
  • this application proposes a method for repeating words or sentences in a video. Based on the English subtitles of the video, the method lets the user repeat and read along with English words while watching an English video, which improves the user's English learning effect and the user experience.
  • FIG. 3 is a schematic diagram of an example of a graphical user interface (GUI) for repetition of words in a video provided by an embodiment of the present application.
  • this application uses a mobile phone as the electronic device to introduce in detail the method for repeating words or sentences in a video provided by the present application.
  • FIG. 3(a) shows that in the unlocking mode of the mobile phone, the screen display system of the mobile phone displays the currently output interface content 301, which is the main interface of the mobile phone.
  • the interface content 301 displays various third-party applications (apps), such as Alipay, task card store, photo album, WeChat, card package, settings, camera, and the English learning application provided in the embodiments of the present application, for example, Fun V English shown in (a) in FIG. 3. It should be understood that the interface content 301 may also include more applications, which is not limited in this application.
  • a user operation is input to the English learning application.
  • the user operation may include a user's click operation on the icon of the English learning application displayed on the mobile phone.
  • the main interface of the English learning application shown in (b) in FIG. 3 is entered.
  • the main interface may include multiple functional areas, such as a daily recommended English word learning area for listing some words and corresponding videos, and the user may receive daily pushes and click to learn the word.
  • The top area of the main interface includes a search box 302, a browsing record control 303, and a message reminder control 304.
  • The search box 302 is used by the user to enter a word and enter the learning mode of the word;
  • the browsing record control 303 is used to record the user's search and learning history, so that the user can quickly find words already learned;
  • the message reminder control 304 may include news pushed by the system, and so on.
  • the main interface may also include scene classification areas, such as different scene classifications such as restaurants, taxis, airplanes, conferences, airports, hotels, and shopping malls shown in (b) in FIG. 3. The user can click to select any scene, enter the scene category, and select English learning videos to learn.
  • the interface automatically displays the navigation bar 305 of the word, and the user clicks the navigation bar 305 of "message” to enter the word analysis interface shown in (d) of FIG. 3.
  • The "message" analysis interface includes the British and American pronunciations, Chinese interpretation, Chinese and English example sentences, and video example sentences.
  • the detailed content presented on the word analysis interface may come from the English dictionary built into the system itself, or may be associated with other English online dictionaries, etc., which is not limited in this application.
  • the word analysis interface also includes a word addition control 306. Clicking the word addition control 306 can add a word to the word book. The user can click the word in the word book to quickly enter the word analysis interface, simplifying the search operation.
  • the video example sentence area may display all learning video resources related to the word.
  • the learning video resource may include video resources under different scene classifications.
  • the list of English learning videos in different scene classifications is obtained.
  • the user can click to select any scene to obtain the English learning videos in the scene classification for learning.
  • the user can click on different scene categories such as restaurants, taxis, and airplanes to view the video resource of the word in different scenes. This application does not limit the classification of video resources.
  • The video learning mode interface is shown in (e) in FIG. 3. For example, the video is a 36-second clip excerpted from the film "When Happiness Comes to Knock".
  • the video learning mode interface includes a video playback area for playing learning videos related to "message”; it also includes a word analysis interface that displays the interpretation of "message” in detail, which is convenient for users to combine video scenes and Chinese interpretation at the same time.
  • The recommended related videos may be learning videos related to the word currently being played, for example, learning videos in other scenes related to "message", or other videos in the same scene as the current one. For example, if the learning video currently playing for "message" belongs to the restaurant scene category, the recommended related videos are also other learning videos under the restaurant scene classification. This application does not limit this.
  • the user can view the position of the time slice where the English word to be learned is located through the mark in the video playback progress bar.
  • the learning video may include multiple identifiers, and the number of the identifiers matches the number of times the word appears in the video.
  • the user can control the progress of the video playback by dragging the progress bar. For example, when the learning video is long, the user can drag the progress bar to the mark near the word for playback.
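  • As an illustration of the progress-bar marks described above, the following minimal Python sketch converts the start times of a word's occurrences into mark positions on the progress bar. The function name `marker_positions` and the sample timings are hypothetical and are not part of the application; the sketch only assumes that each occurrence is known as a (start, end) pair in milliseconds.

```python
from typing import List, Tuple


def marker_positions(occurrences_ms: List[Tuple[int, int]], duration_ms: int) -> List[float]:
    """Return one mark per occurrence, as a fraction (0.0 to 1.0) of the progress bar."""
    if duration_ms <= 0:
        raise ValueError("duration_ms must be positive")
    # One mark per occurrence, so the number of marks equals the number of
    # times the word appears in the video.
    return [start / duration_ms for start, _end in occurrences_ms]


# Hypothetical data: the word appears twice in a 36-second clip.
marks = marker_positions([(9000, 9600), (27000, 27500)], 36_000)
print(marks)  # [0.25, 0.75]
```

  • A player could draw each mark at `fraction * bar_width` pixels, so that dragging the progress bar to a mark lands just before the word.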
  • The word being learned by the user is highlighted during the video playback.
  • When the word "message" that the user wants to learn appears in the subtitles, it is displayed differently from the other words in the subtitles to remind the user of the word's position.
  • the video playback area of "message” includes a repeat setting control 306, as shown in (f) of FIG. 3.
  • a repeat setting box 307 shown in (g) in FIG. 3 may pop up in the video playback area.
  • The repeat setting box includes a loop count setting option and a subtitle setting option.
  • The user can select the content to be repeated through the loop count setting option. For example, the user can click "None" to select the no-repeat mode, in which no word or sentence is repeated during playback of the current learning video. Alternatively, the user can click "Word" to select the word repeat mode, in which, during playback of the current "message" learning video, the audio and video frames corresponding to "message" are played in a loop when "message" is reached. Alternatively, the user can click "Sentence" to select the sentence repeat mode, in which, during playback of the current "message" learning video, the audio and video frames corresponding to a sentence containing "message" are played in a loop when that sentence is reached.
  • For example, the sentence containing "message" is: "Yes, I'd like to leave message for Mr. Jay Twistle".
  • The audio and video frames corresponding to this sentence are played cyclically.
  • the number of loop playbacks may be set by the user in the background, or may be the system default. In the case that the user does not set the number of looped playbacks, the number of looped playbacks may be the system default 3 times. This application does not limit this.
  • the user can set the presentation form of the subtitles through the subtitle setting options. For example, the user can click the first control of the subtitles in the repeat setting box 307, which corresponds to the no subtitles mode, that is, no subtitles in Chinese or English are displayed during the learning video playback process. Alternatively, the user can click the second control "A" of the subtitle in the repeat setting box 307, which corresponds to the English subtitle mode, that is, only English subtitles are displayed during the learning video playback process. Alternatively, the user can click the third control "A+" of the subtitle in the repeat setting box 307, which corresponds to the full subtitle mode, that is, English subtitles and Chinese subtitles are simultaneously displayed during the learning video playback process.
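  • The repeat and subtitle options described above can be sketched as a small settings object. This is only an illustrative Python sketch: the names `RepeatMode`, `SubtitleMode`, and `RepeatSettings` are hypothetical, and the default subtitle mode chosen here is an assumption; the text only states that the default number of loops may be 3.

```python
from dataclasses import dataclass
from enum import Enum


class RepeatMode(Enum):
    NONE = "none"          # no word or sentence is repeated
    WORD = "word"          # loop the audio/video frames of the word
    SENTENCE = "sentence"  # loop the audio/video frames of the sentence


class SubtitleMode(Enum):
    OFF = "off"      # first control: no subtitles
    ENGLISH = "A"    # second control: English subtitles only
    FULL = "A+"      # third control: English and Chinese subtitles


DEFAULT_LOOP_COUNT = 3  # system default when the user sets nothing


@dataclass
class RepeatSettings:
    mode: RepeatMode = RepeatMode.NONE
    loop_count: int = DEFAULT_LOOP_COUNT
    subtitles: SubtitleMode = SubtitleMode.FULL  # assumed default, not from the text

    def loops_for_playback(self) -> int:
        """Number of loops the player should perform for the current mode."""
        return 0 if self.mode is RepeatMode.NONE else self.loop_count


settings = RepeatSettings(mode=RepeatMode.WORD)
print(settings.loops_for_playback())  # 3
```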
  • When the user closes the repeat setting control, or closes the word analysis box of the key word to exit the learning mode of the word, the video can continue to play; alternatively, the video remains paused, and the user can click the playback control on the video display interface to continue playing the video. This application does not limit this.
  • The method for repeating words or sentences in a video described above can provide users with an environment more conducive to learning English. With this method, the user can select video resources in different scenes according to the English words to be learned. During video learning, based on the English subtitles of the video, the user can repeat and read along with English words while watching the English video, which improves the user's English learning effect and the user experience.
  • FIG. 4 is a schematic diagram of another example of a user interface for learning words during a viewing process provided by the present application, which will be described below in conjunction with FIG. 4.
  • FIG. 4(a) shows that in the unlocking mode of the mobile phone, the screen display system of the mobile phone displays the currently output interface content 401, which is the main interface of the mobile phone.
  • the interface content 401 shows a variety of third-party applications, including applications for users to watch movies, such as Huawei’s film and television application Huawei Movies.
  • The interface may include various classified film and television resources, as well as various recommended film and television resources, such as the featured recommended movie "When Happiness Comes to Knock". The user clicks the recommended movie to enter the play mode of the movie.
  • The word being played is highlighted in the subtitles below the video.
  • For example, "close" in the subtitle is displayed differently from the other words in the subtitle, which makes clear the position of each word in the audio and in the subtitles and makes it convenient for users to learn the pronunciation and interpretation of the word.
  • the movie playback interface may include a loop number setting control 402 and a subtitle setting control 403.
  • the user can select the content to be repeated through the loop number setting control. For example, the user can click "None" to set to no repeat mode, that is, the word or sentence is not repeated during the current learning video playback process. Alternatively, the user can click "Word” to set the word repeat mode. Or, the user can click "sentence” to set the sentence repetition mode, that is, in the current learning video playback process, after the user sets, the audio and video frames corresponding to the sentence before or after the setting will be cyclically played.
  • the repeat setting at this time may default to the last word or the corresponding audio and video frames of the time slice when the user starts the setting, or may default to the previous word of the time slice when the user starts the setting or
  • the user can also change the relationship between the repeated words or sentences and the time slice in which the setting mode is turned on. This application does not limit this.
  • The user may not hear a sentence or word clearly, or may want to learn the sentence or word.
  • The user can directly click the repeat setting control, set the repeat mode and the number of repetitions, and exit the settings; the previous sentence or word is then repeated directly.
  • The user may encounter unfamiliar words.
  • The user can click the unfamiliar word in the video subtitle.
  • For example, the user can click "close" in the subtitle of the video playback interface to enter the interface shown in (e) of FIG. 4. That is, the user can enter the learning mode of a word by clicking the word in the subtitle. As shown in (e) in FIG. 4, the analysis of the word "close" is displayed in the pop-up box 404, and the pop-up box 404 includes a "details" control 405 and an "add to wordbook" control 406.
  • The "details" control 405 is used for the user to quickly enter the analysis interface of the word shown in (f) of FIG. 4. The analysis interface includes the British and American pronunciations, Chinese interpretation, Chinese and English example sentences, and related content. The user can view the learning content and the video resources related to the word on the word analysis interface.
  • the detailed content presented on the word analysis interface can come from the English dictionary built into the system itself, or it can be associated with other English online dictionaries, etc., which is not limited in this application.
  • the user can click on the video resource of the word to learn.
  • For the specific operation process, please refer to the related description of FIG. 3, which will not be repeated here.
  • Users can also click on phrases in video subtitles. For example, some words appear mostly as part of a phrase, and when the user clicks such a word, the analysis shown may be that of the phrase. For example, when the detailed content presented in the word analysis interface is associated with an English dictionary in which the word mainly appears in the form of a phrase, the analysis or interpretation of the phrase may pop up when the user clicks the word while watching the video. This application does not limit this.
  • The subtitles of the learning video and the progress bar of the learning video are presented at different positions in the playback interface. If both the subtitles and the progress bar were displayed in the same area of the playback interface, clicking a word contained in the subtitles might work poorly; for example, clicking the word might trigger the progress bar by mistake. This situation is particularly prominent when the display screen of the electronic device on which the user watches the learning video is small, or when the interface for playing the learning video is small. Therefore, the subtitles of the learning video and the progress bar of the learning video are presented at different positions in the playback interface, for example, as shown in FIG. 4(c) and FIG.
  • The video playback progress bar is displayed at the top of the screen, and the subtitles are displayed at the bottom of the screen.
  • They can also be at other positions in the playback interface. Separating them in this way can improve the sensitivity of user operations and improve the user experience.
  • This application does not limit the position of the video playback progress bar and the display of subtitles.
  • When the user closes the repeat setting control 402 or the subtitle setting control 403, or closes the word parsing box 404 of the key word to exit the learning mode of the word, the video can resume automatically without the user having to click the playback control. Alternatively, the video remains paused, and the user can click the playback control on the video display interface to continue playing the video. This application does not limit this.
  • The above method for learning words while watching movies allows English words to be learned during ordinary movie viewing. The user can click to enter word learning at any time as needed, which simplifies the search operation of word learning, makes learning more convenient, and enhances the user experience.
  • the present application also provides a word learning method that can provide users with centralized learning of multiple words included in a certain vocabulary set.
  • FIG. 5 is a schematic diagram of another example of a user interface for learning words during movie viewing provided by an embodiment of the present application.
  • all words in a vocabulary set in the video resource can be extracted in advance.
  • For example, all Level 6 vocabulary included in the subtitles of the movie is extracted to form a key word set, as shown in the word list 503 in the key word area in the figure.
  • A word set 503 is displayed.
  • The user can set the kind of the word set.
  • For example, the user can click on the "emphasis word" control to set the word list 503 to Level 6 vocabulary or IELTS vocabulary.
  • the user can click on the word in the word list 503 and list the selected word as the key word for video learning.
  • the user may click on a word in the word list 503, and list all words in the word list 503 except the selected word as key words for video learning. This application does not limit this.
  • Before selecting a movie, the user can view all the key words included in each movie and click to select the key words to be learned. Alternatively, the user can select a movie according to the number of key words, for example, select the movie that contains the most key words as the movie to watch, and click to enter the movie learning mode.
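  • The key word extraction and movie selection described above can be sketched as follows. This is a minimal Python sketch under stated assumptions: the function names, the simple regular-expression tokenizer, and the sample subtitles are all hypothetical, and the vocabulary set stands in for a real word list such as the Level 6 or IELTS vocabulary.

```python
import re
from typing import Dict, Set


def key_words_in_subtitles(subtitle_text: str, vocabulary: Set[str]) -> Set[str]:
    """Return the vocabulary words that occur in the subtitle text."""
    tokens = re.findall(r"[a-z']+", subtitle_text.lower())
    return {token for token in tokens if token in vocabulary}


def movie_with_most_key_words(subtitles_by_movie: Dict[str, str], vocabulary: Set[str]) -> str:
    """Pick the movie whose subtitles contain the most distinct key words."""
    return max(
        subtitles_by_movie,
        key=lambda movie: len(key_words_in_subtitles(subtitles_by_movie[movie], vocabulary)),
    )


# Hypothetical vocabulary set and subtitle texts.
vocabulary = {"abandon", "message", "close"}
subtitles = {
    "Movie A": "Never abandon hope. I'd like to leave a message.",
    "Movie B": "Please close the door.",
}
print(sorted(key_words_in_subtitles(subtitles["Movie A"], vocabulary)))  # ['abandon', 'message']
print(movie_with_most_key_words(subtitles, vocabulary))  # Movie A
```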
  • the user can check the position of the time slice where the key word in the movie is located through the mark in the video playback progress bar.
  • the user can find the position of the key word by dragging the video progress bar during the viewing process.
  • the key words in the movie are highlighted during the video playback.
  • For example, "abandon" is displayed differently from the other words in the subtitles to remind the user of the position and pronunciation of the word.
  • The subtitles are displayed in black as a whole.
  • When key words appear in the subtitles, they are displayed in blue and highlighted to remind users to pay attention to the position of the word and its pronunciation.
  • the video playback area may include a loop number setting control 501 and a subtitle setting control 502, as shown in (b) of FIG. 5.
  • the user can set the number of repetitions of the key word by clicking the cycle number setting control 501, and set the presentation form of the subtitle of the movie by clicking the subtitle setting control 502.
  • the function is similar to that of the repeat setting control 306 described in the related description of FIG. 3 above. For the sake of simplicity, it will not be repeated here.
  • the key words in the movie can be looped by default.
  • the word is looped three times by default, which can reduce the user's setting steps and improve the user's learning effect.
  • The user can click on any word to enter the learning mode of the word; (b) of FIG. 5 shows the word parsing box displayed after "abandon" is clicked.
  • The word parsing box includes a details control and an add-to-wordbook control. The user can click the details control to enter the learning interface of "abandon", which will not be repeated here.
  • a word analysis box 504 for the key word is automatically popped up, as shown in (b) of FIG. 5.
  • the time for popping up the word parsing box 504 can be set to a fixed duration, for example, the time for popping up the word parsing box 504 is 5 seconds, and the word parsing box 504 is automatically closed after 5 seconds. This application does not limit this.
  • When the word analysis box 504 pops up and a word learning mode is entered, the video is paused.
  • Whether the user clicks a word in the subtitle to enter the learning mode of the word, or the word analysis box 504 of a key word pops up automatically when the movie reaches that key word, the video is paused.
  • When the user closes the word analysis box 504 of the key word to exit the learning mode of the word, or when the display time of the word analysis box 504 reaches the set fixed duration and the box closes automatically, the video can continue to play; alternatively, the video remains paused, and the user can click the playback control on the video display interface to continue playing the video. This application does not limit this.
  • The method for repeating words or sentences provided by this application uses speech recognition technology to build a word search function associated with videos. It generates a correspondence index from multiple words to multiple videos and a correspondence index from a single video to multiple words, which enables users to search from words to related learning videos.
  • In addition, this application reuses the word search function and the player's seek-back capability, and locates the start and end times and the key frames of words to realize the repeat function. Specifically, the implementation includes the following steps:
  • Step 1: Generate an acoustic model
  • The acoustic model is one of the most important parts of a speech recognition system.
  • In speech recognition, the acoustic model represents the relationship between the sound signal and the phonemes or other linguistic units that constitute speech. A phoneme is the smallest unit of pronunciation.
  • A hidden Markov model (HMM) can be used for modeling.
  • The hidden Markov model is the most common acoustic model.
  • Conceptually, a hidden Markov model is a discrete-time finite-state automaton. "Hidden" means that the internal state of the Markov model is not visible to the outside world; only the output value at each moment is visible.
  • FIG. 6 is a schematic diagram of an HMM. This application uses the HMM as an example. In FIG. 6, 1 to 6 show each phoneme of a word, and 1 and 6 are the head and tail of the word. The HMM can obtain the best phoneme, word, and sentence sequence according to the probabilities.
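  • How an HMM yields "the best sequence according to the probabilities" can be illustrated with the Viterbi algorithm, the standard dynamic-programming method for finding the most likely hidden state sequence. The Python sketch below is a toy example only: the state labels `p1` to `p3`, the observation symbols, and all probabilities are invented for illustration and are not the trained acoustic model of this application. Real systems typically work with log-probabilities and continuous acoustic features.

```python
from typing import Dict, List


def viterbi(
    observations: List[str],
    states: List[str],
    start_p: Dict[str, float],
    trans_p: Dict[str, Dict[str, float]],
    emit_p: Dict[str, Dict[str, float]],
) -> List[str]:
    """Return the most likely hidden state sequence for the observations."""
    # best[t][s] = (probability of the best path ending in state s at time t, previous state)
    best = [{s: (start_p.get(s, 0.0) * emit_p[s].get(observations[0], 0.0), None) for s in states}]
    for obs in observations[1:]:
        column = {}
        for s in states:
            prob, prev = max(
                ((best[-1][p][0] * trans_p[p].get(s, 0.0), p) for p in states),
                key=lambda pair: pair[0],
            )
            column[s] = (prob * emit_p[s].get(obs, 0.0), prev)
        best.append(column)
    # Backtrack from the most probable final state.
    state = max(states, key=lambda s: best[-1][s][0])
    path = [state]
    for column in reversed(best[1:]):
        state = column[state][1]
        path.append(state)
    return list(reversed(path))


# Toy left-to-right HMM for a three-phoneme word (all numbers hypothetical).
states = ["p1", "p2", "p3"]
start_p = {"p1": 1.0}
trans_p = {"p1": {"p1": 0.5, "p2": 0.5}, "p2": {"p2": 0.5, "p3": 0.5}, "p3": {"p3": 1.0}}
emit_p = {
    "p1": {"a": 0.8, "b": 0.1, "c": 0.1},
    "p2": {"a": 0.1, "b": 0.8, "c": 0.1},
    "p3": {"a": 0.1, "b": 0.1, "c": 0.8},
}
print(viterbi(["a", "a", "b", "c"], states, start_p, trans_p, emit_p))
# ['p1', 'p1', 'p2', 'p3']
```

  • The decoded path also shows which frames belong to which phoneme, which is the information that alignment relies on.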
  • each phoneme is subjected to model training.
  • This training is performed through a large number of speech signals.
  • This application uses existing models, including monophone models and triphone models.
  • the monophone model uses an HMM to represent one phoneme
  • the triphone model uses an HMM to represent three phonemes. Because different pronunciations will change during continuous reading, for example, the continuous pronunciation of two words in English pronunciation may produce a new pronunciation. For example, the two words can and I are read consecutively, and they sound like "cannai" together. Therefore, you need to use multiple phonemes to represent the pronunciation of can.
  • Forced alignment is a technique that aligns an audio file with the correct spelling and pronunciation of dictionary words and generates the corresponding time points. Specifically, forced alignment uses the aforementioned acoustic model and the candidate words; it has to determine how to arrange these words, how to generate phonemes for them to match against the audio signal, and how to connect the acoustic models together. For example:
  • the phoneme generated by good morning is: G IH0 D M AO1 R N IH0 NG
  • The Kaldi algorithm, that is, the open source Kaldi toolkit, can be used (please refer to http://kaldi-asr.org/doc/index.html).
  • FIG. 7 is an implementation flowchart of the process of generating an acoustic model and performing forced alignment provided by the present application. The process includes:
  • 701 includes a feature extraction process and a sound model establishment process.
  • In the feature extraction process, context is ignored and a large sample library covering different contexts is obtained. The files related to the language model prepared in advance are imported, the features of the samples are extracted, and a Gaussian mixture model (GMM)-based acoustic model is trained by maximum likelihood estimation. The training then iterates, repeatedly re-estimating the GMM and combining the results scattered on different processors.
  • A triphone model is generated based on the monophone model.
  • For example, "good" is composed of 3 phonemes, and only 3 HMM models need to be established according to the monophone model.
  • However, the monophone model ignores the coarticulation effect of the context, that is, the neighboring phonemes affect the pronunciation of the central phoneme, so that it differs from the pronunciation of the phoneme in isolation.
  • The traditional triphone method is model tying, that is, normalizing the triphones and using a posterior smoothing method. Alternatively, if contexts have similar pronunciation types, their influence on the current phoneme is similar, and the corresponding data can be clustered together.
  • The Kaldi algorithm can automatically generate a question set and automatically cluster phonemes into classes based on the similarity of the phonemes themselves.
  • The linear discriminant analysis (LDA) algorithm projects the feature vectors into a lower-dimensional space so that, in the projected space, points are separated by category and points of the same category lie closer together. That is, the LDA algorithm uses a transformation matrix to reduce the dimensionality of the feature vectors so that the within-class distribution is compact and the between-class distribution is spread apart, which makes the extracted features more representative and improves classification.
  • The maximum likelihood linear transform (MLLT) uses a linear transformation matrix to decorrelate the feature vectors under the maximum likelihood (ML) criterion, so that in the new space the likelihood of the training set under the model is increased and the feature extraction process is optimized.
  • Each triphone model finally corresponds to a segment of the sound signal, that is, the start and end times of that segment are determined, and these start and end times are the phoneme-level alignment times.
  • Step 3: Generate word time series
  • FIG. 8 is a flowchart of an example of generating a time series of words provided by an embodiment of the present application. The generation process includes the following:
  • Import the subtitle files, extract the words one by one, and generate an acoustic model of each word.
  • Through step 3, for English video resources, the administrator's background processing obtains the start and end time of each word, which achieves accurate positioning of the time slice of each word, that is, it yields a file mapping each word to the audio.
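  • The step from phoneme-level alignment to word-level start and end times can be sketched as follows. This is a simplified Python illustration, not the procedure of FIG. 8: it assumes the phoneme alignment is already available as (phoneme, start, end) triples in playback order, and the function name and all timings are hypothetical. The phoneme sequence is the "good morning" example given above.

```python
from typing import List, Tuple

PhoneSpan = Tuple[str, int, int]  # (phoneme, start_ms, end_ms)


def word_time_series(
    words_with_phones: List[Tuple[str, List[str]]],
    phone_alignment: List[PhoneSpan],
) -> List[Tuple[str, int, int]]:
    """Group consecutive aligned phonemes into (word, start_ms, end_ms) entries."""
    result = []
    index = 0
    for word, phones in words_with_phones:
        segment = phone_alignment[index:index + len(phones)]
        if [phone for phone, _start, _end in segment] != phones:
            raise ValueError(f"alignment does not match the phonemes of {word!r}")
        # A word starts with its first phoneme and ends with its last one.
        result.append((word, segment[0][1], segment[-1][2]))
        index += len(phones)
    return result


words = [("good", ["G", "IH0", "D"]), ("morning", ["M", "AO1", "R", "N", "IH0", "NG"])]
alignment = [
    ("G", 120, 200), ("IH0", 200, 330), ("D", 330, 410),
    ("M", 410, 480), ("AO1", 480, 620), ("R", 620, 690),
    ("N", 690, 760), ("IH0", 760, 860), ("NG", 860, 980),
]
print(word_time_series(words, alignment))
# [('good', 120, 410), ('morning', 410, 980)]
```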
  • Step 4: Generate content association index
  • FIG. 9 is a schematic diagram of an example of a content association index provided by an embodiment of the present application. Taking Level 4 vocabulary as an example, FIG. 9 shows the files mapping words to audio that are obtained after the foregoing steps.
  • The English subtitle file includes an index from each word to its time slice,
  • and the audio file includes a time index.
  • The time slice information is used to establish the index relationship between the Level 4 vocabulary among all words of the English video resource and the Level 4 vocabulary in the audio file, to generate multiple content association index tables, and to generate a correspondence index from a single word to multiple videos or a correspondence index from a single video to multiple words.
  • Table 1 represents the correspondence between a single word and videos, and is used for the user to find relevant English learning videos through word search;
  • Table 2 represents the correspondence between a single video and words, and is used to display the English words or sentences that the user wants to learn while watching the English video and to implement the repeat function.
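  • The two kinds of index table can be sketched with ordinary dictionaries. The Python sketch below is illustrative only: the record layout `(video_id, word, start_ms, end_ms)`, the function name `build_indexes`, and the sample data are assumptions, not the table format of FIG. 9.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

Record = Tuple[str, str, int, int]  # (video_id, word, start_ms, end_ms)


def build_indexes(records: Iterable[Record]):
    """Build a word-to-videos index and a video-to-words index with time slices."""
    word_to_videos: Dict[str, Set[str]] = defaultdict(set)
    video_to_words: Dict[str, Dict[str, List[Tuple[int, int]]]] = defaultdict(lambda: defaultdict(list))
    for video_id, word, start_ms, end_ms in records:
        word_to_videos[word].add(video_id)                           # word search -> videos
        video_to_words[video_id][word].append((start_ms, end_ms))    # video -> words and time slices
    return dict(word_to_videos), {v: dict(w) for v, w in video_to_words.items()}


records = [
    ("video-1", "leave", 8500, 8900),
    ("video-1", "message", 9000, 9600),
    ("video-2", "message", 1200, 1700),
]
word_index, video_index = build_indexes(records)
print(sorted(word_index["message"]))      # ['video-1', 'video-2']
print(video_index["video-1"]["message"])  # [(9000, 9600)]
```

  • The first index serves word search; the second supplies the time slices needed to highlight and repeat a word while a given video is playing.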
  • Step 5: Generate content metadata
  • The content metadata may refer to the words input by the user and the start and end times of the words, or to the words included in the English video watched by the user and the start and end times of the words.
  • When the client requests content metadata, the correspondence between the video and the word is queried, and the start and end times of the word are assembled into metadata and returned. It should be understood that in this application, the start and end times are all at millisecond precision.
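  • A response carrying such metadata could look like the following. The field names (`video_id`, `word`, `occurrences`, `start_ms`, `end_ms`) and the JSON encoding are purely hypothetical; the application only specifies that the metadata contains the word and its start and end times in milliseconds.

```python
import json
from typing import Dict, List, Tuple


def build_content_metadata(video_id: str, word: str, occurrences_ms: List[Tuple[int, int]]) -> Dict:
    """Assemble the word's start and end times (in milliseconds) into a metadata record."""
    for start_ms, end_ms in occurrences_ms:
        if start_ms > end_ms:
            raise ValueError("start time must not be later than end time")
    return {
        "video_id": video_id,
        "word": word,
        "occurrences": [{"start_ms": s, "end_ms": e} for s, e in occurrences_ms],
    }


metadata = build_content_metadata("video-1", "message", [(9000, 9600)])
print(json.dumps(metadata))
```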
  • The generation of content metadata is completed through steps 1 to 5: a speech recognition algorithm extracts word-level start and end times from the audio of the video, and the correspondence between the words and the video content is generated. After that, words or sentences can be repeated at the user's request.
  • Step 6: Locate key frames based on word start and end times, start timed tasks, and enable the repeat function
  • After the user terminal requests content metadata from the cloud and the returned content contains the keyword and its start and end times, the user terminal needs to locate the key frame position of the keyword and repeat words or sentences according to the user's settings.
  • FIG. 10 is a flowchart of a word or sentence repetition process provided by an embodiment of the present application. According to FIG. 10, the entire process includes the following:
  • the user terminal receives the user instruction, and requests the cloud to obtain content metadata.
  • The time slice in which the keyword is located is determined. It should be understood that the key sentence here refers to the sentence in which the keyword is located, and that the time slice here and the foregoing start and end times are both at millisecond precision.
  • the learning video is played.
  • the user can import a playback link to the player and start playing the video.
  • the user terminal searches for the time of the key sentence through the keyword, and searches the current repeat mode to confirm that it is currently word repeat or sentence repeat.
  • the source of the video resource may be a video resource stored in the cloud, the user obtains the video resource by sending a request to the cloud, or the video resource may also be a local resource, which is not limited in this application.
  • the scheduled task is started, and the scheduled task is triggered at the end time of the key word.
  • The key frame is retrieved by seeking back: the player seeks back to the start time position of the keyword, retrieves the key frame, and starts playback from the key frame to realize the repetition.
  • When the current repeat mode is word repeat, the key frame is the video frame corresponding to the start time of the keyword; when the current repeat mode is sentence repeat, the key frame is the video frame corresponding to the start time of the key sentence in which the keyword is located.
  • The number of times the key frame has been retrieved is counted.
  • The number of repetitions is compared with the set number. For example, when the number of repetitions is 3 by default, repetition stops once the cumulative number of retrievals is greater than or equal to 3, and the video continues to play forward.
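  • The timed task, seek-back, and counting logic described above can be sketched with a simulated player. This Python sketch is an illustration under stated assumptions: `SimulatedPlayer`, `repeat_span`, and `play_with_repeat` are hypothetical names, playback is simulated by advancing a position counter in fixed ticks instead of using a real timer, and a real player would snap to an actual video key frame rather than an exact millisecond position.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (start_ms, end_ms)


def repeat_span(mode: str, word_span: Span, sentence_span: Span) -> Span:
    """Word repeat loops the keyword; sentence repeat loops the sentence containing it."""
    if mode == "word":
        return word_span
    if mode == "sentence":
        return sentence_span
    raise ValueError(f"unknown repeat mode: {mode!r}")


class SimulatedPlayer:
    def __init__(self, duration_ms: int) -> None:
        self.duration_ms = duration_ms
        self.position_ms = 0
        self.seeks: List[int] = []

    def seek(self, position_ms: int) -> None:
        self.position_ms = position_ms
        self.seeks.append(position_ms)


def play_with_repeat(player: SimulatedPlayer, span: Span, loop_count: int, tick_ms: int = 100) -> int:
    """Play to the end; at the span's end time, seek back to its start until loop_count is reached."""
    start_ms, end_ms = span
    seek_backs = 0
    while player.position_ms < player.duration_ms:
        player.position_ms += tick_ms  # playback advances by one tick
        # The "timed task" fires at the end time of the keyword or key sentence.
        if seek_backs < loop_count and player.position_ms >= end_ms:
            player.seek(start_ms)
            seek_backs += 1
    return seek_backs


player = SimulatedPlayer(duration_ms=3000)
span = repeat_span("word", word_span=(1500, 2000), sentence_span=(1000, 2500))
print(play_with_repeat(player, span, loop_count=3))  # 3
print(player.seeks)                                  # [1500, 1500, 1500]
```

  • Because the repetition is implemented by seeking within the original video, the video resource itself does not need to be edited or lengthened.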
  • The operations of the video resource administrator include the following: 1101, the administrator operates the management console to extract video resources; 1102, the algorithm is called to preprocess the video, and the speech algorithm is called to segment it automatically; 1103, a word sequence including timestamps is output; 1104, video content metadata is generated, that is, a search index of the word sequence is generated.
  • The user's operations may include the following: 1105, the user enters keywords and searches for short video clips through scene search or word search, and the client may display the keywords and content of the videos; 1106, the user goes to the video details page; 1107, the progress bar marks the keyword, so that the user can view the location of the keyword through the player; 1108, the user selects vocabulary repeat and can set the repeat mode to word repeat or sentence repeat, and the number of repetitions can be set through the setting interface, with a default of 3; 1109, keywords are highlighted and repetition is enabled.
  • The words can be highlighted and automatically repeated; when the cumulative number of repetitions is greater than or equal to 3, repetition stops and the video continues to play without being affected.
  • a user can click to extract the word contained in the subtitles and display the word card, and manually perform re-reading of a single word in the movie.
  • the user can learn professional words while watching the movie in the video playback application. Specifically, when a user opens an English movie, he can check which professional vocabulary is included in the current movie, such as Level 4, TOEIC, TOEFL, etc. During the movie watching, play to the position of the professional vocabulary and enable the repeat function.
  • The method for repeating words or sentences provided by this application is based on speech recognition technology. It builds a word search function associated with videos and generates a correspondence index from multiple words to multiple videos and a correspondence index from a single video to multiple words, which enables users to search from words to related learning videos.
  • the user requests content metadata from the cloud, and the acquired content metadata contains words and timeline information.
  • the length of the repeated content does not affect the duration of the video resource itself; only the playback position changes frequently. This avoids the existing approach of editing the video content, in which repeating a word lengthens the content, and thus improves the user experience.
  • FIG. 12 is a schematic flowchart of a video playback method provided by an embodiment of the present application. As shown in FIG. 12, the method may include the following steps:
  • a first interface is displayed, where the first interface displays the first video being played and the subtitles of the first video.
  • the subtitles of the first video include a first text unit and a second text unit.
  • the text unit (for example, the first text unit and the second text unit) in the user learning process may be a single word, or the text unit may include phrases, sentences, etc. of multiple words, which is not limited in this application.
  • the first interface is the interface shown in (e) or (f) in FIG. 3.
  • the first interface includes the first video being played and the subtitles of the first video.
  • the subtitles include the first text unit "message" to be learned by the user, and the words in the subtitles other than "message" are called the second text unit.
  • the first interface may also include parsing details of the first text unit, such as the British and American pronunciations, Chinese definition, Chinese and English example sentences, and video example sentences of "message".
  • the detailed content presented by the detailed analysis of the word may come from the English dictionary built into the system itself, or may be associated with other English online dictionaries, etc., which is not limited in this application.
  • the first segment is a video segment corresponding to the first text unit, or the first segment is a video segment corresponding to the entire sentence where the first text unit is located.
  • the first interface repeatedly plays the video clip from the start time to the end time corresponding to "message", as shown in picture (h) of the corresponding figure.
  • the number of times the first segment of the first video is repeatedly played is a system default or is set by the user.
  • the number of repeated playbacks may be set by the user in the background, or may be the system default. In the case that the user does not set the number of repeated playbacks, the number of repeated playbacks may be the system default 3 times. This application does not limit this.
  • the first operation may be an operation in which the user clicks the first text unit on the subtitle of the first video. For example, the user clicks on "close” in the subtitles.
  • when the mobile phone detects that the user clicks the first text unit in the subtitles, the mobile phone enters the second interface shown in (e) of FIG. 4. The second interface displays information such as the parsing details associated with the first text unit clicked by the user.
  • before displaying the first interface, the method 1200 further includes:
  • the third interface is the interface shown in (d) of FIG. 3.
  • the third interface is an interface displayed after the user performs the operation shown in (c) in FIG. 3, enters the "message” to be learned, and clicks the navigation box 305.
  • the third interface includes the word parsing details of "message" and a video list associated with "message".
  • the first video list further includes a second video, and the second operation is used to select the first video.
  • the video list may include multiple videos, and the user performs an operation of swiping upward on the third interface similar to (h) in FIG. 3 to see more selectable videos.
  • the first interface is displayed.
  • the second operation may be a user's click operation on the first video, and the user may click the first video to enter the first interface.
  • the method 1200 further includes:
  • the third operation of the user is detected on the third interface.
  • the third operation may be that the user clicks the detail control in the word analysis popup box 404.
  • a fourth interface is displayed, the fourth interface includes second information of the first text unit and a second video list, the second video list includes at least one video, the second The subtitle of each video in the video list includes the first text unit.
  • the fourth interface is the parsing interface of the text unit, entered when the user clicks the detail control; it includes the word parsing details and a video list associated with the word.
  • the first video is paused.
  • the display effect of the first text unit is different from the display effect of the second text unit.
  • for example, "message" in the subtitles is displayed differently from the other words; or, as shown in (c) and (d) in FIG. 4, "close" in the subtitles is displayed differently from the other words, for example with a highlight effect; or, as shown in (b) in FIG. 5, "abandon" in the subtitles is displayed differently from the other words.
  • the electronic device includes hardware and/or software modules corresponding to performing each function.
  • the present application can be implemented in the form of hardware or a combination of hardware and computer software. Whether a function is executed by hardware or computer software driven hardware depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application in combination with the embodiments, but such implementation should not be considered beyond the scope of the present application.
  • the electronic device may be divided into function modules according to the above method example.
  • each function module may be divided corresponding to each function, or two or more functions may be integrated into one processing module.
  • the above integrated module can be implemented in the form of hardware. It should be noted that the division of the modules in this embodiment is schematic, and is only a division of logical functions. In actual implementation, there may be another division manner.
  • FIG. 13 shows a schematic diagram of a possible composition of the electronic device 1300 involved in the above embodiment.
  • the electronic device 1300 may include: a display unit 1301, a detection unit 1302, and a processing unit 1303.
  • the display unit 1301 may be used to support the electronic device 1300 to perform the above steps 1201 and 1204, and/or other processes used in the technology described herein.
  • the detection unit 1302 may be used to support the electronic device 1300 to perform the above steps 1203, etc., and/or other processes for the technology described herein.
  • the processing unit 1303 may be used to support the electronic device 1300 to perform the above steps 1202, etc., and/or other processes for the technology described herein.
  • the electronic device provided in this embodiment is used to execute the above-mentioned video playback method, and therefore can achieve the same effect as the above-mentioned implementation method.
  • the electronic device may include a processing module, a storage module, and a communication module.
  • the processing module may be used to control and manage the actions of the electronic device. For example, it may be used to support the electronic device to execute the steps performed by the display unit 1301, the detection unit 1302, and the processing unit 1303.
  • the storage module can be used to support the electronic device in storing program code and data.
  • the communication module can be used to support communication between electronic devices and other devices.
  • the processing module may be a processor or a controller. It can implement or execute various exemplary logical blocks, modules, and circuits described in conjunction with the disclosure of the present application.
  • the processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, or a combination of a digital signal processor (DSP) and a microprocessor.
  • the storage module may be a memory.
  • the communication module may specifically be a device that interacts with other electronic devices, such as a radio frequency circuit, a Bluetooth chip, or a Wi-Fi chip.
  • the electronic device involved in this embodiment may be a device having the structure shown in FIG. 1.
  • This embodiment also provides a computer storage medium that stores computer instructions. When the computer instructions run on the electronic device, the electronic device is caused to perform the above-mentioned related method steps to implement the video playback method in the above embodiment.
  • This embodiment also provides a computer program product which, when run on a computer, causes the computer to perform the above-mentioned related steps to implement the video playback method in the above embodiment.
  • the embodiments of the present application also provide an apparatus.
  • the apparatus may specifically be a chip, a component, or a module.
  • the apparatus may include a processor and a memory that are connected to each other, where the memory is used to store computer-executable instructions.
  • the processor may execute the computer-executable instructions stored in the memory, so that the chip performs the video playback method in each of the above method embodiments.
  • the electronic devices, computer storage media, computer program products, or chips provided in this embodiment are used to perform the corresponding methods provided above. Therefore, for the beneficial effects that can be achieved, refer to the beneficial effects of the corresponding methods; they are not repeated here.
  • the disclosed device and method may be implemented in other ways.
  • the device embodiments described above are only schematic.
  • the division of modules or units is only a division of logical functions. In actual implementation, there may be other divisions; for example, multiple units or components may be combined or integrated into another device, or some features may be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may be one physical unit or multiple physical units, that is, they may be located in one place, or may be distributed in multiple different places. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above integrated unit may be implemented in the form of hardware or software functional unit.
  • when the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a readable storage medium.
  • the technical solutions of the embodiments of the present application, in essence, or the part that contributes to the existing technology, or all or part of the technical solutions, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that enable a device (which may be a single-chip microcomputer, a chip, or the like) or a processor to execute all or part of the steps of the methods of the embodiments of the present application.
  • the foregoing storage media include various media that can store program codes, such as a USB flash drive, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present application provides a video playback method and an electronic device. The method can be employed in a video playback process according to requirements of a user, and enables the user to click a text unit in subtitles at any time to perform learning of the same; or the method can be employed during the video playback process to identify a text unit input by the user, highlight and display the text unit, and automatically and repeatedly play, by means of configuring a current repeat mode, the text unit or a video clip corresponding to a sentence containing the text unit, so as to achieve repetition of the text unit. The method can facilitate effective English learning for users, simplify user operation, and improve the user experience.

Description

Method and electronic device for repeating words or sentences during video playback
This application claims priority to Chinese patent application No. 201811502510.3, filed with the China National Intellectual Property Administration on December 10, 2018 and entitled "Method and electronic device for repeating words or sentences during video playback", which is incorporated herein by reference in its entirety.
Technical field
This application relates to the field of electronic technologies, and in particular to a method and an electronic device for repeating words or sentences during video playback.
Background
In English learning, most existing English learning resources remain at the level of words, definitions, and example sentences. English learning applications make little use of video resources.
In addition, English video resources are only weakly associated with English words, so it is inconvenient for users to learn English words or sentences through video resources. Moreover, with existing English videos, a user who wants to repeat an English word or sentence can do so only by, for example, dragging the progress bar, and the time position reached by dragging is not precise enough. If repetition of words or sentences is instead achieved by editing the video content, the duration of the video content itself is affected, and the user experience is poor.
Summary of the invention
This application provides a video playback method and an electronic device that can repeat words or sentences during video playback, improving the user's English learning results and the user experience.
According to a first aspect, a video playback method is provided. The method includes: displaying a first interface, where the first interface displays a first video being played and subtitles of the first video, and the subtitles of the first video include a first text unit and a second text unit; when playback reaches a first segment of the first video corresponding to the first text unit, automatically and repeatedly playing the first segment on the first interface; detecting a first operation of a user on the first interface; and in response to the first operation, displaying a second interface, where first information associated with the first text unit is displayed on the second interface.
It should be understood that a text unit in the user's learning process (for example, the first text unit and the second text unit) may be a single word, or may be a phrase, a sentence, or the like that includes multiple words. This is not limited in this application.
It should be noted that the first interface may correspond to multiple possible cases. For example, when the electronic device is in full-screen playback mode, the first interface is the video playback display area, on which the subtitles and the like are displayed. Alternatively, when the electronic device is not in full-screen mode, the first interface may include, in addition to the video playback display area, other display areas, such as an area showing the parsing details of the text unit and multiple other learning video resources associated with the text unit. This is not limited in this application.
Here, the first information may be the parsing details of the first text unit, such as its British and American pronunciations, Chinese definition, Chinese and English example sentences, and related learning videos.
It should also be understood that the second interface is the interface displayed after the user clicks a text unit in the subtitles and the parsing details of that text unit are shown, for example, the interface after a parsing window for the text unit pops up on the video playback interface. Corresponding to the multiple possible cases of the first interface, the second interface also has multiple possible cases: in addition to what the first interface displays, the second interface includes the parsing pop-up window shown after the user clicks the text unit in the subtitles. The parsing pop-up window may include, for example, the British and American pronunciations, Chinese definition, and Chinese and English example sentences of the text unit. This is not limited in this application.
Optionally, the detailed content presented by the word parsing, parsing details, and the like described in this application may come from an English dictionary built into the system, or may be linked to other online English dictionaries. This is not limited in this application.
It should also be understood that, in addition to clicking a word in the video subtitles, the user may click a phrase in the video subtitles. For example, if a word almost always appears as part of a phrase, the parsing may be presented for the phrase when the user clicks. For example, when the detailed content presented on the word parsing interface is linked to an English dictionary in which the word mainly appears in phrase form, clicking the word while watching the video may also bring up the parsing or definition of the phrase. This is not limited in this application.
The foregoing method for learning words, phrases, or sentences during video playback allows the user to learn English words while watching a video and to click into word learning at any time as needed. This simplifies the search operations involved in word learning, makes learning more convenient, and improves the user experience.
With reference to the first aspect, in some implementations of the first aspect, the first segment is the video segment corresponding to the first text unit, or the first segment is the video segment corresponding to the entire sentence in which the first text unit is located.
For example, after the current repeat mode is set to word repeat through the repeat settings box, the video segment from the start time to the end time corresponding to the keyword entered by the user is repeatedly played on the first interface.
Alternatively, after the current repeat mode is set to sentence repeat through the repeat settings box, the entire sentence containing the keyword entered by the user is repeatedly played on the first interface, that is, the video segment from the start time to the end time corresponding to that sentence is repeated.
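The choice between the two repeat modes can be illustrated with a minimal Python sketch. The `Cue` record, the `segment_for` function, the mode strings, and the sample timings are all hypothetical names and values introduced here for illustration; the application does not specify any data format.

```python
from __future__ import annotations
from dataclasses import dataclass


@dataclass(frozen=True)
class Cue:
    """One subtitle word with an assumed start/end time in seconds."""
    word: str
    start: float
    end: float


def segment_for(sentence: list[Cue], keyword: str, mode: str) -> tuple[float, float] | None:
    """Return the (start, end) span to repeat, or None if the keyword is absent."""
    hit = next((c for c in sentence if c.word.lower() == keyword.lower()), None)
    if hit is None:
        return None
    if mode == "word":
        # Word repeat: only the span of the keyword itself.
        return (hit.start, hit.end)
    if mode == "sentence":
        # Sentence repeat: the span of the whole sentence containing the keyword.
        return (sentence[0].start, sentence[-1].end)
    raise ValueError(f"unknown repeat mode: {mode!r}")


# Hypothetical timings for the sentence "leave a message".
sentence = [Cue("leave", 12.0, 12.4), Cue("a", 12.4, 12.5), Cue("message", 12.5, 13.1)]
print(segment_for(sentence, "message", "word"))
print(segment_for(sentence, "message", "sentence"))
```

In this sketch the sentence span is simply the first word's start to the last word's end; a real implementation might instead take sentence boundaries from the subtitle file.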
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, the number of times the first segment of the first video is repeatedly played is a system default or is set by the user.
For example, the number of repetitions may be set by the user in the background settings, or may be the system default. If the user has not set the number of repetitions, the system default of 3 may be used. This is not limited in this application.
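As a rough sketch of how the repeat count might drive playback, the following Python function lists the spans a player would play in order. The function name `playback_plan` is hypothetical, and the sketch assumes the count means extra replays after the first pass, since the text does not say whether the first pass is counted.

```python
from __future__ import annotations

DEFAULT_REPEAT_COUNT = 3  # the system default named in the text


def playback_plan(duration: float, segment: tuple[float, float],
                  repeat_count: int | None = None) -> list[tuple[float, float]]:
    """Return the (start, end) spans played in order for one repeated segment."""
    # Fall back to the system default when the user has not set a count.
    count = DEFAULT_REPEAT_COUNT if repeat_count is None else repeat_count
    if count < 0:
        raise ValueError("repeat_count must be non-negative")
    start, end = segment
    spans = [(0.0, end)]                  # normal playback up to the end of the segment
    spans.extend([(start, end)] * count)  # seek back and replay (assumed: extra replays)
    spans.append((end, duration))         # then continue with the rest of the video
    return spans


print(playback_plan(60.0, (12.5, 13.1)))      # user did not set a count
print(playback_plan(60.0, (12.5, 13.1), 1))   # user chose a single replay
```

Because repetition is modelled as seeking within the same file, the video resource keeps its original duration; only the viewing time grows.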
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, the method further includes: before displaying the first interface, displaying a third interface, where the third interface displays the first text unit entered by the user and includes second information associated with the first text unit and a first video list, and the first video list includes the first video; detecting a second operation of the user on the third interface; and in response to the second operation, displaying the first interface.
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, the first video list further includes a second video, and the second operation is used to select the first video.
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, the method further includes: detecting a third operation of the user on the third interface; and in response to the third operation, displaying a fourth interface, where the fourth interface includes the second information of the first text unit and a second video list, the second video list includes at least one video, and the subtitles of each video in the second video list include the first text unit.
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, when the second interface is displayed, playback of the first video is paused.
During video playback, whenever the user clicks the repeat settings control to set the repeat mode and the number of repetitions, or clicks any word in the subtitles and enters the learning mode for that word so that its word parsing box pops up, the video is paused.
In a possible implementation, after the user closes the repeat settings control or the subtitle settings control, or closes the word parsing box of the key word and exits the learning mode for that word, the video may automatically resume playing without the user clicking the play control again. Alternatively, the video remains paused, and the user may click the play control on the video display interface to resume playback. This is not limited in this application.
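The two resume behaviours described above can be modelled with a very small state sketch. The `Player` class, its `auto_resume` flag, and the method names are hypothetical and only illustrate the two alternatives; they are not taken from the application.

```python
class Player:
    """Toy playback state: an overlay pauses the video; closing it may resume."""

    def __init__(self, auto_resume: bool = True) -> None:
        self.auto_resume = auto_resume
        self.playing = True

    def open_overlay(self) -> None:
        # Opening the repeat settings or a word parsing box pauses the video.
        self.playing = False

    def close_overlay(self) -> None:
        # First alternative: playback resumes automatically on close.
        if self.auto_resume:
            self.playing = True

    def press_play(self) -> None:
        # Second alternative: the user resumes via the play control.
        self.playing = True


auto = Player(auto_resume=True)
auto.open_overlay()
auto.close_overlay()
print(auto.playing)

manual = Player(auto_resume=False)
manual.open_overlay()
manual.close_overlay()
print(manual.playing)
manual.press_play()
print(manual.playing)
```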
The foregoing method for repeating words in a video allows the user to learn English words during ordinary viewing and to click into word learning at any time as needed. This simplifies the search operations involved in word learning, makes learning more convenient, and improves the user experience.
When a user is watching a film for leisure and wants to learn a word in the subtitles, the user can use the foregoing method to click the word in the subtitles and enter the learning mode for that word. In another scenario, the user may need targeted learning of certain key words, for example, multiple words in a vocabulary set, which may be CET-4 or CET-6 vocabulary, IELTS vocabulary, or the like. For this scenario, this application further provides a word learning method that offers the user concentrated learning of multiple words included in a vocabulary set.
In a possible implementation, the subtitles of the learning video and the progress bar of the learning video are presented at different positions in the playback interface. If the subtitles and the progress bar were displayed in the same area of the playback interface, clicking a word in the subtitles might work poorly; for example, the user might accidentally trigger the progress bar while clicking a word. The problem is more pronounced when the display of the electronic device on which the user watches the learning video is small, or when the interface used to play the learning video is small. Therefore, presenting the subtitles and the progress bar at different positions in the playback interface, for example, with the progress bar at the top of the screen and the subtitles at the bottom, or at other positions in the playback interface, improves the responsiveness of user operations and the user experience. The positions of the progress bar and the subtitles are not limited in this application.
With reference to the first aspect and the foregoing implementations, in some implementations of the first aspect, the display effect of the first text unit is different from the display effect of the second text unit.
Specifically, during video playback, the word the user is learning is highlighted in the subtitles. When "message", the word the user wants to learn, appears in the subtitles, it is displayed differently from the other words in the subtitles, to remind the user of the word's position and to draw attention to its pronunciation.
In addition, when a user is watching a film for leisure and wants to learn a word in the subtitles, the user can use the foregoing method to click the word in the subtitles and enter the learning mode for that word. In another scenario, the user may need targeted learning of certain key words, for example, multiple words in a vocabulary set, which may be CET-4 or CET-6 vocabulary, IELTS vocabulary, or the like.
For this scenario, this application further provides a word learning method that offers the user concentrated learning of multiple words included in a vocabulary set. That is, for a given video resource, all words in the video resource that belong to a vocabulary set can be extracted in advance. Before choosing a film, the user can view all the key words included in each film and click to select the key words to learn; or the user can choose a film according to the number of key words, for example, choosing the film that includes the most key words as the film to watch, and click to enter the learning mode for that film.
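A minimal sketch of the pre-extraction and film selection just described might look as follows. The function names, the sample vocabulary, and the film word lists are hypothetical; the vocabulary set is assumed to be stored in lower case.

```python
from __future__ import annotations


def key_words_in(subtitle_words: list[str], vocabulary: set[str]) -> list[str]:
    """Return the sorted vocabulary-set words that occur in a film's subtitles."""
    return sorted({w.lower() for w in subtitle_words} & vocabulary)


def pick_film(films: dict[str, list[str]], vocabulary: set[str]) -> str:
    """Return the title of the film whose subtitles contain the most key words."""
    return max(films, key=lambda title: len(key_words_in(films[title], vocabulary)))


# Hypothetical vocabulary set and subtitle word lists.
vocabulary = {"abandon", "message", "close"}
films = {
    "Film A": ["please", "close", "the", "door"],
    "Film B": ["Abandon", "the", "message", "now"],
}
print(key_words_in(films["Film B"], vocabulary))
print(pick_film(films, vocabulary))
```

Ties are resolved here by dictionary order, which is an arbitrary choice of this sketch.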
It should be understood that, in the learning-video playback process described above, when the video reaches the frame containing the word, the word is highlighted and automatically repeated; after the repetition is complete, the video continues playing unaffected. Alternatively, when the video reaches the frame containing the word, the word parsing box of the key word pops up; after the box has been displayed for a preset duration, the video continues playing unaffected.
With the foregoing method for repeating words or sentences in a video, the user can, based on the English subtitles of a video and while watching the English video, use capabilities such as the word index and the player's rewind to repeat and read along with English words, improving the user's English learning results and the user experience.
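The word index mentioned above could be built from timestamped word sequences roughly as in the following Python sketch. The function `build_indexes`, the transcript layout, and the sample data are assumptions made for illustration; the application does not define the metadata format.

```python
from __future__ import annotations
from collections import defaultdict

Span = tuple[float, float]


def build_indexes(transcripts: dict[str, list[tuple[str, float, float]]]):
    """Build word -> {video: [spans]} and video -> {words} from timestamped words."""
    word_to_videos: dict[str, dict[str, list[Span]]] = defaultdict(lambda: defaultdict(list))
    video_to_words: dict[str, set[str]] = defaultdict(set)
    for video_id, cues in transcripts.items():
        for word, start, end in cues:
            key = word.lower()
            # Each occurrence keeps its time span so the player can seek back to it.
            word_to_videos[key][video_id].append((start, end))
            video_to_words[video_id].add(key)
    # Return plain dicts so that lookups of unknown keys do not create entries.
    return ({w: dict(v) for w, v in word_to_videos.items()}, dict(video_to_words))


# Hypothetical timestamped word sequences for two videos.
transcripts = {
    "v1": [("Leave", 12.0, 12.4), ("a", 12.4, 12.5), ("message", 12.5, 13.1)],
    "v2": [("message", 3.0, 3.6)],
}
word_index, video_index = build_indexes(transcripts)
print(sorted(word_index["message"]))
print(word_index["message"]["v1"])
print(sorted(video_index["v1"]))
```

Searching a word then amounts to a lookup in `word_index`, and the stored span gives the player the position to rewind to for repetition.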
第二方面,提供了一种电子设备,包括:一个或多个处理器;存储器;多个应用程序;以及一个或多个程序,其中该一个或多个程序被存储在该存储器中,当该一个或者多个程序被该处理器执行时,使得该电子设备执行以下步骤:显示第一界面,该第一界面显示正在播放的第一视频和该第一视频的字幕,该第一视频的字幕包括第一文本单元和第二文本单元;当播放到所述第一文本单元对应的该第一视频的第一片段时,在第一界面上自动重复播放该第一片段;在该第一界面上检测用户的第一操作;响应于该第一操作,显示第二界面,在该第二界面上显示与该第一文本单元关联的第一信息。In a second aspect, an electronic device is provided, including: one or more processors; a memory; multiple application programs; and one or more programs, wherein the one or more programs are stored in the memory when the When one or more programs are executed by the processor, the electronic device is caused to perform the following steps: display a first interface, the first interface displays the first video being played and the subtitles of the first video, and the subtitles of the first video It includes a first text unit and a second text unit; when the first segment of the first video corresponding to the first text unit is played, the first segment is automatically and repeatedly played on the first interface; on the first interface The first operation of the user is detected on the top; in response to the first operation, a second interface is displayed, and the first information associated with the first text unit is displayed on the second interface.
With reference to the second aspect, in some implementations of the second aspect, the first segment is the video segment corresponding to the first text unit, or the first segment is the video segment corresponding to the entire sentence in which the first text unit is located.
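The two options for the first segment (the span of the text unit itself, or the span of the whole sentence containing it) can be illustrated as follows. This is a sketch under stated assumptions: the `WordTiming` and `Sentence` types, the function `first_segment`, and the availability of per-word timestamps are hypothetical and are not taken from the application.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class WordTiming:
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass(frozen=True)
class Sentence:
    words: List[WordTiming]


def first_segment(
    sentences: List[Sentence], text_unit: str, whole_sentence: bool
) -> Optional[Tuple[float, float]]:
    """Return the (start, end) span to repeat for the first match of `text_unit`.

    If `whole_sentence` is True, the span covers the sentence containing the
    word; otherwise it covers only the word. Returns None if nothing matches.
    """
    target = text_unit.lower()
    for sentence in sentences:
        for word in sentence.words:
            if word.text.lower() == target:
                if whole_sentence:
                    return (sentence.words[0].start, sentence.words[-1].end)
                return (word.start, word.end)
    return None


if __name__ == "__main__":
    demo = [Sentence([
        WordTiming("I", 1.0, 1.2),
        WordTiming("love", 1.2, 1.6),
        WordTiming("apples", 1.6, 2.3),
    ])]
    print(first_segment(demo, "love", whole_sentence=False))
    print(first_segment(demo, "love", whole_sentence=True))
```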
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, when the one or more programs are executed by the processor, the electronic device is caused to perform the following steps: before displaying the first interface, displaying a third interface, where the third interface displays the first text unit input by the user and includes second information associated with the first text unit and a first video list, and the first video list includes the first video; detecting a second operation of the user on the third interface; and in response to the second operation, displaying the first interface.
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, the first video list further includes a second video, and the second operation is used to select the first video.
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, when the one or more programs are executed by the processor, the electronic device is caused to perform the following steps: detecting a third operation of the user on the third interface; and in response to the third operation, displaying a fourth interface, where the fourth interface includes the second information of the first text unit and a second video list, the second video list includes at least one video, and the subtitles of each video in the second video list include the first text unit.
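The condition on the second video list (every listed video has subtitles containing the first text unit) amounts to a filter over subtitle text. The sketch below assumes, purely for illustration, that subtitles are available as plain strings keyed by video title and that a case-insensitive whole-word match is wanted; the function name `videos_containing` is hypothetical.

```python
import re
from typing import Dict, List


def videos_containing(text_unit: str, subtitles: Dict[str, str]) -> List[str]:
    """Return the titles of videos whose subtitle text contains `text_unit`.

    Matching is case-insensitive and on whole words, so "give" does not
    match "forgiven".
    """
    pattern = re.compile(r"\b" + re.escape(text_unit) + r"\b", re.IGNORECASE)
    return [title for title, text in subtitles.items() if pattern.search(text)]


if __name__ == "__main__":
    library = {
        "Video A": "Never give up.",
        "Video B": "Hello there.",
        "Video C": "Give me a hand.",
    }
    print(videos_containing("give", library))
```

A production system would more likely query a prebuilt index than scan every subtitle file, but the filter condition is the same.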
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, playback of the first video is paused while the second interface is displayed.
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, the number of times the first segment of the first video is repeatedly played is a system default preset number or is set by the user.
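The choice between a system default and a user-set repeat count can be expressed as a simple fallback. The default value of 3 and the name `resolve_repeat_count` are assumptions made for this sketch; the application does not specify a particular default.

```python
from typing import Optional

DEFAULT_REPEAT_COUNT = 3  # hypothetical system default, not from the application


def resolve_repeat_count(
    user_setting: Optional[int], default: int = DEFAULT_REPEAT_COUNT
) -> int:
    """Return the user's repeat count if one is set, else the system default."""
    if user_setting is None:
        return default
    if user_setting < 1:
        raise ValueError("repeat count must be at least 1")
    return user_setting


if __name__ == "__main__":
    print(resolve_repeat_count(None))  # falls back to the default
    print(resolve_repeat_count(5))     # the user's value wins
```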
With reference to the second aspect and the foregoing implementations, in some implementations of the second aspect, the display effect of the first text unit differs from the display effect of the second text unit.
In a third aspect, the present application provides an apparatus included in an electronic device, where the apparatus has the function of implementing the behavior of the electronic device in the foregoing aspects and their possible implementations. The function may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules or units corresponding to the function, for example, a display module or unit, a detection module or unit, or a processing module or unit.
In a fourth aspect, the present application provides an electronic device, including: a touch display screen, where the touch display screen includes a touch-sensitive surface and a display; a camera; one or more processors; a memory; a plurality of application programs; and one or more computer programs. The one or more computer programs are stored in the memory and include instructions. When the instructions are executed by the electronic device, the electronic device is caused to perform the video playback method in any possible implementation of any of the foregoing aspects.
In a fifth aspect, the present application provides an electronic device, including one or more processors and one or more memories. The one or more memories are coupled to the one or more processors and are configured to store computer program code, where the computer program code includes computer instructions. When the one or more processors execute the computer instructions, the electronic device is caused to perform the video playback method in any possible implementation of any of the foregoing aspects.
In a sixth aspect, the present application provides a computer storage medium including computer instructions that, when run on an electronic device, cause the electronic device to perform the video playback method in any possible implementation of any of the foregoing aspects.
In a seventh aspect, the present application provides a computer program product that, when run on an electronic device, causes the electronic device to perform the video playback method in any possible implementation of any of the foregoing aspects.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic diagram of the hardware structure of an electronic device provided by an embodiment of the present application.
FIG. 2 is a schematic diagram of the software structure of an electronic device provided by an embodiment of the present application.
FIG. 3 is a schematic diagram of an example user interface for word repetition in a video provided by an embodiment of the present application.
FIG. 4 is a schematic diagram of another example user interface for learning words while watching a video provided by an embodiment of the present application.
FIG. 5 is a schematic diagram of another example user interface for learning words while watching a video provided by an embodiment of the present application.
FIG. 6 is a schematic diagram of an example HMM model provided by an embodiment of the present application.
FIG. 7 is an implementation flowchart of an example process of generating an acoustic model and performing forced alignment provided by the present application.
FIG. 8 is a flowchart of an example of generating a word time series provided by an embodiment of the present application.
FIG. 9 is a schematic diagram of an example content association index provided by an embodiment of the present application.
FIG. 10 is an implementation flowchart of the word or sentence repetition process provided by an embodiment of the present application.
FIG. 11 is a schematic diagram of the implementation process of the method for word or sentence repetition in a video provided by an embodiment of the present application.
FIG. 12 is a schematic flowchart of the video playback method provided by an embodiment of the present application.
FIG. 13 is a schematic diagram of the composition of an example electronic device provided by an embodiment of the present application.
DETAILED DESCRIPTION
The technical solutions in the embodiments of the present application are described below with reference to the accompanying drawings. In the description of the embodiments of the present application, unless otherwise stated, "/" means "or"; for example, A/B may mean A or B. The term "and/or" herein merely describes an association between associated objects and indicates that three relationships are possible; for example, "A and/or B" may indicate three cases: A alone, both A and B, and B alone. In addition, in the description of the embodiments of the present application, "a plurality of" means two or more.
In the following, the terms "first" and "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or implicitly indicating the number of the technical features referred to. Thus, a feature qualified by "first" or "second" may explicitly or implicitly include one or more such features. In the description of this embodiment, unless otherwise stated, "a plurality of" means two or more.
An embodiment of the present application provides a method for repeating a word or sentence in a video. The method may be applied to an electronic device, or may be provided as a standalone application that implements the word or sentence repetition method of the present application. Specifically, a user watching an English video can, based on the video's English subtitles, use capabilities such as the word index and the player's rewind function to repeat and read along with English words, which improves the user's English learning results and the user experience.
The method for repeating a word or sentence in a video provided by the embodiments of the present application may be applied to electronic devices such as mobile phones, tablet computers, wearable devices, in-vehicle devices, augmented reality (AR)/virtual reality (VR) devices, notebook computers, ultra-mobile personal computers (UMPC), netbooks, and personal digital assistants (PDA). The embodiments of the present application place no limitation on the specific type of electronic device.
By way of example, FIG. 1 shows a schematic structural diagram of an electronic device 100. The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headset jack 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, a barometric pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
It can be understood that the structure illustrated in the embodiments of the present application does not constitute a specific limitation on the electronic device 100. In other embodiments of the present application, the electronic device 100 may include more or fewer components than shown, combine some components, split some components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). Different processing units may be independent devices or may be integrated into one or more processors.
The controller may be the nerve center and command center of the electronic device 100. The controller can generate operation control signals according to instruction opcodes and timing signals, thereby controlling instruction fetching and instruction execution.
A memory may also be provided in the processor 110 to store instructions and data. In some embodiments, the memory in the processor 110 is a cache. This memory can hold instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs the instructions or data again, it can fetch them directly from this memory. This avoids repeated accesses and reduces the waiting time of the processor 110, thereby improving system efficiency.
In some embodiments, the processor 110 may include one or more interfaces. The interfaces may include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface.
The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may include multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, a charger, a flash, the camera 193, and so on through different I2C bus interfaces. For example, the processor 110 may be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate over the I2C bus interface to implement the touch function of the electronic device 100.
The I2S interface may be used for audio communication. In some embodiments, the processor 110 may include multiple sets of I2S buses. The processor 110 may be coupled to the audio module 170 through an I2S bus to enable communication between the processor 110 and the audio module 170. In some embodiments, the audio module 170 may pass audio signals to the wireless communication module 160 through the I2S interface, enabling calls to be answered through a Bluetooth headset.
The PCM interface may also be used for audio communication; it samples, quantizes, and encodes analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 may also pass audio signals to the wireless communication module 160 through the PCM interface, enabling calls to be answered through a Bluetooth headset. Both the I2S interface and the PCM interface may be used for audio communication.
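The three PCM steps named above (sampling, quantizing, encoding) can be illustrated in software. This is only a generic sketch of pulse code modulation, not a model of the PCM bus in the electronic device 100; the function name `pcm16_encode`, the 16-bit little-endian format, and the 8 kHz sample rate are assumptions chosen for the example.

```python
import math
import struct
from typing import Callable


def pcm16_encode(signal: Callable[[float], float], sample_rate: int, num_samples: int) -> bytes:
    """Sample `signal`, quantize to signed 16-bit integers, and pack as bytes.

    `signal` maps time in seconds to an amplitude in [-1.0, 1.0]; values
    outside that range are clipped before quantization.
    """
    peak = 32767  # largest positive value of a signed 16-bit sample
    samples = []
    for i in range(num_samples):
        t = i / sample_rate                 # sampling: evaluate at discrete times
        x = max(-1.0, min(1.0, signal(t)))  # clip to the representable range
        samples.append(round(x * peak))     # quantizing: map to an integer level
    # encoding: little-endian signed 16-bit words
    return struct.pack("<%dh" % num_samples, *samples)


if __name__ == "__main__":
    tone = lambda t: math.sin(2 * math.pi * 1000 * t)  # 1 kHz test tone
    data = pcm16_encode(tone, sample_rate=8000, num_samples=8)
    print(len(data), struct.unpack("<8h", data))
```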
The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial and parallel form. In some embodiments, the UART interface is typically used to connect the processor 110 and the wireless communication module 160. For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 may pass audio signals to the wireless communication module 160 through the UART interface, enabling music playback through a Bluetooth headset.
The MIPI interface may be used to connect the processor 110 to peripheral devices such as the display screen 194 and the camera 193. The MIPI interface includes a camera serial interface (CSI), a display serial interface (DSI), and the like. In some embodiments, the processor 110 and the camera 193 communicate through the CSI interface to implement the shooting function of the electronic device 100. The processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100.
The GPIO interface can be configured by software. It may be configured to carry control signals or data signals. In some embodiments, the GPIO interface may be used to connect the processor 110 to the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface may also be configured as an I2C interface, an I2S interface, a UART interface, a MIPI interface, and so on.
The USB interface 130 is an interface that conforms to the USB standard specification and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, or the like. The USB interface 130 may be used to connect a charger to charge the electronic device 100, or to transfer data between the electronic device 100 and peripheral devices. It may also be used to connect a headset and play audio through the headset. The interface may further be used to connect other electronic devices, such as AR devices.
It can be understood that the interface connection relationships between the modules illustrated in the embodiments of the present application are merely illustrative and do not constitute a structural limitation on the electronic device 100. In other embodiments of the present application, the electronic device 100 may also use interface connection methods different from those in the foregoing embodiments, or a combination of multiple interface connection methods.
The charging management module 140 is configured to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive the charging input of a wired charger through the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through a wireless charging coil of the electronic device 100. While charging the battery 142, the charging management module 140 may also supply power to the electronic device through the power management module 141.
The power management module 141 is configured to connect the battery 142, the charging management module 140, and the processor 110. The power management module 141 receives input from the battery 142 and/or the charging management module 140 and supplies power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, the wireless communication module 160, and the like. The power management module 141 may also be used to monitor parameters such as battery capacity, battery cycle count, and battery health (leakage, impedance). In some other embodiments, the power management module 141 may also be provided in the processor 110. In still other embodiments, the power management module 141 and the charging management module 140 may be provided in the same device.
The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, the baseband processor, and the like.
The antenna 1 and the antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in the electronic device 100 may be used to cover a single communication frequency band or multiple communication frequency bands. Different antennas may also be multiplexed to improve antenna utilization. For example, the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, an antenna may be used in combination with a tuning switch.
The mobile communication module 150 may provide wireless communication solutions, including 2G/3G/4G/5G, applied to the electronic device 100. The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), and the like. The mobile communication module 150 may receive electromagnetic waves through the antenna 1, perform processing such as filtering and amplification on the received electromagnetic waves, and pass them to the modem processor for demodulation. The mobile communication module 150 may also amplify the signal modulated by the modem processor and convert it into electromagnetic waves radiated through the antenna 1. In some embodiments, at least some functional modules of the mobile communication module 150 may be provided in the processor 110. In some embodiments, at least some functional modules of the mobile communication module 150 and at least some modules of the processor 110 may be provided in the same device.
The modem processor may include a modulator and a demodulator. The modulator is used to modulate the low-frequency baseband signal to be transmitted into a medium- or high-frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then passes the demodulated low-frequency baseband signal to the baseband processor for processing. After being processed by the baseband processor, the low-frequency baseband signal is passed to the application processor. The application processor outputs sound signals through an audio device (not limited to the speaker 170A, the receiver 170B, and the like), or displays images or video through the display screen 194. In some embodiments, the modem processor may be an independent device. In other embodiments, the modem processor may be independent of the processor 110 and provided in the same device as the mobile communication module 150 or other functional modules.
The wireless communication module 160 may provide wireless communication solutions applied to the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, frequency-modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110. The wireless communication module 160 may also receive signals to be transmitted from the processor 110, frequency-modulate and amplify them, and convert them into electromagnetic waves radiated through the antenna 2.
In some embodiments, the antenna 1 of the electronic device 100 is coupled to the mobile communication module 150, and the antenna 2 is coupled to the wireless communication module 160, so that the electronic device 100 can communicate with networks and other devices through wireless communication technologies. The wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technology. The GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the BeiDou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS), and/or satellite based augmentation systems (SBAS).
The electronic device 100 implements the display function through the GPU, the display screen 194, the application processor, and the like. The GPU is a microprocessor for image processing and connects the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
The display screen 194 is used to display images, video, and the like. The display screen 194 includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), a Miniled, a MicroLed, a Micro-oLed, a quantum dot light-emitting diode (QLED), or the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
The electronic device 100 may implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and the like.
The ISP processes the data fed back by the camera 193. For example, when a photo is taken, the shutter opens, light is transmitted through the lens to the photosensitive element of the camera, and the optical signal is converted into an electrical signal. The photosensitive element passes the electrical signal to the ISP, which converts it into an image visible to the naked eye. The ISP can also algorithmically optimize the noise, brightness, and skin tone of the image, and can optimize parameters such as the exposure and color temperature of the shooting scene. In some embodiments, the ISP may be disposed in the camera 193.
The camera 193 is used to capture still images or video. An object generates an optical image through the lens, and the image is projected onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal and passes it to the ISP, which converts it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard format such as RGB or YUV. In some embodiments, the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
The digital signal processor is used to process digital signals. In addition to digital image signals, it can process other digital signals. For example, when the electronic device 100 selects a frequency, the digital signal processor performs a Fourier transform or the like on the energy of that frequency.
The video codec is used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in a plurality of encoding formats, for example, moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.
The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, for example, the transfer mode between neurons in the human brain, it processes input information quickly and can also learn continuously by itself. The NPU can implement applications such as intelligent cognition of the electronic device 100, for example, image recognition, face recognition, speech recognition, and text understanding.
The external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function, for example, saving files such as music and videos on the external memory card.
The internal memory 121 may be used to store computer-executable program code, where the executable program code includes instructions. The processor 110 runs the instructions stored in the internal memory 121 to execute the various functional applications and data processing of the electronic device 100. The internal memory 121 may include a program storage area and a data storage area. The program storage area may store an operating system and an application program required by at least one function (such as a sound playback function or an image playback function). The data storage area may store data created during use of the electronic device 100 (such as audio data and a phone book). In addition, the internal memory 121 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or a universal flash storage (UFS).
The electronic device 100 may implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headset jack 170D, the application processor, and the like.
The audio module 170 is used to convert digital audio information into an analog audio signal for output, and to convert an analog audio input into a digital audio signal. The audio module 170 can also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.
The speaker 170A, also called a "loudspeaker", is used to convert an audio electrical signal into a sound signal. The user can listen to music or to a hands-free call through the speaker 170A of the electronic device 100.
The receiver 170B, also called an "earpiece", is used to convert an audio electrical signal into a sound signal. When the electronic device 100 answers a call or plays a voice message, the user can listen by holding the receiver 170B close to the ear.
The microphone 170C, also called a "mic" or "sound transducer", is used to convert a sound signal into an electrical signal. When making a call or sending a voice message, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which implement a noise reduction function in addition to collecting sound signals. In still other embodiments, the electronic device 100 may be provided with three, four, or more microphones 170C to collect sound signals, reduce noise, identify the sound source, implement a directional recording function, and the like.
The headset jack 170D is used to connect a wired headset. The headset jack 170D may be the USB interface 130, or may be a 3.5 mm open mobile terminal platform (OMTP) standard interface or a cellular telecommunications industry association of the USA (CTIA) standard interface.
The pressure sensor 180A is used to sense a pressure signal and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be disposed on the display screen 194. There are many types of pressure sensor 180A, such as resistive pressure sensors, inductive pressure sensors, and capacitive pressure sensors. A capacitive pressure sensor may include at least two parallel plates made of conductive material. When a force acts on the pressure sensor 180A, the capacitance between the electrodes changes, and the electronic device 100 determines the intensity of the pressure from the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation by using the pressure sensor 180A. The electronic device 100 may also calculate the touch position from the detection signal of the pressure sensor 180A. In some embodiments, touch operations that act on the same touch position but have different intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than a first pressure threshold acts on the short message application icon, an instruction to view short messages is executed. When a touch operation whose intensity is greater than or equal to the first pressure threshold acts on the short message application icon, an instruction to create a new short message is executed.
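As a purely illustrative sketch of the threshold rule just described, the following Python fragment maps a touch on the short message icon to one of two instructions. The names `TouchEvent`, `dispatch_touch`, and the value `0.5` are assumptions made for illustration; the description only names a "first pressure threshold" and does not specify its value or any API.

```python
from dataclasses import dataclass

# Hypothetical value: the description does not give the first pressure threshold.
FIRST_PRESSURE_THRESHOLD = 0.5


@dataclass
class TouchEvent:
    target: str       # identifier of the touched icon (hypothetical naming)
    intensity: float  # touch intensity derived from the pressure sensor


def dispatch_touch(event: TouchEvent,
                   threshold: float = FIRST_PRESSURE_THRESHOLD) -> str:
    """Return the instruction for a touch on the short message icon."""
    if event.target != "sms_icon":
        return "ignore"
    # Below the threshold: view messages; at or above it: create a new one.
    if event.intensity < threshold:
        return "view_sms"
    return "create_sms"
```

Note that the boundary case (intensity equal to the threshold) falls into the "create" branch, matching the "greater than or equal to" wording above.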
The gyro sensor 180B may be used to determine the motion posture of the electronic device 100. In some embodiments, the angular velocity of the electronic device 100 around three axes (that is, the x, y, and z axes) may be determined by the gyro sensor 180B. The gyro sensor 180B can be used for image stabilization during shooting. For example, when the shutter is pressed, the gyro sensor 180B detects the angle by which the electronic device 100 shakes, the distance that the lens module needs to compensate is calculated from that angle, and the lens counteracts the shake of the electronic device 100 through a reverse movement, thereby achieving image stabilization. The gyro sensor 180B can also be used in navigation and motion-sensing game scenarios.
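The stabilization step can be sketched as follows. This is a minimal example under stated assumptions, not the method of the embodiments: the shake angle is obtained by summing angular-velocity samples over a fixed time step, and the compensation distance uses a simple pinhole relation (shift = focal length × tan(angle)). The function names and the pinhole model are assumptions introduced here.

```python
import math


def jitter_angle(angular_velocities, dt):
    """Integrate angular-velocity samples (rad/s) taken every dt seconds."""
    return sum(w * dt for w in angular_velocities)


def lens_compensation(angle_rad, focal_length_mm):
    """Lens shift in mm, opposite in sign to the shake (assumed pinhole model)."""
    return -focal_length_mm * math.tan(angle_rad)
```

For example, `lens_compensation(jitter_angle(samples, 0.001), 4.0)` would give the reverse shift for a hypothetical 4 mm lens sampled at 1 kHz.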
The air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude from the air pressure value measured by the air pressure sensor 180C to assist positioning and navigation.
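One common way to turn a pressure reading into an altitude is the international barometric formula; the description does not say which formula the device uses, so the sketch below is an assumption. The function name and the default sea-level pressure of 1013.25 hPa are illustrative.

```python
def altitude_m(pressure_hpa, sea_level_hpa=1013.25):
    """Approximate altitude in meters from air pressure (barometric formula)."""
    return 44330.0 * (1.0 - (pressure_hpa / sea_level_hpa) ** (1.0 / 5.255))
```

Because the result depends on the reference sea-level pressure, which varies with the weather, such a value is better suited to assisting positioning than to providing an absolute height.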
The magnetic sensor 180D includes a Hall sensor. The electronic device 100 can use the magnetic sensor 180D to detect the opening and closing of a flip holster. In some embodiments, when the electronic device 100 is a flip phone, the electronic device 100 can detect the opening and closing of the flip cover by using the magnetic sensor 180D. Features such as automatic unlocking upon flipping open can then be set according to the detected open or closed state of the holster or of the flip cover.
The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally along three axes). When the electronic device 100 is stationary, the magnitude and direction of gravity can be detected. The acceleration sensor can also be used to recognize the posture of the electronic device, and is used in applications such as landscape/portrait switching and pedometers.
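A minimal sketch of landscape/portrait detection from a stationary accelerometer reading is shown below. The axis convention (x across the screen, y along it, z out of it), the function name, and the `flat_ratio` cutoff are assumptions for illustration only.

```python
import math


def screen_orientation(ax, ay, az, flat_ratio=0.8):
    """Classify device posture from the gravity vector measured at rest."""
    g = math.sqrt(ax * ax + ay * ay + az * az)
    if g == 0:
        return "unknown"          # no usable reading
    if abs(az) / g > flat_ratio:
        return "flat"             # gravity mostly through the screen
    # Gravity mostly along the long edge means the device is upright.
    return "portrait" if abs(ay) >= abs(ax) else "landscape"
```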
The distance sensor 180F is used to measure distance. The electronic device 100 can measure distance by infrared or laser. In some embodiments, in a shooting scenario, the electronic device 100 may use the distance sensor 180F to measure distance in order to achieve fast focusing.
The proximity light sensor 180G may include, for example, a light emitting diode (LED) and a light detector such as a photodiode. The light emitting diode may be an infrared light emitting diode. The electronic device 100 emits infrared light outward through the light emitting diode and uses the photodiode to detect infrared light reflected from nearby objects. When sufficient reflected light is detected, the electronic device 100 can determine that there is an object nearby. When insufficient reflected light is detected, the electronic device 100 can determine that there is no object nearby. The electronic device 100 can use the proximity light sensor 180G to detect that the user is holding the electronic device 100 close to the ear during a call, so that the screen is turned off automatically to save power. The proximity light sensor 180G can also be used in holster mode and pocket mode to automatically unlock and lock the screen.
The ambient light sensor 180L is used to sense the brightness of ambient light. The electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the sensed ambient light brightness. The ambient light sensor 180L can also be used to automatically adjust the white balance when taking photos. The ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, to prevent accidental touches.
The fingerprint sensor 180H is used to collect fingerprints. The electronic device 100 can use the collected fingerprint characteristics to implement fingerprint unlocking, application lock access, fingerprint-triggered photographing, fingerprint-based call answering, and the like.
The temperature sensor 180J is used to detect temperature. In some embodiments, the electronic device 100 executes a temperature handling policy based on the temperature detected by the temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 reduces the performance of a processor located near the temperature sensor 180J, in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is below another threshold, the electronic device 100 heats the battery 142 to prevent the low temperature from causing an abnormal shutdown of the electronic device 100. In still other embodiments, when the temperature is below yet another threshold, the electronic device 100 boosts the output voltage of the battery 142 to avoid an abnormal shutdown caused by the low temperature.
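The temperature handling policy can be sketched as a small rule table. The three threshold values and the action names below are hypothetical; the description only states that three separate thresholds exist, and presents the three behaviors as different embodiments that need not be combined in one device.

```python
def thermal_actions(temp_c,
                    throttle_above=45.0,   # hypothetical upper threshold
                    heat_below=0.0,        # hypothetical "another threshold"
                    boost_below=-10.0):    # hypothetical "yet another threshold"
    """Return the protective actions to take at the reported temperature."""
    actions = []
    if temp_c > throttle_above:
        actions.append("throttle_processor")      # reduce power, thermal protection
    if temp_c < heat_below:
        actions.append("heat_battery")            # avoid low-temperature shutdown
    if temp_c < boost_below:
        actions.append("boost_battery_voltage")   # raise battery output voltage
    return actions
```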
The touch sensor 180K is also called a "touch panel". The touch sensor 180K may be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 together form a touchscreen, also called a "touch-control screen". The touch sensor 180K is used to detect a touch operation acting on or near it. The touch sensor can pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to the touch operation can be provided through the display screen 194. In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 at a position different from that of the display screen 194.
The bone conduction sensor 180M can acquire vibration signals. In some embodiments, the bone conduction sensor 180M can acquire the vibration signal of the bone that vibrates with the human vocal part. The bone conduction sensor 180M can also be in contact with the human pulse and receive a blood pressure beat signal. In some embodiments, the bone conduction sensor 180M may also be disposed in a headset to form a bone conduction headset. The audio module 170 can parse out a voice signal from the vibration signal of the vocal-part bone acquired by the bone conduction sensor 180M, to implement a voice function. The application processor can parse heart rate information from the blood pressure beat signal acquired by the bone conduction sensor 180M, to implement a heart rate detection function.
The keys 190 include a power key, a volume key, and the like. The keys 190 may be mechanical keys or touch keys. The electronic device 100 can receive key input and generate key signal input related to user settings and function control of the electronic device 100.
The motor 191 can generate vibration prompts. The motor 191 can be used for incoming call vibration prompts and for touch vibration feedback. For example, touch operations acting on different applications (such as photographing and audio playback) may correspond to different vibration feedback effects. For touch operations acting on different areas of the display screen 194, the motor 191 may also produce different vibration feedback effects. Different application scenarios (for example, time reminders, message reception, alarm clocks, and games) may also correspond to different vibration feedback effects. The touch vibration feedback effect can also be customized.
The indicator 192 may be an indicator light, which may be used to indicate the charging status and battery level changes, and may also be used to indicate messages, missed calls, notifications, and the like.
The SIM card interface 195 is used to connect a SIM card. A SIM card can be inserted into or removed from the SIM card interface 195 to be brought into contact with or separated from the electronic device 100. The electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 can support Nano SIM cards, Micro SIM cards, SIM cards, and the like. Multiple cards can be inserted into the same SIM card interface 195 at the same time, and the cards may be of the same type or of different types. The SIM card interface 195 can also be compatible with different types of SIM cards and with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as calls and data communication. In some embodiments, the electronic device 100 uses an eSIM, that is, an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from it.
The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture. The embodiments of this application take an Android system with a layered architecture as an example to describe the software structure of the electronic device 100.
FIG. 2 is a block diagram of the software structure of the electronic device 100 according to an embodiment of this application. The layered architecture divides the software into several layers, each with a clear role and division of labor. The layers communicate with each other through software interfaces. In some embodiments, the Android system is divided into four layers, which from top to bottom are the application layer, the application framework layer, the Android runtime and system libraries, and the kernel layer. The application layer may include a series of application packages.
As shown in FIG. 2, the application packages may include applications such as Camera, Gallery, Calendar, Phone, Maps, Navigation, WLAN, Bluetooth, Music, Video, and Messages.
The application framework layer provides an application programming interface (API) and a programming framework for the applications at the application layer. The application framework layer includes some predefined functions.
As shown in FIG. 2, the application framework layer may include a window manager, a content provider, a view system, a phone manager, a resource manager, a notification manager, and the like.
The window manager is used to manage window programs. The window manager can obtain the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, and so on.
The content provider is used to store and retrieve data and make the data accessible to applications. The data may include videos, images, audio, calls made and received, browsing history and bookmarks, the phone book, and the like.
The view system includes visual controls, such as controls for displaying text and controls for displaying pictures. The view system can be used to build applications. A display interface may be composed of one or more views. For example, a display interface that includes a short message notification icon may include a view for displaying text and a view for displaying pictures.
The phone manager is used to provide the communication functions of the electronic device 100, for example, management of the call state (including connecting, hanging up, and the like).
The resource manager provides various resources for applications, such as localized strings, icons, pictures, layout files, and video files.
The notification manager enables applications to display notification information in the status bar. It can be used to convey informational messages that disappear automatically after a short stay, without user interaction. For example, the notification manager is used to announce that a download is complete or to provide message reminders. A notification may also appear in the status bar at the top of the system in the form of a chart or scroll bar text, such as a notification from an application running in the background, or may appear on the screen in the form of a dialog window. For example, text information is prompted in the status bar, a prompt tone sounds, the electronic device vibrates, or the indicator light flashes.
The Android runtime includes the core libraries and a virtual machine. The Android runtime is responsible for the scheduling and management of the Android system.
The core libraries consist of two parts: one part is the functions that the Java language needs to call, and the other part is the core libraries of Android.
The application layer and the application framework layer run in the virtual machine. The virtual machine executes the Java files of the application layer and the application framework layer as binary files. The virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, security and exception management, and garbage collection.
The system libraries may include a plurality of functional modules, for example, a surface manager, media libraries, a three-dimensional graphics processing library (for example, OpenGL ES), and a 2D graphics engine (for example, SGL).
The surface manager is used to manage the display subsystem and provides the blending of 2D and 3D layers for multiple applications.
The media libraries support playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media libraries can support a variety of audio and video encoding formats, such as MPEG4, H.264, MP3, AAC, AMR, JPG, and PNG.
The three-dimensional graphics processing library is used to implement three-dimensional graphics drawing, image rendering, compositing, layer processing, and the like.
The 2D graphics engine is a drawing engine for 2D drawing.
The kernel layer is the layer between hardware and software. The kernel layer contains at least a display driver, a camera driver, an audio driver, and a sensor driver.
For ease of understanding, the following embodiments of this application take an electronic device having the structures shown in FIG. 1 and FIG. 2 as an example and, with reference to the accompanying drawings and application scenarios, describe in detail the method for repeating words in a video provided by the embodiments of this application.
Most existing English learning resources stop at words, definitions, and example sentences. English learning applications make little use of video resources. Most English learning videos are obtained by splitting existing videos, and the resulting clips are only weakly related to English word learning, so it is inconvenient for users to learn English words or sentences through video resources during English learning.
Therefore, this application proposes a method for repeating a word or sentence in a video. Based on the English subtitles of a video, the method provides functions such as repeating and reading along with English words while the user watches the English video, which improves the user's English learning results and the user experience.
FIG. 3 is a schematic diagram of an example graphical user interface (GUI) for word repetition in a video according to an embodiment of this application. Taking a mobile phone as the electronic device, this application describes in detail the method provided herein for repeating a word or sentence in a video. Figure (a) in FIG. 3 shows that, in the unlocked mode of the mobile phone, the screen display system of the mobile phone displays the currently output interface content 301, which is the home screen of the mobile phone. The interface content 301 displays a plurality of third-party applications (apps), such as Alipay, Task Card Store, Album, WeChat, Card Pack, Settings, and Camera, as well as the application for English learning provided in the embodiments of this application, for example, Fun V English shown in figure (a) in FIG. 3. It should be understood that the interface content 301 may also include other applications, which is not limited in this application.
The user performs an operation on the English learning application. The operation may include the user tapping the icon of the English learning application displayed on the mobile phone. In response to the tap, the mobile phone enters the main interface of the English learning application shown in figure (b) in FIG. 3. The main interface may include multiple functional areas, such as a daily recommended word learning area that lists some words and their corresponding videos; the user can receive the daily push and tap a word to learn it. The top area of the main interface includes a search box 302, a browsing record control 303, and a message reminder control 304. The search box 302 is used by the user to enter a word and enter the learning mode for that word. The browsing record control 303 is used to record the user's search and learning history, so that the user can quickly find words that have already been learned. The message reminder control 304 may include messages pushed by the system and the like. The main interface may also include a scene classification area, for example, the scene categories shown in figure (b) in FIG. 3, such as restaurant, taxi, airplane, meeting, airport, hotel, and shopping mall. The user can tap any scene to enter that scene category and select an English learning video to study.
As shown in part (b) of FIG. 3, the user taps the search box 302 and enters the word to be learned, "message", and the display interface shown in part (c) of FIG. 3 is displayed. After the user enters the word, the interface automatically displays a navigation bar 305 for the word. The user taps the navigation bar 305 for "message" to enter the word analysis interface shown in part (d) of FIG. 3. In part (d) of FIG. 3, the analysis interface for "message" includes the British and American pronunciations, Chinese definitions, Chinese and English example sentences, and video example sentences. Optionally, the detailed content presented on the word analysis interface may come from an English dictionary built into the system, or may be linked to other online English dictionaries, which is not limited in this application.
The word analysis interface further includes a word adding control 306. Tapping the word adding control 306 adds the word to the wordbook; the user can then tap the word in the wordbook to quickly enter its word analysis interface, which simplifies the search operation.
As shown in part (d) of FIG. 3, the video example sentence area may display all learning video resources related to the word. Optionally, the learning video resources may include video resources under different scene categories. After the user enters the word to be learned, a list of English learning videos under different scene categories is obtained, and the user can tap any scene to obtain the English learning videos in that category. For example, the user can tap scene categories such as restaurant, taxi, and airplane to view video resources for the word in different scenes. The classification of video resources is not limited in this application.
In response to the tap operation shown in part (d) of FIG. 3, the user enters the video learning mode for "message". The video learning mode interface is shown in part (e) of FIG. 3; for example, the video is a 36-second clip excerpted from the film The Pursuit of Happyness. The video learning mode interface includes a video playback area for playing the learning video related to "message". It further includes the word analysis interface, which displays the definitions of "message" and other details, so that the user can study with the video scene and the Chinese definition together, which improves the learning effect. In addition, sliding the video learning mode interface upward may display recommended related videos of the same kind, as shown in area 308 in part (h) of FIG. 3. The recommended related videos may be learning videos related to the current word, for example, learning videos for "message" in other scenes, or may be other videos in the same scene as the current one. For example, if the currently playing learning video for "message" belongs to the restaurant scene category, the recommended related videos are other learning videos in the restaurant scene category. This is not limited in this application.
Optionally, during the video playback shown in part (e) of FIG. 3, the user can see where the English word to be learned occurs in time from markers on the video progress bar. For example, in the currently playing 36-second learning video for "message", a user-visible marker at the 9-second position on the progress bar marks the time slice in which "message" appears in the video. It should be understood that the learning video may include multiple markers, and the number of markers matches the number of times the word appears in the video. During playback, the user can also control the playback progress by dragging the progress bar; for example, when the learning video is long, the user can drag the progress bar to a position near the word's marker and start playback from there.
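The marker placement described above can be pictured as a simple proportion of the video duration. The following is a minimal sketch, not the applicant's implementation; the function name `marker_positions` and the idea of expressing positions as fractions of the bar width are assumptions made for illustration.

```python
def marker_positions(occurrence_times, duration):
    """Return progress-bar positions (0.0 to 1.0) for each time the word occurs.

    occurrence_times: start times, in seconds, at which the word appears (hypothetical input).
    duration: total length of the video in seconds.
    """
    if duration <= 0:
        raise ValueError("duration must be positive")
    # One marker per occurrence, so the marker count matches the occurrence count.
    return [t / duration for t in sorted(occurrence_times) if 0 <= t <= duration]


# The 36-second clip with "message" at second 9 yields one marker a quarter of the way along.
print(marker_positions([9.0], 36.0))
```

A player could multiply each fraction by the pixel width of the progress bar to draw the markers.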
In a possible implementation, the word being learned is highlighted during video playback. During the playback shown in part (e) of FIG. 3, when "message" appears in the subtitles, it is displayed differently from the other words in the subtitles, to draw the user's attention to the position of the word.
The video playback area for "message" includes a repeat setting control 306, as shown in part (f) of FIG. 3. After the user taps the repeat setting control 306, a repeat setting box 307 shown in part (g) of FIG. 3 may pop up in the video playback area. The repeat setting box 307 includes a loop count setting option and a subtitle setting option.
For example, the user can select what is repeated through the loop count setting option. The user can tap "None" to select the no-repeat mode, in which no word or sentence is repeated while the current learning video plays. Alternatively, the user can tap "Word" to select the word repeat mode: while the current learning video for "message" plays, when playback reaches "message", the audio and video frames corresponding to "message" are played in a loop. Alternatively, the user can tap "Sentence" to select the sentence repeat mode: while the current learning video for "message" plays, when playback reaches a sentence containing "message", the audio and video frames corresponding to that sentence are played in a loop. For example, part (h) of FIG. 3 includes the sentence containing "message": "Yes, I'd like to leave a message for Mr. Jay Twistle". In the sentence repeat mode, the audio and video frames corresponding to this sentence are played in a loop.
It should be understood that, while the learning video plays as described above, when playback reaches the time slice containing the word, the word is highlighted and repeated automatically; after the repetition completes, the video continues to play unaffected.
In a possible implementation, the number of loops may be set by the user in the background or may be a system default. If the user has not set the number of loops, the system default of 3 may be used. This is not limited in this application.
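The three repeat modes and the default loop count can be sketched as a playback plan, that is, the ordered list of segments the player renders. This is an illustrative sketch only: the names `Span` and `playback_plan`, the example time spans, and the reading of "3 loops" as three plays of the span in total are assumptions, not details given in the text.

```python
from dataclasses import dataclass

DEFAULT_LOOPS = 3  # system default mentioned in the text


@dataclass(frozen=True)
class Span:
    start: float  # seconds
    end: float    # seconds


def playback_plan(duration, word, sentence, mode="none", loops=None):
    """Return the ordered (start, end) segments to play for the chosen repeat mode."""
    if mode not in ("none", "word", "sentence"):
        raise ValueError(f"unknown repeat mode: {mode}")
    if mode == "none":
        return [(0.0, duration)]
    span = word if mode == "word" else sentence
    count = DEFAULT_LOOPS if loops is None else loops
    plan = [(0.0, span.start)] if span.start > 0 else []
    # Assumption: the span is played `count` times in total, then playback continues.
    plan += [(span.start, span.end)] * count
    if span.end < duration:
        plan.append((span.end, duration))
    return plan


# Hypothetical spans inside the 36-second clip: the word at 9.0-9.6 s, its sentence at 7.5-11.0 s.
for segment in playback_plan(36.0, Span(9.0, 9.6), Span(7.5, 11.0), mode="word"):
    print(segment)
```

Returning a plan rather than driving a player directly keeps the sketch independent of any particular media framework; a real player would seek back to the span start at the end of each pass.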
In addition, the user can set how subtitles are presented through the subtitle setting option. For example, the user can tap the first subtitle control in the repeat setting box 307, which corresponds to the no-subtitle mode, in which neither Chinese nor English subtitles are displayed while the learning video plays. Alternatively, the user can tap the second subtitle control, "A", which corresponds to the English subtitle mode, in which only English subtitles are displayed. Alternatively, the user can tap the third subtitle control, "A+", which corresponds to the full subtitle mode, in which English and Chinese subtitles are displayed at the same time.
It should be understood that, during video playback, the video is paused whenever the user taps the repeat setting control to set the repeat mode and repeat count, or taps any word in the subtitles to enter the learning mode for that word so that the word analysis box for the word pops up.
It should also be understood that the foregoing embodiments use a single word, such as "message", as the example of what the user learns. In practice, the user may also enter other text units, such as phrases and sentences, which is not limited in this application.
In a possible implementation, after the user taps to close the repeat setting control, or closes the word analysis box for the key word to exit the learning mode for that word, the video may resume playing automatically, or the video may remain paused so that the user taps the playback control on the video display interface to continue. This is not limited in this application.
The method provided in this application for repeating a word or sentence in a video, as described above, gives the user an environment better suited to learning English. With this method, the user can select video resources in different scenes according to the English word to be learned. During video learning, based on the English subtitles of the video, the user can repeat and read along with English words while watching the English video, which improves the user's English learning effect and the user experience.
In another possible implementation, besides applications dedicated to English learning (such as the aforementioned Fun V English), the method described above for repeating a word or sentence in a video may also be applied to video playback applications, such as Youku, Tencent Video, and YouTube, which is not limited in this application. FIG. 4 is a schematic diagram of another example user interface for learning words while watching a film according to this application, and is described below.
For example, part (a) of FIG. 4 shows that, with the mobile phone unlocked, the screen display system of the mobile phone displays currently output interface content 401, which is the home screen of the mobile phone. The interface content 401 displays multiple third-party applications, including an application for watching films, for example, Huawei's film and television application Huawei Movies. The user taps Huawei Movies to enter the Huawei Movies display interface shown in part (b) of FIG. 4. As shown in the figure, the interface may include film and television resources in various categories as well as various recommended resources, such as the featured film The Pursuit of Happyness shown in the figure. Tapping the recommended film enters the playback mode for that film.
For example, during film playback, when the audio reaches any word, that word is highlighted in the subtitles below the video. In the playback shown in part (c) of FIG. 4, when the audio reaches "close", the word "close" is displayed differently from the other words in the subtitles. This makes the position of each word in the audio and in the subtitles explicit, and helps the user learn the pronunciation and meaning of the word.
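The word-by-word highlighting described above amounts to checking which word's time span contains the current playback position. A minimal sketch follows; the function name, the per-word timing tuples, the example sentence, and the use of square brackets to stand in for visual highlighting are all assumptions for illustration.

```python
def highlight_current_word(timed_words, position):
    """Render a subtitle line, marking the word whose span contains `position`.

    timed_words: list of (word, start, end) tuples in seconds (hypothetical format).
    position: current playback time in seconds.
    """
    parts = []
    for word, start, end in timed_words:
        # Brackets stand in for a different color or weight in a real player.
        parts.append(f"[{word}]" if start <= position < end else word)
    return " ".join(parts)


# Hypothetical subtitle line containing "close".
line = [("Please", 0.0, 0.4), ("close", 0.4, 0.9), ("the", 0.9, 1.0), ("door", 1.0, 1.5)]
print(highlight_current_word(line, 0.5))
```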
For example, the film playback interface may include a loop count setting control 402 and a subtitle setting control 403. As before, the user can select what is repeated through the loop count setting control. The user can tap "None" to select the no-repeat mode, in which no word or sentence is repeated while the current video plays. Alternatively, the user can tap "Word" to select the word repeat mode. Alternatively, the user can tap "Sentence" to select the sentence repeat mode, in which, once the setting is made during playback, the audio and video frames corresponding to the sentence before or after the moment of setting are played in a loop.
It should be understood that the repeat setting here may default to the audio and video frames of the word or sentence that follows the time slice at which the user opened the setting, or may default to those of the word or sentence that precedes it. The user can also change the relationship between the repeated word or sentence and the time slice at which the setting mode was opened. This is not limited in this application.
For example, if while watching the film the user does not hear a sentence or word clearly, or wants to learn that sentence or word, the user can directly tap the repeat setting control and set the repeat mode and repeat count, so that on leaving the repeat settings the previous sentence or previous word is repeated directly.
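Choosing the previous or next word or sentence relative to the moment the settings were opened can be sketched as a lookup over subtitle cue start times. This is only one possible reading: the function name `pick_cue`, the cue tuple format, and treating "previous" as the most recent cue that started at or before that moment are assumptions.

```python
from bisect import bisect_right


def pick_cue(cues, position, direction="previous"):
    """Pick the cue to repeat relative to `position` (seconds).

    cues: list of (start, end, text) tuples sorted by start time (hypothetical format).
    direction: "previous" for the latest cue that started at or before `position`,
               "next" for the first cue that starts after it.
    """
    if direction not in ("previous", "next"):
        raise ValueError(f"unknown direction: {direction}")
    starts = [cue[0] for cue in cues]
    i = bisect_right(starts, position)  # number of cues starting at or before position
    if direction == "previous":
        return cues[i - 1] if i > 0 else None
    return cues[i] if i < len(cues) else None


cues = [(0.0, 2.0, "first line"), (2.5, 5.0, "second line"), (6.0, 8.0, "third line")]
print(pick_cue(cues, 5.5, "previous"))
print(pick_cue(cues, 5.5, "next"))
```

The same lookup works for word-level cues in the word repeat mode; only the granularity of the cue list changes.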
While watching the film, the user may encounter unfamiliar words. In a possible implementation, when the user wants to learn an unfamiliar word that appears in the video, the user can tap that word in the video subtitles. As shown in part (d) of FIG. 4, if the user wants to learn "close", the user can tap "close" in the subtitles on the video playback interface to enter the interface shown in part (e) of FIG. 4. That is, the user can enter the learning mode for a word by tapping it in the subtitles; the pop-up box 404 in part (e) of FIG. 4 shows the analysis of "close" and includes a "Details" control 405 and an add-to-wordbook control 406. The "Details" control 405 lets the user quickly enter the analysis interface for the word shown in part (f) of FIG. 4. The analysis interface includes the British and American pronunciations, Chinese definitions, Chinese and English example sentences, related videos, and so on, and the user can view the learning content and video resources related to the word there. Similarly, the detailed content presented on the word analysis interface may come from an English dictionary built into the system, or may be linked to other online English dictionaries, which is not limited in this application. In addition, the user can tap a video resource for the word to study it; for the specific operations, refer to the description of FIG. 3 above, which is not repeated here.
It should be understood that, besides tapping a word in the video subtitles, the user can also tap a phrase. For example, some words occur almost exclusively as part of a phrase, in which case tapping may bring up the analysis of the phrase. For instance, when the detailed content presented on the word analysis interface is linked to an English dictionary in which the word mainly appears as part of a phrase, tapping the word while watching the video may pop up the analysis or definition of that phrase. This is not limited in this application.
It should also be understood that, during video playback, the video is paused whenever the user taps the repeat setting control to set the repeat mode and repeat count, or taps any word in the subtitles to enter the learning mode for that word so that the word analysis box for the word pops up.
In a possible implementation, the subtitles of the learning video and its progress bar are presented at different positions in the playback interface. If the subtitles and the progress bar were displayed in the same area of the playback interface, tapping a word in the subtitles might work poorly; for example, the tap might accidentally hit the progress bar. This is especially noticeable when the display of the electronic device is small or the interface used to play the learning video is small. Presenting the subtitles and the progress bar at different positions therefore improves the responsiveness of user operations and the user experience. For example, as shown in parts (c) and (d) of FIG. 4, the progress bar is displayed at the top of the screen and the subtitles at the bottom; other positions in the playback interface are also possible. The positions of the progress bar and the subtitles are not limited in this application.
In a possible implementation, after the user taps to close the repeat setting control 402 or the subtitle setting control 403, or closes the word analysis box 404 for the key word to exit the learning mode for that word, the video may resume playing automatically without the user tapping the playback control again. Alternatively, the video may remain paused, and the user can tap the playback control on the video display interface to continue. This is not limited in this application.
The method described above for learning words while watching a film makes it possible to learn English words during ordinary viewing. The user can tap into word learning at any time as needed, which simplifies the search step of word learning, makes learning more convenient, and improves the user experience.
When the user is watching a film for leisure and wants to learn a word in the subtitles, the user can use the above method: tap the word in the subtitles to enter its learning mode. In another scenario, the user may need to study certain key words in a targeted way, for example, multiple words from a vocabulary set such as the CET-4 or CET-6 vocabulary or the IELTS vocabulary. For this scenario, this application further provides a word learning method that offers the user focused learning of the multiple words included in a vocabulary set.
FIG. 5 is a schematic diagram of another example user interface for learning words while watching a film according to an embodiment of this application. For a given video resource, all words in the video that belong to a certain vocabulary set can be extracted in advance. For example, for the film Pirates of the Caribbean shown in part (a) of FIG. 5, all CET-6 vocabulary words in the film's subtitles are extracted to form a key word set, shown as the word list 503 in the key word area in the figure.
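Pre-extracting the key words of a video can be pictured as intersecting the subtitle text with a vocabulary set. The following is a minimal sketch under stated assumptions: the function name `extract_key_words`, the regular-expression tokenizer, and the sample subtitle lines and vocabulary are all hypothetical, and no lemmatization is attempted.

```python
import re


def extract_key_words(subtitle_lines, vocabulary):
    """Return the vocabulary words that occur in the subtitles, in order of first appearance."""
    vocab = {w.lower() for w in vocabulary}
    found = {}  # dict preserves insertion order, so duplicates are dropped naturally
    for line in subtitle_lines:
        for token in re.findall(r"[a-z']+", line.lower()):
            if token in vocab:
                found.setdefault(token, None)
    return list(found)


# Hypothetical subtitle lines and a tiny stand-in for a vocabulary set.
lines = ["We must abandon ship!", "Abandon all hope."]
print(extract_key_words(lines, {"abandon", "hope", "voyage"}))
```

The length of the returned list could also serve as the per-film key word count when comparing films.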
Optionally, the word set 503 is displayed in the interface of the video selected by the user. The user can set the type of the word set. For example, the user can tap the "Key words" control to set the word list 503 to the CET-6 vocabulary or the IELTS vocabulary. Alternatively, the user can tap words in the word list 503 to mark the tapped words as key words for video learning. Alternatively, the user can tap words in the word list 503 so that all words in the list except the tapped ones are marked as key words for video learning. This is not limited in this application.
With the above method, before selecting a film, the user can view all the key words contained in each film and tap to select the key words to learn. The user can also choose a film based on the number of key words, for example, by selecting the film with the most key words as the one to watch and tapping to enter its learning mode.
In a possible implementation, the user can see where the key words of the film occur in time from markers on the video progress bar. While watching, the user can drag the progress bar to find the position of a key word.
In a possible implementation, the key words of the film are highlighted during video playback. During the playback shown in part (b) of FIG. 5, when the key word "abandon" appears in the subtitles, it is displayed differently from the other words in the subtitles, to draw the user's attention to the position and pronunciation of the word. For example, the subtitles as a whole are displayed in black, and a key word appearing in the subtitles is displayed in blue, so that it stands out and the user notices its position and pronunciation.
In a possible implementation, the video playback area may include a loop count setting control 501 and a subtitle setting control 502, as shown in part (b) of FIG. 5. The user can tap the loop count setting control 501 to set the number of repetitions of the key word, and tap the subtitle setting control 502 to set how the film's subtitles are presented. These functions are similar to those of the repeat setting control 306 described for FIG. 3 and, for brevity, are not repeated here.
Optionally, the key words of the film may be looped by default. For example, when playback reaches a key word, the word is played 3 times by default, which reduces the user's setup steps and improves the learning effect.
In a possible implementation, the user can tap any word in the film's subtitles to enter the learning mode for that word, such as the word analysis box for "abandon" that pops up after "abandon" is tapped, as shown in part (b) of FIG. 5. As before, the word analysis box includes a details control and an add-to-wordbook control, and the user can tap the details control to enter the learning interface for "abandon"; details are not repeated here.
Alternatively, when the film reaches a key word, the word analysis box 504 for that key word pops up automatically, as shown in part (b) of FIG. 5. Optionally, the word analysis box 504 may be displayed for a fixed duration; for example, it is displayed for 5 seconds and then closed automatically. This is not limited in this application.
It should be understood that, during video playback, the video is paused whenever the word analysis box 504 pops up and the learning mode for a word is entered. Specifically, the video is paused both when the user taps any word in the subtitles to enter the learning mode for that word and when, as described above, the word analysis box 504 for a key word pops up automatically as playback reaches that key word.
In a possible implementation, after the learning mode for the word is exited, either because the user taps to close the word analysis box 504 for the key word or because the box closes automatically once its display time reaches the set fixed duration, the video may resume playing after the box is closed, or the video may remain paused so that the user taps the playback control on the video display interface to continue. This is not limited in this application.
It should be understood that, while the learning video plays as described above, when playback reaches the frame in which the word appears, the word is highlighted and repeated automatically, and after the repetition completes the video continues to play unaffected. Alternatively, when playback reaches the frame in which the word appears, the word analysis box for the key word pops up, and after the box has been displayed for the preset duration the video continues to play unaffected.
With the method described above for repeating a word or sentence in a video, the user can, based on the English subtitles of the video and while watching the English video, use capabilities such as the word index and the player's rewind function to repeat and read along with English words, which improves the user's English learning effect and the user experience.
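The combination of a word index and player rewind can be sketched as two lookups: one index from words to the videos and time spans in which they occur, one from each video to its words, plus a helper that finds where to seek back to. This is an illustrative sketch only; the function names, the transcript format, and the sample timings are assumptions, and the timed words are taken to come from speech recognition output.

```python
from collections import defaultdict


def build_word_index(transcripts):
    """Build word -> {video_id: [(start, end), ...]} and video_id -> {words} indexes.

    transcripts: {video_id: [(word, start, end), ...]} (hypothetical recognizer output).
    """
    word_to_videos = defaultdict(lambda: defaultdict(list))
    video_to_words = defaultdict(set)
    for video_id, timed_words in transcripts.items():
        for word, start, end in timed_words:
            key = word.lower()
            word_to_videos[key][video_id].append((start, end))
            video_to_words[video_id].add(key)
    return word_to_videos, video_to_words


def rewind_target(word_to_videos, word, video_id, position):
    """Start time of the latest occurrence of `word` that began at or before `position`."""
    spans = word_to_videos.get(word.lower(), {}).get(video_id, [])
    starts = [start for start, _ in spans if start <= position]
    return max(starts) if starts else None


transcripts = {
    "clip1": [("Yes", 7.5, 7.8), ("message", 9.0, 9.6)],
    "clip2": [("message", 3.0, 3.5), ("message", 12.0, 12.4)],
}
w2v, v2w = build_word_index(transcripts)
print(sorted(w2v["message"]))                       # videos that contain the word
print(rewind_target(w2v, "message", "clip2", 13.0))  # where to seek back to
```

Searching a word then reduces to reading `w2v[word]`, and repeating it reduces to seeking the player to the returned start time.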
It should be understood that the foregoing embodiments use a single word as the example of what the user learns. In practice, the user may also enter other text units, such as phrases and sentences, which is not limited in this application.
It should also be understood that this document uses English learning as the example for describing how words or sentences can be repeated while the user learns. The method applies equally to video-based learning of other languages, which is not limited in this application.
The human-computer interaction embodiments of this application have been described in detail above with reference to FIG. 3 to FIG. 5. To aid understanding of the method provided in this application for repeating a word or sentence in a video, the specific implementation process and the underlying algorithms are described below.
In a specific implementation, the word or sentence repetition method provided in this application relies on speech recognition technology to build a word search function associated with videos. It generates an index of correspondences from multiple words to multiple videos, and an index from a single video to multiple words, so that the user can search from a word to the related learning videos. In addition, this application uses the word search function and the player's rewind capability to implement the repeat function by locating the start and end times and the key frames of a word. Specifically, the following implementation steps are included:
Step 1: Generate an acoustic model
It should be understood that the acoustic model is one of the most important parts of a speech recognition system. In speech recognition, the acoustic model represents the relationship between the sound signal and phonemes, or the associations between the linguistic units that make up speech. A phoneme is the smallest unit of pronunciation. Most current mainstream systems use a hidden Markov model (HMM) for modeling, and the HMM is the most common acoustic model. Conceptually, a hidden Markov model is a discrete-time finite state automaton; "hidden" means that the internal states of the Markov model are not visible from outside, and only the output value at each moment can be observed. FIG. 6 is a schematic diagram of an example HMM. This application uses the HMM as an example. In FIG. 6, 1 to 6 denote the phonemes of a word, where 1 and 6 are the head and tail of the word. Based on the probabilities, the HMM can derive the optimal sequence of phonemes, words, and sentences.
For example, to recognize the audio signal of "good morning", following the basic steps of speech recognition, the two words first need to be split into phonemes, for example:
"good" consists of 3 phonemes, written as: G IH0 D
"morning" consists of 6 phonemes, written as: M AO1 R N IH0 NG
A model is then trained for each phoneme, using a large number of speech signals. This application uses existing models, including the monophone model and the triphone model. The monophone model uses one HMM to represent one phoneme; the triphone model uses one HMM to represent a phoneme together with its left and right neighbors, that is, three phonemes. This is needed because pronunciation changes in connected speech. In English, for example, two words spoken together can produce a new sound: "can" and "I" spoken together sound much like "can nai", so multiple phonemes are needed to represent the pronunciation of "can I".
Step 2: Forced alignment
Forced alignment is a technique that takes an audio file together with the correctly spelled dictionary words and their pronunciations, and generates the time points at which they occur. Specifically, forced alignment uses the acoustic model described above and the candidate words. It must determine how the words are arranged, how the acquired audio signal maps to phonemes, and how the acoustic models are concatenated. For example:
"good morning" produces the phonemes: G IH0 D M AO1 R N IH0 NG
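As a minimal sketch of the phoneme expansion above, the lookup can be written as a dictionary from words to phoneme lists. The names `LEXICON` and `to_phonemes` are hypothetical; the two entries are the ones given in the text, and a real system would presumably use a full pronunciation dictionary.

```python
# Hypothetical pronunciation lexicon; the entries are the two examples from the text.
LEXICON = {
    "good": ["G", "IH0", "D"],
    "morning": ["M", "AO1", "R", "N", "IH0", "NG"],
}


def to_phonemes(text):
    """Expand a transcript into the flat phoneme sequence used for alignment."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(LEXICON[word])  # raises KeyError for unknown words
    return phonemes


print(" ".join(to_phonemes("good morning")))
```

The concatenated sequence is what the aligner would then match against the audio frame by frame.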
To implement step 1 and step 2, this application uses Kaldi, an open-source speech recognition toolkit (see http://kaldi-asr.org/doc/index.html). FIG. 7 is a flowchart of generating the acoustic model and performing forced alignment as provided in this application. The process includes:
701. Import a sample library, generate monophones, and train the monophone model.
Specifically, 701 includes a feature extraction process and an acoustic model building process. For feature extraction, a large sample library covering many different contexts is collected. The previously prepared files related to the language model are imported, features are extracted from the samples, and a Gaussian mixture model (GMM) is trained. Maximum likelihood estimation is performed on the GMM-based acoustic model, and the GMM is re-estimated iteratively, with the results computed on different processors being merged.
702. Based on context, generate the triphone model on top of the monophone model.
Specifically, "good", for example, consists of 3 phonemes, so a monophone model needs only 3 HMMs. However, because of coarticulation, the neighboring phonemes affect how the current central phoneme is pronounced, so its pronunciation in context differs from its pronunciation in isolation. Taking this effect into account with a triphone model yields thousands of HMMs and can improve the accuracy of speech recognition.
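The step from monophones to context-dependent triphones can be sketched as follows. The `left-center+right` label notation, the `sil` boundary symbol, and the function name `to_triphones` are assumptions made for illustration; they are not taken from this application or from Kaldi's internal representation.

```python
def to_triphones(phonemes, boundary="sil"):
    """Label each phoneme with its left and right neighbours as 'left-center+right'."""
    padded = [boundary] + list(phonemes) + [boundary]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


print(to_triphones(["G", "IH0", "D"]))
```

Because every phoneme is now distinguished by its neighbours, the number of distinct labels grows quickly with the size of the phoneme inventory, which is why the text speaks of thousands of HMMs and why the models are subsequently tied or clustered.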
The traditional triphone approach is model tying, that is, normalizing the triphones and applying a posterior smoothing. Alternatively, if the contexts are of similar pronunciation types, their effect on the current phoneme is similar, and the corresponding data can be clustered. Kaldi can automatically generate the question set and automatically cluster phonemes according to the similarity of their data.
703. Perform LDA+MLLT to learn the maximum differences between phonemes and optimize feature extraction.
Specifically, linear discriminant analysis (LDA) projects the feature vectors into a lower-dimensional space so that the projected points are separated by class, with points of the same class lying closer together. In other words, LDA uses a transformation matrix to reduce the dimensionality of the feature vectors so that the distribution within each class is compact and the distributions of different classes are far apart. The extracted features are then more representative and classification improves.
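For reference, the LDA objective described above is usually written in the following textbook form. This formulation is not given in the application; the symbols $S_W$ (within-class scatter), $S_B$ (between-class scatter), $\mu_c$ (class mean), $\mu$ (global mean), $N_c$ (class size), and $W$ (projection matrix) are introduced here only for illustration.

```latex
S_W = \sum_{c} \sum_{x \in c} (x - \mu_c)(x - \mu_c)^{\top},
\qquad
S_B = \sum_{c} N_c \, (\mu_c - \mu)(\mu_c - \mu)^{\top}

W^{*} = \arg\max_{W} \frac{\left| W^{\top} S_B W \right|}{\left| W^{\top} S_W W \right|},
\qquad
y = W^{*\top} x
```

Maximizing the ratio makes the projected classes compact (small $W^{\top} S_W W$) and well separated (large $W^{\top} S_B W$), which matches the qualitative description in the text.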
Maximum likelihood linear transformation (MLLT) uses a linear transformation matrix to decorrelate the parameter feature vectors under the maximum likelihood (ML) criterion, which increases the likelihood of the training set under the model in the new space and optimizes feature extraction.
In addition, 703 introduces speaker adaptation information for multiple speakers to enhance the triphone model and improve the accuracy of the algorithm.
It should be understood that each triphone model ultimately corresponds to a segment of the sound signal whose start and end times are determined; these start and end times are the phoneme-level alignment times.
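How an HMM yields phoneme-level start and end times can be illustrated with a toy Viterbi alignment. This is only a sketch under stated assumptions, not Kaldi's implementation: the frame labels stand in for real acoustic features, all probabilities are made up, and the 10 ms frame shift `FRAME_MS` is an assumed value.

```python
from itertools import groupby

FRAME_MS = 10  # assumed frame shift; not specified in the text


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state sequence for the observed frames."""
    # Each column maps state -> (best path probability, previous state).
    columns = [{s: (start_p.get(s, 0.0) * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        prev = columns[-1]
        column = {}
        for s in states:
            prob, back = max((prev[r][0] * trans_p[r].get(s, 0.0), r) for r in states)
            column[s] = (prob * emit_p[s][obs], back)
        columns.append(column)
    # Trace back from the most probable final state.
    state = max(states, key=lambda s: columns[-1][s][0])
    path = [state]
    for column in reversed(columns[1:]):
        state = column[state][1]
        path.append(state)
    return path[::-1]


def to_segments(path, frame_ms=FRAME_MS):
    """Collapse a per-frame state path into (phoneme, start_ms, end_ms) runs."""
    segments, frame = [], 0
    for state, run in groupby(path):
        length = len(list(run))
        segments.append((state, frame * frame_ms, (frame + length) * frame_ms))
        frame += length
    return segments


# Toy left-to-right model for "good" (G IH0 D); all probabilities are made up.
STATES = ["G", "IH0", "D"]
START = {"G": 1.0}
TRANS = {"G": {"G": 0.5, "IH0": 0.5}, "IH0": {"IH0": 0.5, "D": 0.5}, "D": {"D": 1.0}}
EMIT = {
    "G": {"g": 0.8, "ih": 0.1, "d": 0.1},
    "IH0": {"g": 0.1, "ih": 0.8, "d": 0.1},
    "D": {"g": 0.1, "ih": 0.1, "d": 0.8},
}
FRAMES = ["g", "g", "ih", "ih", "ih", "d"]  # stand-in for per-frame acoustic features

alignment = to_segments(viterbi(FRAMES, STATES, START, TRANS, EMIT))
```

Each tuple in `alignment` gives one phoneme with its start and end time in milliseconds; word-level times follow by taking the start of a word's first phoneme and the end of its last.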
Step 3: Generate a word time series
Steps 1 and 2 above establish the acoustic model using the algorithms of the Kaldi open-source toolkit. Step 3 preprocesses the audio files that this application uses for learning English. Using the triphone model generated for an audio file, a "word + time" list associated with the audio file is output. FIG. 8 is a flowchart of an example of generating a word time series according to an embodiment of this application. The generation process includes:
801. Import the subtitle file, extract the words one by one, and generate an acoustic model for each word.
802. Import the audio file, generate an acoustic model of the entire audio file, and determine from it the words that may occur.
803. Compare the acoustic model of the audio file with the acoustic models of the words one by one. For each matching word, output the word and its start and end times; discard words that do not match. The result is the word sequence corresponding to the audio file.
Through step 3, the administrator's background processing of an English video resource yields the start and end times of every word, so that the time slice of each word is located accurately and a file mapping each word to the audio is obtained.
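The matching in 803 can be sketched at the word level as follows. The real comparison is between acoustic models; this simplified sketch assumes the aligner has already produced `(word, start_ms, end_ms)` tuples, and the function name and data layout are hypothetical.

```python
def word_time_series(subtitle_words, recognized):
    """Keep recognized words that also occur in the subtitles, with their times.

    recognized: iterable of (word, start_ms, end_ms) tuples from the aligner.
    """
    vocabulary = {w.lower() for w in subtitle_words}
    return [
        (word.lower(), start_ms, end_ms)
        for word, start_ms, end_ms in recognized
        if word.lower() in vocabulary  # non-matching words are discarded
    ]


# Illustrative data only; the times are invented.
subtitles = ["leave", "a", "message"]
aligned = [("leave", 4000, 4300), ("uh", 4300, 4400), ("a", 4400, 4450), ("message", 4450, 5200)]
series = word_time_series(subtitles, aligned)
```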
Step 4: Generate a content association index
FIG. 9 is a schematic diagram of an example content association index according to an embodiment of this application. Taking level-4 (CET-4) vocabulary as an example, FIG. 9 shows the word-to-audio correspondence file obtained after the preceding steps.
Specifically, in step 4, an English video resource has an English subtitle file and an audio file; the subtitle file contains an index from each word to its time slice, and the audio file contains a time index. Using the time slice information, index relationships are established between the CET-4 words among all words of the video resource and the same words in the audio file, and multiple content association index tables are generated: an index from a single word to multiple videos, and an index from a single video to multiple words. For example, in Table 1 and Table 2 of FIG. 9, Table 1 represents the correspondence from a single word to videos and lets the user find related English learning videos by searching for a word; Table 2 represents the correspondence from a single video to words and is used, while the user watches an English video, to display the English words or sentences the user wants to learn and to implement the repeat function.
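The two index tables described above can be sketched as a pair of dictionaries. The function name, the video identifiers, and the tuple layout are hypothetical and chosen only to illustrate the word-to-videos and video-to-words mappings.

```python
from collections import defaultdict


def build_indexes(word_times_by_video, vocabulary):
    """Build word -> videos and video -> words indexes, restricted to a vocabulary list."""
    word_to_videos = defaultdict(list)
    video_to_words = defaultdict(list)
    for video_id, word_times in word_times_by_video.items():
        for word, start_ms, end_ms in word_times:
            if word in vocabulary:
                word_to_videos[word].append((video_id, start_ms, end_ms))
                video_to_words[video_id].append((word, start_ms, end_ms))
    return dict(word_to_videos), dict(video_to_words)


# Illustrative data only; identifiers and times are invented.
word_times = {
    "video_a": [("leave", 4000, 4300), ("message", 4450, 5200)],
    "video_b": [("message", 900, 1500), ("the", 1500, 1600)],
}
by_word, by_video = build_indexes(word_times, {"message", "abandon"})
```

A word search then reads `by_word["message"]`, while the player for a given video reads `by_video[video_id]` to know which words to highlight and repeat.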
Step 5: Generate content metadata
Content metadata may refer to a word input by the user and its start and end times, or to the words contained in the English video that the user is watching and their start and end times. When the client requests content metadata, the video-to-word correspondence is queried, and the word start and end times are merged into the metadata before it is returned. It should be understood that in this application all start and end times are in milliseconds.
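A minimal sketch of the merge described above, assuming the video-to-word index is a dictionary of `(word, start_ms, end_ms)` tuples. The field names `video_id`, `keywords`, `start_ms`, and `end_ms` are hypothetical; the application does not specify the response format.

```python
def content_metadata(video_id, base_metadata, video_to_words):
    """Merge word timings for a video into its metadata before returning it."""
    metadata = dict(base_metadata)  # copy so the stored record is not mutated
    metadata["video_id"] = video_id
    metadata["keywords"] = [
        {"word": word, "start_ms": start_ms, "end_ms": end_ms}
        for word, start_ms, end_ms in video_to_words.get(video_id, [])
    ]
    return metadata


# Illustrative data only.
index = {"video_a": [("message", 4450, 5200)]}
response = content_metadata("video_a", {"title": "Sample clip"}, index)
```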
In summary, steps 1 to 5 complete the generation of content metadata: a speech recognition algorithm extracts word-level start and end times from the audio of a video and produces the correspondence between words and video content. Words or sentences can then be repeated at the user's request.
Step 6: Locate the key frame based on the word start and end times, start a timed task, and enable the repeat function
After the user requests and obtains from the cloud the content metadata, which contains the keywords in the content and their start and end times, the client uses each keyword and its start and end times to locate the key frame position for that keyword, and then repeats the word or sentence according to the user's settings.
FIG. 10 is a flowchart of the word or sentence repetition process according to an embodiment of this application. As shown in FIG. 10, the process includes:
1001. Obtain the content metadata and locate the keyword according to the keywords and their start and end times contained in the content metadata. Specifically, the client receives a user instruction and requests the content metadata from the cloud. The time slice of the keyword is determined from the time information of the key sentence containing the keyword in the content metadata. It should be understood that the key sentence here is the sentence in which the keyword occurs, and that the time slice here and the start and end times above are in milliseconds.
1002. Start playback.
In response to the user's tap, the learning video is played. Specifically, the user may load a playback link into the player to start playing the video. In addition, the client looks up the time of the key sentence by keyword and retrieves the current repeat mode to determine whether it is word repeat or sentence repeat.
It should be understood that in this application the video resource may be stored in the cloud and obtained by the user through a request to the cloud, or may be a local resource, which is not limited in this application.
1003. Start timing.
During playback, when the key sentence time is reached, a timed task is started, and it is triggered at the end time of the keyword.
1004. Locate the start frame of the keyword and retrieve the repeat count.
Determine the current repeat mode and call the playback engine, using the forward frame positioning mode.
Specifically, when the start time does not coincide with a video key frame, the key frame is found by seeking backward: playback rewinds to the keyword start position, the key frame is located, and playback starts from that key frame to perform the repeat.
When the current repeat mode is word repeat, the key frame is the video frame corresponding to the start time of the keyword; when it is sentence repeat, the key frame is the video frame corresponding to the start time of the key sentence containing the keyword.
In addition, the repeat count is retrieved to determine how many times to seek back to the key frame.
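The key frame lookup in 1004 can be sketched as a search over a sorted list of key frame timestamps. This assumes the player exposes such a list; the function names and the `"word"` / `"sentence"` mode strings are hypothetical.

```python
from bisect import bisect_right


def keyframe_at_or_before(keyframes_ms, target_ms):
    """Return the last key frame timestamp at or before target_ms.

    keyframes_ms must be sorted in ascending order.
    """
    index = bisect_right(keyframes_ms, target_ms) - 1
    if index < 0:
        raise ValueError("no key frame at or before the target time")
    return keyframes_ms[index]


def repeat_start(mode, word_start_ms, sentence_start_ms):
    """Pick the seek target according to the repeat mode ('word' or 'sentence')."""
    return word_start_ms if mode == "word" else sentence_start_ms


# Illustrative values only: key frames every 2 s, keyword starting at 4.45 s.
keyframes = [0, 2000, 4000, 6000]
seek_to = keyframe_at_or_before(keyframes, repeat_start("word", 4450, 3200))
```

Seeking to the key frame at or before the start time, rather than to the start time itself, reflects the rewind behaviour described for the case where the start time does not fall on a key frame.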
1005. Start repeating and rewind playback.
Read the current repeat mode and repeat count, start forward playback, and count one play. For example, when the repeat count defaults to 3 and the accumulated count is detected to be less than 3, the player seeks back to the key frame again and plays from it.
1006. End repeating and continue playback.
When the accumulated count matches the configured count, for example when the repeat count defaults to 3 and the accumulated count is found to be greater than or equal to 3, repeating stops and the video continues playing forward.
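The counting logic of 1005 and 1006 can be sketched as a small controller that the timed task calls whenever playback reaches the end of the segment. The class and method names are hypothetical, and the sketch assumes that every forward pass, including the first, counts as one play, so the segment is played `repeat_count` times in total.

```python
class RepeatController:
    """Counts plays of one segment and decides whether to seek back again."""

    def __init__(self, start_ms, end_ms, repeat_count=3):
        self.start_ms = start_ms
        self.end_ms = end_ms
        self.repeat_count = repeat_count
        self.plays = 0

    def on_segment_end(self):
        """Call when playback reaches end_ms; returns a seek target or None."""
        self.plays += 1  # one more forward play has completed
        if self.plays < self.repeat_count:
            return self.start_ms  # seek back and play the segment again
        return None  # count reached: let the video continue


def simulate(controller):
    """Return the list of (start_ms, end_ms) passes a player would make."""
    passes = []
    while True:
        passes.append((controller.start_ms, controller.end_ms))
        if controller.on_segment_end() is None:
            return passes


passes = simulate(RepeatController(4000, 5200))
```

Because the repeat is implemented purely by seeking, the video file itself is never modified, which is the property the application emphasizes.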
In summary, the six steps above describe in detail the implementation of the method for repeating a word or sentence in a video according to the embodiments of this application, covering the establishment of the acoustic model, the processing of audio and video files by the speech algorithm, the generation of the word time series, the generation of content metadata, key frame positioning, and the repeat function. Based on the English subtitles of a video, the user can, while watching the video, use the word index and the player's rewind capability to repeat and shadow English words, which improves the user's English learning and the user experience.
In addition, FIG. 11 shows the implementation of the method for repeating a word or sentence in a video according to the embodiments of this application from the perspectives of the video resource administrator and the user.
Specifically, on the administrator side the operations are: 1101, the administrator operates the management console and extracts the video resource; 1102, the algorithm is invoked to preprocess the video, and the speech algorithm splits it automatically; 1103, a timestamped word sequence is output; 1104, the video content metadata is generated, that is, the word sequence search index.
Correspondingly, on the client side, depending on the scenario, the user operations may include: 1105, the user enters a keyword and finds short video clips through scene search or word search, and the client can display the video keywords and content; 1106, the user enters the video details page; 1107, the progress bar marks keywords, so the user can see in the player where each keyword occurs; 1108, the user selects word repetition, setting the repeat mode to word repeat or sentence repeat and setting the repeat count through the settings interface (3 by default); 1109, keywords are highlighted and repeat is enabled. When the video reaches the time slice of a keyword, the word can be highlighted and repeated automatically; once the accumulated count is greater than or equal to 3, repeating stops and the video continues to play unaffected.
In one possible scenario, such as the one described in FIG. 4, while watching a movie the user can tap a word in the subtitles to extract it, display its word card, and manually trigger repetition of that single word in the movie. This is described in detail above and is not repeated here.
In one possible scenario, such as the one described in FIG. 5, the user can learn exam vocabulary while watching a movie in the video playback application. Specifically, the user opens an English movie and can see which specialized vocabulary it contains, for example CET-4, TOEIC, or TOEFL words; during viewing, when playback reaches one of these words, the repeat function is activated.
In summary, the method for repeating a word or sentence provided in this application relies on speech recognition technology to provide a word search function associated with videos, generating an index from multiple words to multiple videos and an index from a single video to multiple words, so that the user can find related learning videos by searching for a word. In addition, using the word search function and the player's rewind capability, the repeat function is implemented by locating the start and end times of a word and the corresponding key frame. In this implementation, the user requests content metadata from the cloud, and the returned metadata contains the words and their timeline information. When the player repeats, the duration of the repeated content does not affect the duration of the video resource itself, while the playback position can change frequently. This avoids the existing approach of editing the video content, in which repeating a word lengthens the content, and so improves the user experience.
With reference to the above embodiments and related drawings, an embodiment of this application provides a video playback method that can be implemented in an electronic device with a touchscreen and a camera (for example, a mobile phone or a tablet computer) as shown in FIG. 1 and FIG. 2. FIG. 12 is a schematic flowchart of the video playback method according to an embodiment of this application. As shown in FIG. 12, the method may include the following steps:
1201. Display a first interface, where the first interface displays a first video being played and subtitles of the first video, and the subtitles of the first video include a first text unit and a second text unit.
It should be understood that a text unit in the user's learning process (for example, the first text unit or the second text unit) may be a single word, or may be a phrase or sentence consisting of multiple words, which is not limited in this application.
For example, the first interface is the interface shown in (e) or (f) of FIG. 3. The first interface includes the first video being played and its subtitles. The subtitles include the first text unit "message" that the user wants to learn; the words in the subtitles other than "message" are referred to as the second text unit.
Optionally, the first interface may further include the explanation details of the first text unit, for example the British and American pronunciations of "message", its Chinese definition, Chinese and English example sentences, and video example sentences. Optionally, the explanation details may come from an English dictionary built into the system or from an associated online English dictionary, which is not limited in this application.
1202. When playback reaches a first segment of the first video corresponding to the first text unit, automatically play the first segment repeatedly on the first interface.
Optionally, the first segment is the video segment corresponding to the first text unit, or the first segment is the video segment corresponding to the whole sentence containing the first text unit.
For example, after word repeat is selected in the repeat setting box 307 in (g) of FIG. 3, the first interface repeatedly plays the video segment shown in (h) of FIG. 3, from the start time to the end time of "message".
Alternatively, after sentence repeat is selected in the repeat setting box 307 in (g) of FIG. 3, the first interface repeatedly plays the video segment shown in (h) of FIG. 3, from the start time to the end time of the whole sentence containing "message", for example "Yes, I'd like to leave a message for Mr. Jay Twistle".
Optionally, the number of times the first segment of the first video is repeated is a system default or is set by the user.
For example, the number of repetitions may be set by the user in the settings or may be the system default. If the user has not set it, the number of repetitions may be the system default of 3. This is not limited in this application.
1203. Detect a first operation of the user on the first interface.
For example, as shown in (d) of FIG. 4, the first operation may be the user tapping the first text unit in the subtitles of the first video, for example tapping "close" in the subtitles.
1204. In response to the first operation, display a second interface, and display first information associated with the first text unit on the second interface.
After the mobile phone detects that the user has tapped the first text unit in the subtitles, it displays the second interface shown in (e) of FIG. 4. The second interface displays information associated with the tapped first text unit, such as its explanation details.
In a possible implementation, before the first interface is displayed, the method 1200 further includes:
Displaying a third interface, where the third interface displays the first text unit input by the user and includes second information associated with the first text unit and a first video list, and the first video list includes the first video.
For example, the third interface is the interface shown in (d) of FIG. 3. It is displayed after the user performs the operation shown in (c) of FIG. 3, entering the word "message" to be learned and tapping the navigation box 305. As shown in (d) of FIG. 3, the third interface includes the explanation details of the word "message" and the list of videos associated with "message".
Detecting a second operation of the user on the third interface.
Optionally, the first video list further includes a second video, and the second operation is used to select the first video.
For example, the video list may include multiple videos, and by swiping upward on the third interface, similar to the operation in (h) of FIG. 3, the user can see more selectable videos.
In response to the second operation, displaying the first interface.
For example, as shown in (d) of FIG. 3, the second operation may be the user tapping the first video; tapping the first video takes the user to the first interface.
In a possible implementation, the method 1200 further includes:
Detecting a third operation of the user on the third interface.
For example, as shown in (e) of FIG. 4, the third operation may be the user tapping the details control in the word explanation pop-up box 404.
In response to the third operation, displaying a fourth interface, where the fourth interface includes the second information of the first text unit and a second video list, the second video list includes at least one video, and the subtitles of each video in the second video list include the first text unit.
For example, as shown in (f) of FIG. 4, the fourth interface is the explanation interface of the text unit that the user enters after tapping the details control; it includes the word explanation details and a list of videos associated with the word.
In a possible implementation, the first video is paused while the second interface is displayed.
For example, during video playback, the video is paused whenever the user taps the repeat setting control to set the repeat mode and repeat count, as shown in (f) to (g) of FIG. 3, or taps any word in the subtitles to enter the learning mode for that word so that its word explanation box pops up, as shown in (d) to (e) of FIG. 4.
In a possible implementation, the display effect of the first text unit differs from that of the second text unit.
For example, as shown in (e) to (h) of FIG. 3, "message" in the subtitles is displayed differently from the other words, for example highlighted. Likewise, "close" in the subtitles is displayed differently from the other words in (c) and (d) of FIG. 4, and "abandon" is displayed differently from the other words in (b) of FIG. 5.
It can be understood that, to implement the foregoing functions, the electronic device includes hardware and/or software modules corresponding to each function. In combination with the algorithm steps of the examples described in the embodiments disclosed herein, this application can be implemented in hardware or in a combination of hardware and computer software. Whether a function is performed by hardware or by computer software driving hardware depends on the particular application and the design constraints of the technical solution. A person skilled in the art may, in combination with the embodiments, use different methods to implement the described functions for each particular application, but such implementations should not be considered to go beyond the scope of this application.
In this embodiment, the electronic device may be divided into functional modules according to the foregoing method examples. For example, each functional module may correspond to one function, or two or more functions may be integrated into one processing module. The integrated module may be implemented in the form of hardware. It should be noted that the division into modules in this embodiment is schematic and is merely a division by logical function; other divisions are possible in an actual implementation.
Where each functional module corresponds to one function, FIG. 13 shows a schematic diagram of a possible composition of the electronic device 1300 in the foregoing embodiments. As shown in FIG. 13, the electronic device 1300 may include a display unit 1301, a detection unit 1302, and a processing unit 1303.
The display unit 1301 may be used to support the electronic device 1300 in performing step 1201, step 1204, and the like described above, and/or other processes of the technology described herein.
The detection unit 1302 may be used to support the electronic device 1300 in performing step 1203 and the like described above, and/or other processes of the technology described herein.
The processing unit 1303 may be used to support the electronic device 1300 in performing step 1202 and the like described above, and/or other processes of the technology described herein.
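The division into three units can be sketched as three small classes composed by a device object. This is a schematic illustration, not the structure of FIG. 13 itself. The contents of steps 1201 to 1204 are not defined in this passage; the sketch assumes they follow the order of claim 1 (display the first interface, repeat the segment, detect the first operation, display the second interface). All class names, method names, and the `"tap_word"` event label are hypothetical.

```python
from typing import List


class DisplayUnit:
    """Stands in for display unit 1301 (assumed steps 1201 and 1204)."""

    def __init__(self, log: List[str]) -> None:
        self.log = log

    def show_first_interface(self) -> None:
        self.log.append("1201 display first interface")

    def show_second_interface(self, word: str) -> None:
        self.log.append(f"1204 display second interface for {word}")


class ProcessingUnit:
    """Stands in for processing unit 1303 (assumed step 1202)."""

    def __init__(self, log: List[str]) -> None:
        self.log = log

    def repeat_segment(self, word: str) -> None:
        self.log.append(f"1202 repeat segment for {word}")


class DetectionUnit:
    """Stands in for detection unit 1302 (assumed step 1203)."""

    def __init__(self, log: List[str]) -> None:
        self.log = log

    def first_operation_detected(self, event: str) -> bool:
        detected = event == "tap_word"  # hypothetical event label
        if detected:
            self.log.append("1203 detect first operation")
        return detected


class ElectronicDevice:
    """Composes the three units and runs the assumed step order once."""

    def __init__(self) -> None:
        self.log: List[str] = []
        self.display = DisplayUnit(self.log)
        self.detection = DetectionUnit(self.log)
        self.processing = ProcessingUnit(self.log)

    def run(self, word: str, event: str) -> List[str]:
        self.display.show_first_interface()
        self.processing.repeat_segment(word)
        if self.detection.first_operation_detected(event):
            self.display.show_second_interface(word)
        return self.log
```

Because the division is purely logical, the same three roles could equally be merged into one processing module, as the description notes for the integrated case.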
It should be noted that all related content of the steps in the foregoing method embodiments may be cited in the functional descriptions of the corresponding functional modules, and details are not repeated here.
The electronic device provided in this embodiment is used to perform the foregoing video playback method and can therefore achieve the same effects as the foregoing implementation methods.
Where an integrated unit is used, the electronic device may include a processing module, a storage module, and a communication module. The processing module may be used to control and manage the actions of the electronic device; for example, it may be used to support the electronic device in performing the steps performed by the display unit 1301, the detection unit 1302, and the processing unit 1303. The storage module may be used to support the electronic device in storing program code, data, and the like. The communication module may be used to support communication between the electronic device and other devices.
The processing module may be a processor or a controller. It may implement or execute the various exemplary logical blocks, modules, and circuits described in connection with the disclosure of this application. The processor may also be a combination that implements computing functions, for example a combination of one or more microprocessors, or a combination of digital signal processing (DSP) and a microprocessor. The storage module may be a memory. The communication module may specifically be a device that interacts with other electronic devices, such as a radio frequency circuit, a Bluetooth chip, or a Wi-Fi chip.
In an embodiment, when the processing module is a processor and the storage module is a memory, the electronic device in this embodiment may be a device having the structure shown in FIG. 1.
This embodiment further provides a computer storage medium that stores computer instructions. When the computer instructions run on an electronic device, the electronic device performs the foregoing related method steps to implement the video playback method in the foregoing embodiments.
This embodiment further provides a computer program product. When the computer program product runs on a computer, the computer performs the foregoing related steps to implement the video playback method in the foregoing embodiments.
In addition, an embodiment of this application further provides an apparatus, which may specifically be a chip, a component, or a module. The apparatus may include a processor and a memory that are connected to each other. The memory is used to store computer-executable instructions; when the apparatus runs, the processor may execute the computer-executable instructions stored in the memory, so that the chip performs the video playback method in the foregoing method embodiments.
The electronic device, computer storage medium, computer program product, and chip provided in this embodiment are all used to perform the corresponding methods provided above. For the beneficial effects they can achieve, refer to the beneficial effects of the corresponding methods provided above; details are not repeated here.
From the description of the foregoing implementations, a person skilled in the art can understand that, for convenience and brevity of description, only the foregoing division into functional modules is used as an example. In practical applications, the foregoing functions may be assigned to different functional modules as required; that is, the internal structure of the apparatus may be divided into different functional modules to complete all or some of the functions described above.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely schematic. The division into modules or units is merely a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another apparatus, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
Units described as separate components may or may not be physically separate, and components shown as units may be one physical unit or multiple physical units; that is, they may be located in one place or distributed across multiple places. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of this application may be integrated into one processing unit, each unit may exist alone physically, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application, in essence, or the part that contributes to the prior art, or all or part of the technical solutions, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that cause a device (which may be a single-chip microcomputer, a chip, or the like) or a processor to perform all or some of the steps of the methods in the embodiments of this application. The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The foregoing is merely specific implementations of this application, but the protection scope of this application is not limited thereto. Any change or replacement that a person skilled in the art can readily conceive of within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (18)

  1. A video playback method, characterized in that the method comprises:
    displaying a first interface, wherein the first interface displays a first video being played and subtitles of the first video, and the subtitles of the first video include a first text unit and a second text unit;
    when playback reaches a first segment of the first video corresponding to the first text unit, automatically playing the first segment repeatedly on the first interface;
    detecting a first operation of a user on the first interface; and
    in response to the first operation, displaying a second interface, and displaying, on the second interface, first information associated with the first text unit.
  2. The method according to claim 1, characterized in that the first segment is the video segment corresponding to the first text unit, or the first segment is the video segment corresponding to the whole sentence in which the first text unit is located.
  3. The method according to claim 1 or 2, characterized in that the method further comprises:
    before displaying the first interface, displaying a third interface, wherein the third interface displays the first text unit entered by the user, and the third interface includes second information associated with the first text unit and a first video list, the first video list including the first video;
    detecting a second operation of the user on the third interface; and
    in response to the second operation, displaying the first interface.
  4. The method according to claim 3, characterized in that the first video list further includes a second video, and the second operation is used to select the first video.
  5. The method according to claim 4, characterized in that the method further comprises:
    detecting a third operation of the user on the third interface; and
    in response to the third operation, displaying a fourth interface, wherein the fourth interface includes the second information of the first text unit and a second video list, the second video list includes at least one video, and the subtitles of each video in the second video list include the first text unit.
  6. The method according to any one of claims 1 to 5, characterized in that the first video is paused while the second interface is displayed.
  7. The method according to claim 6, characterized in that the number of times the first segment of the first video is repeatedly played is a system default preset number or is set by the user.
  8. The method according to any one of claims 1 to 7, characterized in that the display effect of the first text unit differs from the display effect of the second text unit.
  9. An electronic device, characterized by comprising: one or more processors; a memory; a plurality of application programs; and one or more programs, wherein the one or more programs are stored in the memory and, when executed by the processor, cause the electronic device to perform the following steps:
    displaying a first interface, wherein the first interface displays a first video being played and subtitles of the first video, and the subtitles of the first video include a first text unit and a second text unit;
    when playback reaches a first segment of the first video corresponding to the first text unit, automatically playing the first segment repeatedly on the first interface;
    detecting a first operation of a user on the first interface; and
    in response to the first operation, displaying a second interface, and displaying, on the second interface, first information associated with the first text unit.
  10. The electronic device according to claim 9, characterized in that the first segment is the video segment corresponding to the first text unit, or the first segment is the video segment corresponding to the whole sentence in which the first text unit is located.
  11. The electronic device according to claim 9 or 10, characterized in that the one or more programs, when executed by the processor, cause the electronic device to perform the following steps:
    before displaying the first interface, displaying a third interface, wherein the third interface displays the first text unit entered by the user, and the third interface includes second information associated with the first text unit and a first video list, the first video list including the first video;
    detecting a second operation of the user on the third interface; and
    in response to the second operation, displaying the first interface.
  12. The electronic device according to claim 11, characterized in that the first video list further includes a second video, and the second operation is used to select the first video.
  13. The electronic device according to claim 12, characterized in that the one or more programs, when executed by the processor, cause the electronic device to perform the following steps:
    detecting a third operation of the user on the third interface; and
    in response to the third operation, displaying a fourth interface, wherein the fourth interface includes the second information of the first text unit and a second video list, the second video list includes at least one video, and the subtitles of each video in the second video list include the first text unit.
  14. The electronic device according to any one of claims 9 to 13, characterized in that the first video is paused while the second interface is displayed.
  15. The electronic device according to claim 14, characterized in that the number of times the first segment of the first video is repeatedly played is a system default preset number or is set by the user.
  16. The electronic device according to any one of claims 9 to 15, characterized in that the display effect of the first text unit differs from the display effect of the second text unit.
  17. A computer storage medium, characterized by comprising computer instructions that, when run on an electronic device, cause the electronic device to perform the video playback method according to any one of claims 1 to 8.
  18. A computer program product, characterized in that, when the computer program product runs on a computer, the computer is caused to perform the video playback method according to any one of claims 1 to 8.
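The segment selection of claims 2 and 10 and the repeat count of claims 7 and 15 can be illustrated with a short sketch. It assumes per-word timestamps are available for each subtitle sentence, which the claims do not specify. The names `WordTiming`, `Cue`, `first_segment`, `playback_plan`, the mode labels, and the default of 3 repetitions are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

DEFAULT_REPEATS = 3  # hypothetical system default; no value is given in the claims


@dataclass
class WordTiming:
    """A word in the subtitles with its assumed start and end time in seconds."""
    text: str
    start: float
    end: float


@dataclass
class Cue:
    """One subtitle sentence made of timed words."""
    words: List[WordTiming]

    @property
    def start(self) -> float:
        return self.words[0].start

    @property
    def end(self) -> float:
        return self.words[-1].end


def first_segment(cue: Cue, target: str, mode: str) -> Optional[Tuple[float, float]]:
    """Return the time range to repeat, or None if `target` is not in the cue.

    mode == "word": the range of the word itself.
    mode == "sentence": the range of the whole sentence containing the word.
    """
    for w in cue.words:
        if w.text.lower() == target.lower():
            return (w.start, w.end) if mode == "word" else (cue.start, cue.end)
    return None


def playback_plan(segment: Tuple[float, float],
                  user_repeats: Optional[int] = None) -> List[Tuple[float, float]]:
    """List the segment once per repetition, using the user's count if one is set."""
    count = DEFAULT_REPEATS if user_repeats is None else user_repeats
    return [segment] * count
```

For a cue "Send a message" with "message" timed at 0.5 s to 1.1 s, word mode yields (0.5, 1.1) and sentence mode yields (0.0, 1.1); a player would seek to the start of each listed range in turn.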
PCT/CN2019/121187 2018-12-10 2019-11-27 Method for repeating word or sentence during video playback, and electronic device WO2020119455A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811502510.3A CN109756770A (en) 2018-12-10 2018-12-10 Video display process realizes word or the re-reading method and electronic equipment of sentence
CN201811502510.3 2018-12-10

Publications (1)

Publication Number Publication Date
WO2020119455A1 true WO2020119455A1 (en) 2020-06-18

Family

ID=66402724

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/121187 WO2020119455A1 (en) 2018-12-10 2019-11-27 Method for repeating word or sentence during video playback, and electronic device

Country Status (2)

Country Link
CN (1) CN109756770A (en)
WO (1) WO2020119455A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109756770A (en) * 2018-12-10 2019-05-14 华为技术有限公司 Video display process realizes word or the re-reading method and electronic equipment of sentence
CN110223549A (en) * 2019-06-14 2019-09-10 林慧泽 A kind of word multifrequency mnemonics
CN110598012B (en) * 2019-09-23 2023-05-30 听典(上海)教育科技有限公司 Audio and video playing method and multimedia playing device
CN113051985B (en) * 2019-12-26 2024-07-05 深圳云天励飞技术有限公司 Information prompting method, device, electronic equipment and storage medium
CN111459448A (en) * 2020-01-19 2020-07-28 托普朗宁(北京)教育科技有限公司 Reading assisting method and device, storage medium and electronic equipment
CN111710199A (en) * 2020-07-15 2020-09-25 罗鹏 English teaching system based on big data
CN111901665B (en) * 2020-08-28 2022-08-26 完美世界控股集团有限公司 Teaching resource playing method and device and storage medium
CN113053415B (en) * 2021-03-24 2023-09-29 北京如布科技有限公司 Method, device, equipment and storage medium for detecting continuous reading
CN113436478A (en) * 2021-06-22 2021-09-24 读书郎教育科技有限公司 System and method for assisting in word-back in combination with textbook content

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1735914A (en) * 2003-01-30 2006-02-15 电影教学系统股份有限公司 Video based language learning system
WO2011065758A2 (en) * 2009-11-25 2011-06-03 올토주식회사 Content creation method, edutainment device using the content, and edutainment method using the same
CN103324685A (en) * 2013-06-03 2013-09-25 大连理工大学 Search method for video fragments of Japanese online video corpora
CN104202678A (en) * 2014-09-22 2014-12-10 杨海 Video subtitle display method with bilingual subtitle replaying and previewing functions
CN105354331A (en) * 2015-12-02 2016-02-24 深圳大学 Online video based vocabulary learning aiding method and vocabulary learning system
CN107632755A (en) * 2017-09-13 2018-01-26 周连惠 A kind of Chinese and English switching method of English study software
CN109756770A (en) * 2018-12-10 2019-05-14 华为技术有限公司 Video display process realizes word or the re-reading method and electronic equipment of sentence

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7149970B1 (en) * 2000-06-23 2006-12-12 Microsoft Corporation Method and system for filtering and selecting from a candidate list generated by a stochastic input method
CN101093621A (en) * 2007-07-12 2007-12-26 魏益刚 Controllable palmtop simulator for language environment, and controllable method for simulating language environment
CN102354465A (en) * 2011-10-08 2012-02-15 许卫刚 Method for English study by taking sentence as minimum unit and system therefor
CN106205239A (en) * 2016-09-18 2016-12-07 三峡大学 A kind of electronic dictionary system based on 3D three-dimensional imaging

Also Published As

Publication number Publication date
CN109756770A (en) 2019-05-14

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19896748

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19896748

Country of ref document: EP

Kind code of ref document: A1