CN113868445A - Continuous playing position determining method and continuous playing system - Google Patents
Continuous playing position determining method and continuous playing system Download PDFInfo
- Publication number
- CN113868445A CN113868445A CN202010617722.7A CN202010617722A CN113868445A CN 113868445 A CN113868445 A CN 113868445A CN 202010617722 A CN202010617722 A CN 202010617722A CN 113868445 A CN113868445 A CN 113868445A
- Authority
- CN
- China
- Prior art keywords
- playing
- played
- logic
- historical
- determining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/44—Browsing; Visualisation therefor
- G06F16/447—Temporal browsing, e.g. timeline
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/438—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/489—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using time information
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The application provides a continuous playing position determining method, which comprises the following steps: receiving a continuous playing request, wherein the continuous playing request carries identification information of a user and initiation time of the continuous playing request; determining the latest stop time and stop position of at least one part of historical playing objects of the user according to the identification information of the user; determining the off-air time of at least one part of historical playing objects according to the latest off-air time and the initiation time of at least one part of historical playing objects; and determining the continuous playing position of at least one part of the historical playing objects according to the stop playing time of at least one part of the historical playing objects and the latest stop playing position of at least one part of the historical playing objects, wherein the continuous playing position of each historical playing object is before the latest stop playing position of each historical playing object. The method and the device improve the flexibility of determining the continuous playing position so as to meet the diversified playing requirements.
Description
Technical Field
The present application relates to the field of computer technologies, and in particular, to a resume position determining method and a resume system.
Background
With the continuous popularization and development of user terminals, users increasingly rely on the user terminals. A user may install various media applications on the user terminal to enable the user to perform media entertainment on the user terminal, such as listening to a voice book, watching a movie, reading a novel, etc.
In order to improve user experience, the existing media applications all have a continuous playing function. At present, the principle of continuous playing of media application is as follows: and taking the position of the last pause playing as the starting point of the current continuous playing to continue playing. Obviously, the resume position is a position at which the playing is paused last time, and the determination of the resume position is not flexible, and cannot meet diversified playing requirements.
Disclosure of Invention
The application provides a continuous playing position determining method and a continuous playing system, which are used for solving the problems that the determination of the continuous playing position is not flexible and cannot meet diversified playing requirements.
In a first aspect, the present application provides a method for determining a resume position, including: receiving a continuous playing request, wherein the continuous playing request carries identification information of a user and initiation time of the continuous playing request; determining the latest stop time and stop position of at least one part of historical playing objects of the user according to the identification information of the user; determining the off-air time of at least one part of historical playing objects according to the latest off-air time and the initiation time of at least one part of historical playing objects; determining the continuous playing position of at least one part of historical playing objects according to the stop playing time of at least one part of historical playing objects and the latest stop playing position of at least one part of historical playing objects, wherein: the resume position of each of the at least a portion of the historical play objects is prior to its last stop position.
The method comprises the steps of determining the continuous playing time of a historical playing object according to the initiation time of a continuous playing request and the latest stop time of the historical playing object, determining the continuous playing position of the historical playing object according to the stop time of the historical playing object and the latest stop position of the historical playing object, and enabling the continuous playing position of the historical playing object to be more flexibly determined, so that diversified playing requirements can be better met.
In one possible implementation, the method further includes: acquiring logic information of at least one part of historical playing objects, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point; the determining the resume position of the at least a part of the historical playing object comprises: determining the continuous playing position of at least one part of historical playing objects according to the stop playing duration of at least one part of historical playing objects and the latest stop playing position of at least one part of historical playing objects and the logic information of at least one part of historical playing objects; and the continuous playing position of each historical playing object in the at least part of the historical playing objects is the position of one logic point in the logic information.
According to the stop-air time of the historical playing object and the last stop-air position of the historical playing object, the position of one logic point in the logic information of the historical playing object is determined as the continuous playing position of the audio book H, namely the determination of the continuous playing position is related to the logic information, the stop-air time and the last stop-air position of the historical playing object, so that the determination of the continuous playing position is more flexible, and the playing requirements of diversity can be better met. In addition, the resume position of the history playing object is before the last stop position, so that when the history playing object is played to the user according to the resume position, the user can review the previous content, and the user experience is improved. In addition, the resume position of the history playing object is the position of a logic point of the history playing object, so that when the history playing object is played according to the resume position, a user can better review the previous content, and the user experience is further improved.
In a possible implementation manner, the determining the resume position of the at least one part of the historical play object includes: determining the memory rate of the user to the at least one part of historical playing objects according to the off-air time of the at least one part of historical playing objects; and determining the continuous playing position of the at least one part of the historical playing object according to the memory rate of the user to the at least one part of the historical playing object and the latest stop playing position of the at least one part of the historical playing object.
In one possible implementation, the method further includes: acquiring logic information of at least one part of historical playing objects, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point; the determining the resume position of the at least a part of the historical playing object comprises: determining the memory rate of the user to the at least one part of historical playing objects according to the off-air time of the at least one part of historical playing objects; determining a continuous playing position of at least one part of historical playing objects according to the memory rate of the user to the at least one part of historical playing objects, the latest stop playing position of the at least one part of historical playing objects and the logic information of the at least one part of historical playing objects; and the continuous playing position of each historical playing object in the at least part of the historical playing objects is the position of one logic point in the logic information.
Determining the memory rate of the user to the historical playing object according to the stop playing duration of the historical playing object, determining the position of a logic point in the logic information of the historical playing object as the continuous playing position of the historical playing object according to the memory rate of the user to the historical playing object and the latest stop playing position of the historical playing object, namely determining the continuous playing position, wherein the determination of the continuous playing position is related to the logic information of the historical playing object, the latest stop playing position and the memory rate of the user to the historical playing object, so that the determination of the continuous playing position is more flexible, and the playing requirements of diversity are better met. In addition, the resume position of the history playing object is before the last stop position, so that when the history playing object is played to the user according to the resume position, the user can review the previous content, and the user experience is improved. In addition, the resume position of the history playing object is the position of a logic point of the history playing object, so that when the history playing object is played according to the resume position, a user can better review the previous content, and the user experience is further improved. In addition, the position of one of the plurality of logical points in the historical playing object is determined as the resume position according to the memory rate of the user to the historical playing object and the latest stop position of the historical playing object, namely, the determination of the resume position is related to the memory rate of the user to the historical playing object, so that the resume position can be determined based on the memory rates of different users, and the accuracy of determining the resume position is further improved.
In one possible implementation manner, the determining the memory rate of the user for the at least one part of the historical playing object includes: and determining the memory rate of the user to the at least one part of the historical playing objects according to the off-air time length of the at least one part of the historical playing objects and a memory rate curve.
In a possible implementation manner, the type of the history playing object is audio; for each history playing object, acquiring the logic information of the history playing object comprises: acquiring voice information from the audio data of the historical playing object; based on the identification of the voice information, determining a position of each of a plurality of logical points of the history playing object and a logical relationship between the plurality of logical points.
In one possible implementation manner, the history playing object is an audio book; the plurality of logical points of the history playing object include a chapter start point, a paragraph start point, and a sentence start point of the audio book.
In a possible implementation manner, the type of the history playing object is a video; for each history playing object, acquiring the logic information of the history playing object comprises: acquiring audio data from the video data of the historical playing object; acquiring human voice information from the audio data; based on the identification of the voice information, determining a position of each of a plurality of logical points of the history playing object and a logical relationship between the plurality of logical points.
In a possible implementation manner, the type of the history playing object is a video; for each history playing object, acquiring the logic information of the history playing object comprises: acquiring subtitle data from the video data of the historical playing object; based on the identification of the subtitle data, determining a position of each of a plurality of logical points of the historical play object and a logical relationship between the plurality of logical points.
In a possible implementation manner, the type of the history playing object is a text; for each history playing object, acquiring the logic information of the history playing object comprises: identifying keywords in the text data of the historical playing object; based on the analysis of the keywords, determining the position of each logical point in the plurality of logical points of the historical playing object and the logical relationship between the plurality of logical points.
In a possible implementation manner, the at least one part of the historical playing object is a playing object that is played to the user last time.
In one possible implementation, the method further includes: and sending the resume position of the at least part of the historical playing object and playing data which is used for the user terminal to cache and corresponds to the resume position of the at least part of the historical playing object to the user terminal.
In a second aspect, a continuous play position determining method is provided, including: receiving a continuous playing request, wherein the continuous playing request carries identification information of a user, identification information of an object to be played and initiation time of the continuous playing request; determining the latest stop time and stop position of the object to be played according to the identification information of the object to be played and the identification information of the user; determining the broadcasting stop duration of the object to be played according to the broadcasting stop time and the initiating time; and determining the continuous playing position of the object to be played according to the stop playing time length and the stop playing position, wherein the continuous playing position of the object to be played is before the stop playing position.
The method comprises the steps of determining the continuous playing time length of an object to be played according to the initiation time of a continuous playing request and the latest stop time of the object to be played, determining the continuous playing position of the object to be played according to the stop time length of the object to be played and the latest stop position of the object to be played, and enabling the continuous playing position of the novel to be played to be more flexible, so that diversified playing requirements can be met better.
In one possible implementation, the method further includes: acquiring logic information of the object to be played, wherein the logic information of the object to be played is used for indicating the logic relationship among a plurality of logic points of the object to be played and the position of each logic point; the determining the continuous playing position of the object to be played comprises: determining the continuous playing position of the object to be played according to the off-air time length and the off-air position and by combining the logic information of the object to be played; and the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
In a possible implementation manner, the determining the resume position of the object to be played includes: determining the memory rate of the user to the object to be played according to the off-air time of the object to be played; and determining the continuous playing position of the object to be played according to the memory rate of the user to the object to be played and the latest stop playing position of the object to be played.
In one possible implementation, the method further includes: acquiring logic information of the object to be played, wherein the logic information of the object to be played is used for indicating the logic relationship among a plurality of logic points of the object to be played and the position of each logic point; the determining the continuous playing position of the object to be played comprises: determining the memory rate of the user to the object to be played according to the off-air time; determining a continuous playing position of the object to be played according to the memory rate and the stop playing position and by combining the logic information of the object to be played; and the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
In a possible implementation manner, the determining, according to the off-air duration, a memory rate of the user for the object to be played includes: and determining the memory rate of the user to the object to be played according to the off-air time and by combining a memory rate curve.
In a possible implementation manner, the type of the object to be played is audio; the acquiring the logic information of the object to be played comprises: acquiring voice information from the audio data of the object to be played; and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the voice information.
In a possible implementation manner, the object to be played is a vocal book; the plurality of logic points of the object to be played comprise a chapter starting point, a paragraph starting point and a sentence starting point of the audio book.
In a possible implementation manner, the type of the object to be played is a video; the acquiring the logic information of the object to be played comprises: acquiring audio data from the video data of the object to be played; acquiring human voice information from the audio data; and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the voice information.
In a possible implementation manner, the type of the object to be played is a video; the acquiring the logic information of the object to be played comprises: acquiring subtitle data from the video data of the object to be played; and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the caption data.
In a possible implementation manner, the type of the object to be played is a text; the acquiring the logic information of the object to be played comprises: identifying keywords in the text data of the object to be played; and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the analysis of the keywords.
In one possible implementation, the method further includes: and sending the resume position and play data corresponding to the resume position and used for caching by the user terminal to the user terminal.
In a third aspect, a continuous playing system is provided, including: a user terminal and the server of any of the first or second aspects; the user terminal is configured to send a resume request to the server.
In a fourth aspect, an electronic device is provided, including a communication module for communicating with other devices; one or more storage memories for storing one or more computer programs; one or more processors configured to execute the one or more computer programs to cause the electronic device to perform the method of any of the first or second aspects.
In a fifth aspect, there is provided a computer readable storage medium comprising a computer program which, when executed on a computer, causes the computer to perform the method of any one of the first or second aspects.
In a sixth aspect, there is provided a computer program for performing the method of any one of the first or second aspects when the computer program is executed by a computer.
In a seventh aspect, a chip comprises a processor and a memory, the memory is used for storing a computer program, and the processor is used for calling and running the computer program stored in the memory to execute the method of any one of the first aspect or the second aspect.
Drawings
Fig. 1 is a schematic view of an application scenario of a continuous playing position determining method according to an embodiment of the present application;
fig. 2 is a first flowchart illustrating a resume position determining method according to an embodiment of the present application;
fig. 3 is a schematic diagram of an interactive interface of an audio book playing application provided in an embodiment of the present application;
FIG. 4 is a schematic diagram of a logical directory of audio books H provided by an embodiment of the present application;
FIG. 5 is a graph illustrating a memory rate curve according to an embodiment of the present disclosure;
FIG. 6 is a schematic diagram of a query dialog provided in an embodiment of the present application;
fig. 7 is a schematic diagram of a logic tree of a tv series B according to an embodiment of the present application;
fig. 8 is a flowchart illustrating a second method for determining a resume position according to an embodiment of the present application;
fig. 9 is a third schematic flowchart of a resume position determining method according to an embodiment of the present application;
fig. 10 is a fourth schematic flowchart of a resume position determination method according to an embodiment of the present application;
fig. 11 is a fifth flowchart illustrating a resume position determining method according to an embodiment of the present application;
fig. 12 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
To make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the present application will be clearly and completely described below with reference to the drawings in the present application, and it is obvious that the described embodiments are some, but not all embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The terms "first," "second," and the like in the description examples and claims of this application and in the drawings are used for descriptive purposes only and are not to be construed as indicating or implying relative importance, nor order. Furthermore, the terms "comprises" and "comprising," as well as any variations thereof, are intended to cover a non-exclusive inclusion, such as a list of steps or elements. A method, system, article, or apparatus is not necessarily limited to those steps or elements explicitly listed, but may include other steps or elements not explicitly listed or inherent to such process, system, article, or apparatus.
It should be understood that in the present application, "at least one" means one or more, "a plurality" means two or more. "and/or" for describing an association relationship of associated objects, indicating that there may be three relationships, e.g., "a and/or B" may indicate: only A, only B and both A and B are present, wherein A and B may be singular or plural. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. "at least one of the following" or similar expressions refer to any combination of these items, including any combination of single item(s) or plural items. For example, at least one (one) of a, b, or c, may represent: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", wherein a, b, c may be single or plural.
Fig. 1 is a schematic view of an application scenario of a resume position determining method according to an embodiment of the present application. The application scenario includes a server 101 and a user terminal 102, wherein:
the server 101 may be, for example, a physical server, or may be a cloud server, and the physical server may be, for example, an independent physical server, or a server cluster composed of a plurality of physical servers, and the like, and this is not particularly limited in this embodiment of the present application.
The user terminal 102 may be, for example, a mobile phone, a tablet computer, a notebook computer, an e-reader, and the like. A user may install various media applications in the user terminal 102 to use various types of services provided in the media applications through the media applications.
The media application may be, for example, a video playing application, a text presentation application (e.g., an electronic reading application), an audio playing application (e.g., an audio book playing application), and the like, which is not particularly limited in this embodiment of the present application.
The media application may present a plurality of playing objects to a user, where if the media application is a video playing application, the type of the playing object may be, for example, a video object such as a tv drama and a variety program, if the type of the media application is an audio playing application, the playing object may be, for example, an audio object such as a vocal book and a broadcast program, and if the media application is a text displaying application, the playing object may be, for example, a text object such as a novel, news, and thesis, which is not particularly limited in this embodiment of the present application.
Various interactions are performed between the server 101 and the user terminal 102 to provide services to the user. For example, a user may initiate a play request to the server 101 through the user terminal 102 installed with the media application, and the server 101 receives and responds to the play request to send play data to the user terminal 102, so that the user terminal 102 plays.
For another example, the user may also send a resume request to the server 101 through the user terminal 102 installed with the media application, and the server 101 receives and responds to the resume request, determines a resume position, and sends the resume position to the user terminal 102, and simultaneously sends the play data to the user terminal 102, so that the user terminal 102 plays the play data from the resume position.
Next, a method of determining the resume position will be described based on the above application scenario.
Fig. 2 is a first flowchart of a resume position determining method according to an embodiment of the present application. In fig. 2, the method for determining the resume position is described by taking the user terminal as a mobile phone, the media application as a vocal book playing application, and the playing object as a vocal book. It should be noted that, in the resume position determining method shown in fig. 2, after the user opens the vocal book playing application, the mobile phone sends a resume request to the server, and the server receives and responds to the resume request to determine only the resume position of the vocal book that was played last time to the user.
As shown in fig. 2, the specific process of resuming location determination includes the following steps:
step 201, a user clicks an icon of an audio book playing application in a mobile phone interface, and the audio book playing application is opened;
step 202, the mobile phone displays an interactive interface of the audio book playing application, as shown in fig. 3, a resume control 301 of an audio book H (not shown in the figure) is displayed in the interactive interface of the audio book playing application, the audio book H is an audio book that has been played last time to the user, and the audio book that has been played last time to the user is an audio book that has been played last time when the user quits the audio book playing application last time. It should be noted that the interactive interface of the audio book playing application may further include other audio book operation controls 302, and the like, which is not particularly limited in this application.
Step 203, after the user clicks the icon of the audio book playing application in the mobile phone interface, the mobile phone sends a resume request to the server, and the resume request carries the identification information of the user and the initiation time of the resume request.
In the embodiment of the application, a user clicks an icon of the vocal book playing application, the vocal book playing application is opened in the mobile phone, the mobile phone acquires identification information and current time of the user, the current time is determined as initiation time of a continuous playing request, then the mobile phone generates the continuous playing request according to the identification information of the user and the initiation time of the continuous playing request, and finally the mobile phone sends the continuous playing request to the server.
For a user who has registered a voice book playing application, the identification information of the user may be, for example, an account number (e.g., a mobile phone number, an identity card number) of the user registered the voice book playing application, and the like, which is not particularly limited in this embodiment of the present application.
For a user who does not register with the audio book playing application, the identification information of the user may be, for example, an identification code of a mobile phone, and this is not particularly limited in this embodiment of the application.
And step 204, the server receives the continuous playing request, and determines the latest stop time and stop position of the audio book H according to the identification information of the user carried by the continuous playing request, namely determines the stop position and stop time of the audio book which is played last when the user quits the audio book playing application last time.
It should be noted that, if the user uses the audio book playing application, the audio book playing application uploads the stop time and stop position of each audio book played by the user in the audio book playing application to the server.
Based on this, the last stop time and stop position of all audio books played by the user before can be obtained according to the identification information of the user, and the last stop time and stop position of the audio book played by the user last time can be determined according to the last stop time of each audio book, that is, the audio book with the last stop time closest to the current time is determined as the audio book played by the user last time (i.e., audio book H).
It should be noted that the last stop time of the audio book is the time when the user last stops playing the audio book, and the last stop position of the audio book is the playing progress of the audio book when the user last stops playing the audio book.
For the audio class playing object, the playing progress of the audio class playing object is generally represented by at least one of the time progress, the identifier of the audio playing file, and the like, so that the stop position of the audio class playing object can be represented by at least one of the time progress, the identifier of the audio playing file, and the like. For example, the stop position of the playing object can be represented by an identification of the audio playing file to which the stop position of the playing object belongs and a time progress of the pause position in the audio playing file to which it belongs.
Since the playing object is an audio book in this embodiment, that is, the playing object is an audio-class playing object, the stop position of the audio book may also be represented by at least one of the time progress, the identifier of the audio playing file, and the like. For example, the stop position of the vocal book can be represented as [06, 00.20.00], 06 represents the identification of the audio play file to which the stop position of the vocal book belongs, and 00.20.00 represents the 20 th minute of the stop position of the vocal book in the audio play file to which it belongs. In the following description of the time schedule, the first two digits are expressed in hours, the middle two digits are expressed in minutes, and the last two digits are expressed in seconds. On this basis, if 06 is the identifier of the audio playback file corresponding to chapter six of the audio book, the stop position of the audio book is the 20 th minute in chapter six of the audio book.
It should be noted that the above-mentioned representation of the stop-and-play position of the audio-like playing object and the audio book is only exemplary and is not used to limit the present application. For example, the representation of the audio-class playing object and the stop-and-play position of the audio book can also be other desired representations.
And step 205, the server determines the off-air time of the audio book H according to the initiation time of the continuous-air request and the latest off-air time of the audio book H. Specifically, the difference between the initiation time of the continuous playing request and the latest stop time of the audio book H is determined as the stop time of the audio book H.
And step 206, the server determines the continuous playing position of the audio book H according to the stop playing time of the audio book H and the latest stop playing position of the audio book H, wherein the continuous playing position of the audio book H is before the latest stop playing position of the audio book H.
Next, a process of determining the resume position of the voiced book H will be explained in the following four ways.
First, the continuous playing position of the audio book H is determined according to the stop playing time of the audio book H and the latest stop playing position of the audio book H, and in combination with the logic information of the audio book H, wherein the logic information of the audio book is used for indicating the logic relationship among a plurality of logic points of the audio book and the position of each logic point. The resume position of the audio book H is the position of one logical point in the logical information thereof.
Before determining the resume position of the vocal book H, the logic information of each vocal book in the vocal book play application may be acquired in advance, and the logic information of each vocal book is used to indicate the logic relationship between a plurality of logic points of the vocal book and the position of each logic point. The logic information of each voiced book is saved. In this way, when the continuous playing position of the audio book H is determined, the logical information of the audio book H can be acquired from the information stored in advance based on the identification information of the audio book H.
Next, a principle of acquiring logic information of one audio book will be described.
First, human voice information is acquired from audio data of a voiced book. The audio data of the audio book includes all audio playback files of the audio book. For example, if a sound book includes ten chapters and each chapter corresponds to one audio playing file, the audio data of the sound book includes ten audio playing files corresponding to the ten chapters. Specifically, the audio data of the vocal book is converted from the time domain to the frequency domain through Fourier transform to obtain audio data of the vocal book based on the frequency domain, the audio data of the vocal book based on the frequency domain is input into an algorithm model capable of filtering background sounds, the algorithm model outputs vocal information based on the frequency domain, and the vocal information based on the frequency domain is converted into vocal information based on the time domain through inverse Fourier transform.
The process of obtaining the algorithm model capable of filtering the background sound comprises the following steps: acquiring a large amount of three audio data, namely, the first audio data with only human voice information, the second audio data with only background sound, and the third audio data obtained by mixing the first audio data with the second audio data (namely, the audio data with the human voice information and the background sound); the method comprises the steps of taking first audio data and second audio data as training data, or taking the first audio data, the second audio data and third audio data as training data, converting the training data from a time domain to a frequency domain through Fourier transform to obtain training data based on the frequency domain, and training an algorithm model through the training data based on the frequency domain to enable the algorithm model to learn the mapping relation of the three audio data to the first audio data so as to obtain the algorithm model capable of filtering background sounds. The algorithm model may be, for example, a Conditional access network (Conditional GAN) or the like, and the present application is not limited to this.
Then, based on the recognition of the vocal information, a position of each of the plurality of logical points of the vocal book and a logical relationship between the plurality of logical points are determined. The plurality of logic points of the vocal book comprise a chapter starting point, a paragraph starting point, a sentence starting point and the like of the vocal book. The logical relationship among the multiple logical points of the audio book is the sequence among the starting points of all chapters, to which chapter starting point a paragraph starting point belongs, the sequence among the multiple paragraph starting points belonging to one chapter starting point, to which paragraph starting point a sentence starting point belongs, and the sequence among the multiple sentence starting points belonging to one paragraph starting point.
The specific process of determining the position of each logical point in the plurality of logical points of the vocal book and the logical relationship between the plurality of logical points based on the recognition of the vocal information comprises: converting the voice information of the audio book into Text characters by a Text-to-Speech (Text-to-Speech) technology, and identifying chapter relations of the audio book in the Text characters by the aid of keywords such as chapter languages, chapters and the like; since the pause time between paragraphs is significantly longer than that between sentences in the human voice information, which is different from the pause time between sentences, the pause time can be classified by using a classifier model, which can be, for example, an SVM (support vector machine) or a CNN (convolutional neural network), so as to identify the pause between paragraphs and the pause between sentences. And finally, obtaining the position of the chapter starting point, the position of the paragraph starting point, the position of the sentence starting point of the vocal book, and the relationship among the chapter starting point, the paragraph starting point and the sentence starting point, so as to obtain the logical relationship of a plurality of logical points in the logical information of the vocal book and the position of each logical point.
The position of the logical point of the vocal book can be represented by the identifier of the audio playing file to which the logical point of the vocal book belongs and the time progress of the logical point in the audio playing file to which the logical point belongs, for example, if the logical point is the starting point of the third section of the first chapter of the vocal book, the position of the logical point can be represented as [01, 00.40.00], 01 represents the identifier of the audio playing file to which the starting point of the third section of the first chapter of the vocal book belongs, and 00.40.00 represents the 40 th minute of the starting point of the third section of the first chapter of the vocal book in the audio playing file to which the logical point belongs.
It should be noted that the principle of obtaining the logic information of other audio books is the same as the principle described above, and therefore, the description is omitted here.
It should be noted that, for other audio-type playing objects, such as broadcast programs, the principle of obtaining the logic information of the playing object is the same as above, and therefore, the details are not described herein again. The determination of the plurality of logical points and the logical relationship among the plurality of logical points in the logical information of the playing object is related to the specific content of the playing object. The plurality of logical points of the play object may be divided into a plurality of levels, each level including at least one logical point, and a logical point of a lower level belongs to a logical point of a higher level corresponding thereto. For example, if the playing object is an audio book, the plurality of logical points may be divided into three levels, where the first level is a chapter start point, the second level is a paragraph start point, and the third level is a sentence start point, each paragraph start point has a chapter start point to which it belongs, and each sentence start point has a paragraph start point to which it belongs.
The presentation form of the logical information of the audio book may be, for example, a logical directory or a logical tree, and this application is not limited to this.
Next, the logical information of the audio book will be described by taking the presentation form of the logical information of the audio book as a logical directory.
If the audio data of the audio book comprises a plurality of audio playing files, the number of the audio playing files is the same as the number of chapters of the audio book, namely, one audio playing file corresponds to one chapter of the audio book. The logical points of the vocal book include a chapter starting point, a paragraph starting point, and a sentence starting point. The logical relationship between the logical points means: the chapter starting points are sequentially arranged, the chapter starting points to which the paragraph starting points belong, the sequence among the paragraph starting points belonging to the same chapter starting point, the paragraph starting point to which the sentence starting point belongs, and the sequence among the sentence starting points belonging to the same paragraph starting point.
Based on this, a plurality of logical points of the audio book can be divided into different levels, then, the corresponding directory level is determined according to the level of the logical point, specifically, the higher the level of the logical point is, the higher the directory level corresponding to the logical point is, and finally, the logical directory of the audio book is set according to the directory level corresponding to the logical point and the logical relationship between the logical points.
Specifically, the name of the voiced book is used as a first-level directory, each chapter starting point of the voiced book is used as a second-level directory, each chapter starting point is arranged in the second-level directory according to the sequence of the chapter starting points, each paragraph starting point of the voiced book is used as a third-level directory, and the third-level directory is arranged under each second-level directory, wherein the paragraph starting point corresponding to the third-level directory under the second-level directory belongs to the chapter starting point corresponding to the second-level directory, each paragraph starting point is arranged in the third-level directory under each second-level directory according to the sequence of the paragraph starting points, and similarly, each sentence starting point of the voiced book is used as a fourth-level directory, and each sentence starting point is arranged in the fourth-level directory under each third-level directory according to the sequence of the sentence starting points.
Fig. 4 is a schematic diagram of a logical directory of an audio book H provided in an embodiment of the present application, where the logical points of the audio book H include a chapter start point, a paragraph start point, and a sentence start point. As can be seen from fig. 4, the logical directories of the audio book H include a primary directory, a secondary directory, a tertiary directory, and a quaternary directory. Wherein the primary directory represents the name of the audio book H. The number of secondary directories is two, which are the starting point [01,00.00.00] of the first chapter and the starting point [02,00.00.00] of the second chapter. For the start point [01,00.00.00] of the first chapter, 01 denotes the identification of the audio play file to which the start point of the first chapter belongs, and 00.00.00 denotes the 0 th minute of the start point of the first chapter in the audio play file to which it belongs. For the start point [02,00.00.00] of the second chapter, 02 denotes the identification of the audio play file to which the start point of the second chapter belongs, and 00.00.00 denotes the 0 th minute of the start point of the second chapter in the audio play file to which it belongs. The starting point [01,00.00.00] of the first chapter includes two three-level directories, which are the starting point [01,00.01.00] of the first paragraph and the starting point [01,00.30.00] of the second paragraph, wherein, for the starting point [01,00.01.00] of the first paragraph, 01 denotes the identifier of the audio playing file to which the starting point of the first paragraph belongs, 00.01.00 denotes the 1 st minute of the starting point of the first paragraph in the audio playing file to which it belongs, for the starting point [01,00.30.00] of the second paragraph, 01 denotes the identifier of the audio playing file to which the starting point of the second paragraph belongs, and 00.30.00 denotes the 30 th minute of the starting point of the second paragraph in the audio playing file to which it belongs. The start point [02,00.00.00] of the second chapter includes two three-level directories, which are the start point [02,00.01.00] of the first paragraph and the start point [02,00.30.00] of the second paragraph, wherein for the start point [02,00.01.00] of the first paragraph, 02 indicates the identifier of the audio play file to which the start point of the first paragraph belongs, 00.01.00 indicates the 1 st minute of the start point of the first paragraph in the audio play file to which it belongs, for the start point [02,00.30.00] of the second paragraph, 02 indicates the identifier of the audio play file to which the start point of the second paragraph belongs, and 00.30.00 indicates the 30 th minute of the start point of the second paragraph in the audio play file to which it belongs.
Each three-level directory also comprises a plurality of four-level directories. The four-level directory is a sentence starting point, the former number in the bracket behind each sentence starting point represents the identification of the audio playing file to which the sentence starting point belongs, and the latter number represents the time progress of the sentence starting point in the audio playing file to which the sentence starting point belongs. As can be seen from the above, the higher the directory level is, the higher the logical point level it represents.
It should be noted that, if the presentation form of the logic information of the voiced book is a logic tree, the logic tree may be set according to the level of the logic point in the voiced book, specifically, the higher the level of the logic point is, the closer the logic point is to the root node of the logic tree, and the lower the level of the logic point is, the farther the logic point is from the root node of the logic tree.
It should be noted that, for other audio-type playing objects, such as broadcast programs, the presentation form of the logical information of the playing object may be, for example, a logical tree or a logical directory, and this application is not limited in particular. Since the construction principle of the logical tree and the logical directory has been described above, the description is omitted.
Next, a principle of specifying the playback position of the voiced book H will be described.
First, the latest stop position of the audio book H is compared with the position of each logical point of the audio book H, and the logical point corresponding to the latest stop position of the audio book H is determined. The logical point corresponding to the last stop position of the audio book H is the logical point having the same position as the last stop position of the audio book H, or the logical point corresponding to the last stop position of the audio book H is the logical point which is located closest to the last stop position of the audio book H and before the last stop position of the audio book H.
And then, determining the continuous playing position of the audio book H according to the stop playing time length of the audio book H and the logic point corresponding to the latest stop playing position of the audio book H.
Specifically, a jump strategy based on different time intervals may be preset, where the jump strategy may refer to, for example, a jump step corresponding to the time interval, and the jump step is used to indicate the number of logic points that jump forward or the number of levels that jump to a logic point at an upper stage. Based on this, the time length section corresponding to the stop playing time length of the audio book H is determined according to the stop playing time length of the audio book H, the corresponding jump strategy is determined according to the corresponding time length section, the continuous playing position of the audio book H is determined according to the logic point corresponding to the latest stop playing position of the audio book H and by combining the corresponding jump strategy, that is, the target logic point can be determined according to the corresponding jump strategy and the logic point corresponding to the latest stop playing position of the audio book H, and the position of the target logic point is determined as the continuous playing position of the audio book H. Wherein the continuous playing position of the voiced book H precedes its last stop playing position.
The skipping strategy can be obtained by counting the big data, and can also be realized by a machine learning algorithm, which is not specially limited in the present application. It should be noted that the foregoing jump strategy is only exemplary and is not intended to limit the present application.
Next, a process of specifying the resume position of the audio book H will be described by taking the logical list of the audio book H in fig. 4 as an example.
If the stop playing time of the audio book H is one month and the last stop playing position of the audio book H is [01,00.05.00], comparing the last stop playing position [01,00.05.00] of the audio book H with the position of each logic point in fig. 4, and determining the logic point corresponding to the last stop playing position [01,00.05.00] of the audio book H. As can be seen from fig. 4, the logical point corresponding to the last stop position [01,00.05.00] of the audio book H is the starting point [01,00.01.02] of the first sentence. The stop playing time of the audio book H is one month, a corresponding jump strategy is determined according to the stop playing time of one month, the jump step length indicated by the corresponding jump strategy is two levels of jump, the jump is carried out forward for two levels on the basis of the logic point [01,00.01.02] corresponding to the latest stop playing position [01,00.05.00] of the audio book H, and the position of the obtained target logic point is the continuous playing position of the audio book H. As can be seen from fig. 4, the resume position of the audio book H is the position of the start point of the first chapter [01,00.00.00 ].
In the first mode, according to the stop playing time of the audio book H and the last stop playing position of the audio book H, the position of one logic point in the logic information of the audio book H is determined as the continuous playing position of the audio book H, that is, the determination of the continuous playing position is related to the logic information of the audio book H, the stop playing time and the last stop playing position, so that the determination of the continuous playing position is more flexible, and the playing requirements of diversity can be better met. In addition, the resume position of the audio book H is before the stop position of the audio book H at the last time, so that when the audio book H is played to the user according to the resume position, the user can review the previous content, and the user experience is improved. In addition, the resume position of the audio book H is the position of one logic point of the audio book H, so that when the audio book H is played according to the resume position, a user can better review the previous content, and the user experience is further improved.
And secondly, determining the memory rate of the user for the audio book H according to the stop playing duration of the audio book H, and determining the continuous playing position of the audio book H according to the memory rate of the user for the audio book H and the latest stop playing position of the audio book H and by combining the logic information of the audio book H, wherein the logic information of the audio book is used for indicating the logic relationship among a plurality of logic points of the audio book and the position of each logic point. The resume position of the audio book H is the position of one logical point in the logical information thereof.
The process of determining the memory rate of the user for the voiced book H includes: and determining the memory rate of the user for the audio book H according to the stop playing time of the audio book H and a memory rate curve. The memory rate curve may be, for example, as shown in fig. 5, where the horizontal axis of the memory rate curve is the time length, and the vertical axis is the memory rate, so that the memory rate of the user for the voiced book H can be determined according to the memory rate curve when the off-air time length of the voiced book H is known.
It should be noted that, a memory rate calculation model may also be provided, and the memory rate calculation model is trained by the neural network model. By inputting the stop playing time of the audio book H into the memory rate calculation model, the memory rate calculation model can output the memory rate of the user to the audio book H.
Since the logic information about the audio book, the acquisition principle and the display mode thereof have been described above, they are not described herein again.
Next, a principle of specifying the playback position of the voiced book H will be described.
First, the latest stop position of the audio book H is compared with the position of each logical point of the audio book H, and the logical point corresponding to the latest stop position of the audio book H is determined. Since the principle of this process has already been explained above, it is not described here in detail.
And then, determining the continuous playing position of the audio book H according to the memory rate of the audio book H by the user and the logic point corresponding to the latest stop playing position of the audio book H.
Specifically, a jump strategy based on different memory rate intervals may be preset, where the jump strategy may refer to, for example, a jump step corresponding to the memory rate interval, and the jump step is used to indicate the number of logic points that jump forward or the number of levels that jump to a previous logic point, and the like, and this is not particularly limited in this embodiment of the present application.
Based on the above, the memory rate interval corresponding to the memory rate of the user for the audio book H is determined according to the memory rate of the user for the audio book H, the corresponding jump strategy is determined according to the corresponding memory rate interval, the continuous playing position of the audio book H is determined according to the logic point corresponding to the latest stop playing position of the audio book H and by combining the corresponding jump strategy, that is, the target logic point can be determined according to the corresponding jump strategy and the logic point corresponding to the latest stop playing position of the audio book H, and the position of the target logic point is determined as the continuous playing position of the audio book H. Wherein the continuous playing position of the voiced book H precedes its last stop playing position.
The skipping strategy can be obtained by counting the big data, and can also be realized by a machine learning algorithm, which is not specially limited in the present application. It should be noted that the foregoing jump strategy is only exemplary and is not intended to limit the present application.
In the second mode, the memory rate of the user for the audio book H is determined according to the stop playing duration of the audio book H, and the position of one logical point in the logical information of the audio book H is determined as the resume position of the audio book H according to the memory rate of the user for the audio book H and the last stop playing position of the audio book H, that is, the determination of the resume position is related to the logical information of the audio book H, the last stop playing position and the memory rate of the user for the audio book H, so that the determination of the resume position is more flexible, and the playing requirements of diversity are better met. In addition, the resume position of the audio book H is before the stop position of the audio book H at the last time, so that when the audio book H is played to the user according to the resume position, the user can review the previous content, and the user experience is improved. In addition, the resume position of the audio book H is the position of one logic point of the audio book H, so that when the audio book H is played according to the resume position, a user can better review the previous content, and the user experience is further improved. In addition, the position of one logic point in the plurality of logic points in the audio book H is determined as the continuous playing position according to the memory rate of the user for the audio book H and the latest stop playing position of the audio book H, namely the continuous playing position is determined to be related to the memory rate of the user for the audio book H, so that the continuous playing position can be determined based on the memory rates of different users, and the accuracy of determining the continuous playing position is further improved.
Thirdly, analyzing each vocal book in advance to obtain the position of the starting point of each sentence in each vocal book and the sequence of the starting points of the sentences. The representation of the position of the starting point of the sentence has been described above, and therefore, the description thereof is omitted here.
And acquiring the position of the starting point of each sentence in the audio book H according to the identification information of the audio book H.
And determining a corresponding jump strategy according to the off-air time of the audio book H, wherein the jump strategy is preset, and the jump strategy can refer to a jump step length corresponding to a time interval, and the jump step length is used for indicating the number of forward jumping sentences. Specifically, the longer the duration corresponding to the duration interval is, the more the number of forward-skipped sentences is. Therefore, the jump strategy corresponding to the time interval to which the off-air time belongs can be determined as the corresponding jump strategy.
And determining the starting point of the sentence corresponding to the latest stop-air position of the vocal book H according to the latest stop-air position of the vocal book H and the position of the starting point of each sentence in the vocal book H. The starting point of the sentence corresponding to the last stop position of the vocal book H is the starting point of the sentence having the same position as the last stop position of the vocal book H, or the starting point of the sentence corresponding to the last stop position of the vocal book H is the starting point of the sentence having the position before and closest to the last stop position of the vocal book H.
And determining the continuous playing position of the vocal book H according to the starting point of the sentence corresponding to the last stop playing position of the vocal book H and the corresponding jump strategy, namely determining the starting point of the target sentence according to the corresponding jump strategy and the starting point of the sentence corresponding to the last stop playing position of the vocal book H, and determining the position of the starting point of the target sentence as the continuous playing position of the vocal book H. Wherein the continuous playing position of the voiced book H precedes its last stop playing position.
It should be noted that the above description of the jump strategy is only exemplary and is not intended to limit the present application.
In the third mode, according to the stop playing time of the voiced book H and the latest stop playing position of the voiced book H, the position of the starting point of one sentence in the voiced book H is determined as the continuous playing position of the voiced book H, that is, the determination of the continuous playing position is related to the starting point of the sentence in the voiced book H, the stop playing time and the latest stop playing position, so that the determination of the continuous playing position of the voiced book H is more flexible, and the diversified playing requirements can be better met. In addition, the resume position of the audio book H is before the stop position of the audio book H at the last time, so that when the audio book H is played to the user according to the resume position, the user can review the previous content, and the user experience is improved.
Fourthly, analyzing each vocal book in advance to obtain the position of the starting point of each sentence in each vocal book and the sequence between the starting points of the sentences. The representation of the position of the starting point of the sentence has been described above, and therefore, the description thereof is omitted here.
And acquiring the position of the starting point of each sentence in the audio book H according to the identification information of the audio book H.
And determining the memory rate of the user for the audio book H according to the stop playing time of the audio book H. Since the principle of this step has already been explained above, it is not described here in detail.
And determining a corresponding jump strategy according to the memory rate of the user to the audio book H, wherein the jump strategy is preset, and the jump strategy can refer to a jump step length corresponding to a memory rate interval, and the jump step length is used for indicating the number of forward jumping sentences. Specifically, the lower the memory rate corresponding to the memory rate interval, the greater the number of forward-jumping sentences. In this way, the jump strategy corresponding to the memory rate interval to which the memory rate of the user for the audio book H belongs can be determined as the corresponding jump strategy.
And determining the starting point of the sentence corresponding to the latest stop-air position of the vocal book H according to the latest stop-air position of the vocal book H and the position of the starting point of each sentence of the vocal book H. Since this step has already been described above, it is not described here in detail.
And determining the continuous playing position of the audio book H according to the starting point of the sentence corresponding to the latest stop playing position of the audio book H and the corresponding jump strategy. Since the principle of this step has already been explained above, it is not described here in detail.
In the fourth mode, the memory rate of the user for the voiced book H is determined according to the stop playing duration of the voiced book H, and the position of the starting point of one sentence of the voiced book H is determined as the continuous playing position of the voiced book H according to the memory rate of the user for the voiced book H and the latest stop playing position of the voiced book H, that is, the determination of the continuous playing position is related to the starting point of the sentence of the voiced book H, the latest stop playing position and the memory rate of the user for the voiced book H, so that the determination of the continuous playing position is more flexible, and the playing requirements of diversity can be better met. In addition, the resume position of the audio book H is before the stop position of the audio book H at the last time, so that when the audio book H is played to the user according to the resume position, the user can review the previous content, and the user experience is improved. In addition, the position of the starting point of one sentence in the vocal book H is determined as the continuous playing position according to the memory rate of the user for the vocal book H and the stop playing position of the vocal book H, namely the continuous playing position is determined to be related to the memory rate of the user for the vocal book H, so that the continuous playing position can be determined based on the memory rates of different users, and the accuracy of determining the continuous playing position is improved.
It should be noted that the four ways described above are merely exemplary and are not used to limit the present application.
And step 207, the server sends the resume position of the vocal book H and the playing data corresponding to the resume position of the vocal book H to the mobile phone. The playback data may be, for example, an audio playback file to which the resume position of the audio book H belongs.
For better service of the user, in step 207, the server may also send the last stop position of the vocal book H and the audio playing file to which the last stop position of the vocal book H belongs to the mobile phone.
And step 208, the mobile phone receives the resume position of the audio book H and the play data corresponding to the resume position of the audio book H. In addition, the mobile phone can also receive the last stop-air position of the audio book H and the audio playing file to which the last stop-air position of the audio book H belongs, and cache the received data.
Step 209, the user clicks the resume control 301 of the audio book H.
Step 210, in response to the click operation, the mobile phone starts playing the corresponding playing data according to the resume position of the vocal book H, or if the audio playing file to which the last stop position of the vocal book H and the last stop position of the vocal book H belong is received, and in order to provide a better resume experience for the user, as shown in fig. 6, an inquiry dialog box for inquiring whether to resume playing according to the resume position or the stop position may be displayed in the interactive interface. The query statement in the query dialog box may be set by itself, for example, the query dialog box displays "whether to continue playing according to a new position after a long time from the playing" that is, the user is queried whether to play according to a continuous playing position, if the user clicks the positive control, the user continues playing according to the continuous playing position, and if the user clicks the negative control, the user continues playing according to the stop playing position. And if the user does not click the positive control and the negative control within the preset time, continuing playing according to the continuous playing position after the preset time.
Therefore, after the vocal book playing application is opened, the continuous playing request is sent to the server, so that the server responds to the continuous playing request to determine the continuous playing position of the vocal book H, and sends the continuous playing position of the vocal book H to the mobile phone.
It should be noted that, for the playing object of other audio classes (for example, a broadcast program, etc.), the principles of determining the playback position thereof, determining the logic information of the playing object, determining the memory rate of the user for the playing object, etc. are the same as the principles of the corresponding part of the audio book described above, and therefore, the details are not repeated here.
In other embodiments of the present application, the media application may also be a video playing application, that is, the type of the playing object is a video, for example, the playing object may be a tv show, a variety program, or the like. For a video-type playing object, the principle of determining the resume position of the playing object is substantially the same as the principle of determining the resume position of a vocal book, and the difference is the above expression of the logical information of the playing object, the position of the logical point, and the stop position of the playing object.
Next, a description will be given of a manner of indicating the stop position of the video-class playback object.
For the video-class playing object, the playing progress of the playing object is generally represented by at least one of the time progress, the identifier of the video playing file, and the like, so that the stop position of the video-class playing object can be represented by at least one of the time progress, the identifier of the video playing file, and the like. For example, the stop position of the playing object can be represented by the identifier of the video playing file to which the stop position of the playing object belongs and the time progress of the stop position of the playing object in the video playing file to which the stop position of the playing object belongs. For example, if the playing object is a television play, the stop position of the playing object may be represented as [03, 00.10.00], where 03 represents the identifier of the video playing file to which the stop position of the playing object belongs, 00.10.00 represents the 10 th minute of the stop position of the playing object in the video playing file to which the stop position belongs, and it should be noted that, regarding the representation of the subsequent time schedule, the unit of the first two digits is hour, the unit of the middle two digits is minute, and the unit of the second two digits is second. On this basis, if 03 indicates the identifier of the third episode of the drama, the stop position of the playback object is the 10 th minute in the third episode.
Next, logical information of the video playback object will be described.
The logic information of the video-class playing object comprises the logic relation of a plurality of logic points of the playing object and the position of each logic point. Because the video data of one playing object comprises at least one video playing file, different video playing files express different themes, and the theme expressed by one video playing file can also be divided into a plurality of sub-themes, each video playing file of the playing object can be divided into a plurality of video segments, and one video segment corresponds to one sub-theme. Thus, the logical points of the playing object include the start of each video playing file and the start of each video segment. The logical relationship among the logical points of the playing object is the sequence among the starting points of a plurality of video playing files of the playing object, the starting point of which video playing file the starting point of the video clip belongs to, and the sequence among the starting points of the video clips belonging to the starting point of the same video playing file. For example, if the playing object is a tv series, since a tv series includes multiple tv series, a tv series corresponds to a video playing file, a tv series can express a theme, and the theme expressed by a tv series can be divided into multiple sub-themes, the video playing file of a tv series can be divided into multiple video segments, and a video segment corresponds to a sub-theme. Thus, the plurality of logical points of the portion of the series includes a start of each episode of the series, a start of each video segment. The logical relation among the plurality of logical points refers to the sequence among the starting points of each episode of the television series, the starting points of the video segments belong to the starting point of which episode of the television series, and the sequence among the starting points of different video segments belonging to one episode of the television series.
It should be noted that, the plurality of logical points of the playing object may be divided into a plurality of levels, each level includes at least one logical point, and a logical point with a lower level belongs to a logical point with a higher level. For example, if the playing object is a tv series, the plurality of logical points of the tv series may be divided into two levels, where the first level is the start point of each episode of the tv series, the second level is the start point of a video segment, and the start point of each video segment has the start point of a tv series to which it belongs.
Next, the principle of acquiring the logical information of the video playback object will be described. The acquisition modes include the following two modes.
First, in a case where video data of a playback object does not include subtitle data:
first, audio data is acquired from video data of a playback object.
Then, the voice information is obtained from the audio data, and the steps are already described above, so that the detailed description is omitted here.
Finally, based on the recognition of the human voice information, the position of each of the plurality of logical points of the playing object and the logical relationship between the plurality of logical points are determined. Specifically, the voice information can be converted into text characters through a voice-to-text technology, semantic analysis is performed on the text characters, a plurality of semantic segments of the text characters are obtained according to a semantic analysis result, and each semantic segment corresponds to one subtopic. And obtaining a plurality of video clips of the video playing file according to each semantic clip and by combining the video playing file, thereby obtaining the sequence between the position of the starting point of each video clip in the plurality of video clips and the starting point of the video clip. And determining the position of the starting point of each video playing file, the sequence between the starting points of each video playing file, and finally the logical relationship of a plurality of logical points in the logical information of the playing object and the position of each logical point according to the video playing files and the sequence thereof.
Second, in case that the video data of the play object includes subtitle data:
and acquiring caption data from the video data of the playing object, identifying the caption data, and determining the position of each logical point in a plurality of logical points of the historical playing object and the logical relationship between the plurality of logical points. The principle of recognizing the subtitle data is the same as the above-mentioned principle of semantic analysis of text characters, and therefore, the description thereof is omitted here.
The position of the logical point of the video-class playing object is the same as the representation principle of the stop-playing position of the video-class playing object, and therefore, the description thereof is omitted. For example, if the playing object is a series and the logical point is the start point of the second video segment of the fifth episode of the series, the location of the logical point may be represented as [05, 00.20.00], 05 representing the identifier of the video playing file to which the start point of the second video segment of the fifth episode of the series belongs, and 00.20.00 representing the 20 th minute of the start point of the second video segment of the fifth episode of the series in the video playing file to which the start point belongs.
It should be noted that the representation of the position of the logical point is merely exemplary and is not intended to limit the present application.
The presentation form of the logic information of the video class playing object may be a logic tree or a logic directory, and the like, which is not particularly limited in this application embodiment.
Next, the logical information of a tv play will be described by taking an example in which the video-type playback object is a tv play and the presentation format of the logical information of the tv play is a logical tree.
If the video data of the tv play includes a plurality of video playing files, the number of the video playing files is the same as the number of the episode of the tv play, that is, one video playing file corresponds to one episode of the tv play. The logical points of the portion of the series include the start of each episode of the series, the start of each video segment. The logical relationship between the logical points means: the sequence between the starting points of each episode of the television series, the starting points of the video segments belong to the starting point of which episode of the television series, and the sequence between the starting points of the video segments belonging to the starting point of the same episode of the television series.
Based on the above, setting a root node, wherein the root node represents the name of the television play; setting child nodes (namely first-level child nodes) of a root node, wherein the number of the first-level child nodes is the same as the number of episodes of the series, one first-level child node represents the starting point of the series, and the sequence of the first-level child nodes from left to right or from right to left indicates the sequence of the starting points of the series represented by the first-level child nodes; and setting second-level sub-nodes under each first-level sub-node, wherein one second-level sub-node corresponds to the starting point of one video segment, the starting point of the video segment corresponding to one second-level sub-node belongs to the starting point of a set of television series corresponding to the first-level sub-node to which the video segment belongs, and the sequence of the starting points of the video segments represented by the second-level sub-nodes from left to right or from right to left under one first-level sub-node indicates the sequence of the starting points of the video segments represented by the second-level sub-nodes.
Fig. 7 is a schematic diagram of a logical tree of a series B provided in an embodiment of the present application, where the logical points of the series B include a start point of each episode of the series and a start point of a video segment. As can be seen from fig. 7, the tv series B includes a root node, a first level child node and a second level child node. Wherein the tracking node indicates the name of the series B. The number of the first level child nodes is three, which are respectively the starting point [01,00.00.00] of the first episode, the starting point [02,00.00.00] of the second episode, and the starting point [03,00.00.00] of the third episode. Wherein, regarding the start point [01,00.00.00] of the first episode, 01 denotes the identifier of the video play file to which the start point of the first episode belongs, and 00.00.00 denotes that the start point of the first episode is at the 0 th minute of the video play file to which it belongs. With respect to the start point [02,00.00.00] of the second episode, 02 denotes an identifier of a video play file to which the start point of the second episode belongs, and 00.00.00 denotes that the start point of the second episode is at the 0 th minute of the video play file to which it belongs. With respect to the start point [03,00.00.00] of the third series, 03 denotes an identifier of a video play file to which the start point of the third series belongs, and 00.00.00 denotes the 0 th minute of the start point of the third series in the video play file to which it belongs.
Each first-level child node comprises a plurality of second-level child nodes, wherein the second-level child nodes are the starting points of the video segments. The first number in parentheses after each second-level child node represents the identifier of the video playing file to which the start point of the video clip belongs, and the second number represents the time progress of the video clip in the video playing file to which the video clip belongs.
In other embodiments of the present application, the media application may also be a text-type application, i.e. the type of the playing object is text, e.g. the playing object is a novel, a paper, etc. For the playing object of the text class, the principle of determining the continuous playing position of the playing object is substantially the same as the principle of determining the continuous playing position of the audio book, and the difference lies in the logical information of the playing object and the obtaining mode thereof, and the representation of the position of the logical point and the stop playing position of the playing object.
Next, a description will be given of a manner of representing the stop position of the text-type playback object.
For a text type playing object, the playing progress of the playing object is usually represented by a page number, a paragraph number in the page number, a sentence number in the paragraph, and the like, and therefore, the stop position of the playing object can be represented by at least one of the page number, the paragraph number in the page number, the sentence number in the paragraph, and the like. For example, the stop position of the play object can be represented by a page number to which the stop position of the play object belongs, a paragraph number of the stop position of the play object in the page number to which the stop position belongs, and a sentence number of the stop position of the play object in the corresponding paragraph. For example, if the playing object is a novel, the stop position of the novel can be represented as [62, 03, 02], 62 represents that the stop position of the novel belongs to a page number of 62, 03 represents that the stop position of the novel belongs to a third section of the page to which the novel belongs, and 02 represents a second sentence that the stop position of the novel belongs to a third section of the page to which the novel belongs. As can be seen from the above, the stop position of the present novel is the second sentence of the third stage in the 62 th paragraph of the present novel.
It should be noted that, when the user is reading a text file, the device may directly obtain the page number of the current page read by the user, but the paragraph number and the sentence number cannot be directly obtained, so the total word number read by the user in the current page may be determined according to the time the user stays in the current page and the reading speed, and then the several sentences read by the user to the several paragraphs in the current page may be determined according to the total word number.
Next, logical information of the text-type playback object will be described.
The logical information of the playing object of the text class includes a logical relationship between a plurality of logical points of the playing object, and a position of each logical point. Since the text file of one play object includes a plurality of chapters, paragraphs, and sentences, the logical points of the play object of the text class include a plurality of chapter start points, paragraph start points, sentence start points, and the like. If the logic points of the playing object of the text class include chapter starting points, paragraph starting points and sentence starting points, the logic relationship among the logic points of the playing object of the text class is as follows: the chapter starting points are in sequence, the chapter starting points to which the paragraph starting points belong are in sequence, the paragraph starting points of one chapter starting point belong in sequence, the paragraph starting points to which the sentence starting points belong are in sequence, and the sentence starting points of the same paragraph starting point belong in sequence.
It should be noted that, the plurality of logical points of the playing object may be divided into a plurality of levels, each level includes at least one logical point, and a logical point with a lower level belongs to a logical point with a higher level. For example, if the playing object is a novel, the plurality of logical points of the novel can be divided into three levels, the first level is each chapter starting point, the second level is each paragraph starting point, each paragraph starting point has a starting point of a chapter to which the paragraph starting point belongs, and the third level is each sentence starting point, and each sentence starting point has a paragraph starting point to which the paragraph starting point belongs.
Next, the principle of acquiring logical information of a text-type playback object will be described.
Keywords in text data (i.e., text file) of the play object are identified. The key words may include, for example, but are not limited to, a directory, paragraph identifiers, line breaks, periods, page numbers, and the like. Based on the analysis of the keyword, determining the position of each logical point in the plurality of logical points of the playing object and the logical relationship between the plurality of logical points, namely according to the keyword: the directory, paragraph mark, line feed character, sentence number, page number, etc. determine the position and sequence (i.e. logical relationship) of chapter start point, paragraph start point, sentence start point of the playing object.
The position of the logical point of the text type playing object is the same as the representation principle of the stop position of the text type playing object, and therefore, the description is omitted here.
Fig. 8 is a second flowchart illustrating a method for determining a resume position according to an embodiment of the present application, where in fig. 8, a user terminal is taken as a tablet computer, a media application is taken as a video playing application, and a playing object is taken as a tv series, which is used to describe the method for determining the resume position. It should be noted that, in the resume position determining method shown in fig. 8, after the user opens the video playing application, the tablet pc sends a resume request to the server, and the server receives and determines, in response to the resume request, a resume position of each of at least one of the user's historically played television shows.
As shown in fig. 8, the resume position determining method includes the steps of:
step 801, clicking an icon of a video playing application in a tablet computer interface by a user, and opening the video playing application;
and step 802, the tablet computer displays an interactive interface of the video playing application.
Step 803, after the user clicks the icon of the video playing application in the tablet computer interface, the tablet computer sends a resume request to the server, and the resume request carries the identification information of the user and the initiation time of the resume request. Since the principle of this step has already been explained above, it is not described here in detail.
Step 804, the server receives the resume request, and determines the last stop time and stop position of at least one historical play television play of the user according to the identification information of the user carried in the resume request, that is, determines the last stop time and stop position of each historical play television play of the at least one historical play television play of the user.
In the embodiment of the present application, the user's history playing drama refers to a drama that the user has played in the video playing application. The specific number of at least one history playing television play can be set according to design requirements. For example, at least one of the historically played television shows may be all of the television shows that have been played to the user in the video playback application. For another example, since more and more tv shows may be played in the video playing application by the user over time, but at least one tv show that is recently played may be selected and played by the user, in order to ensure the user's experience of resuming playing, the calculation workload is reduced, and the at least one history playing tv show may also be the tv show that is recently played to the user in the video playing application.
If the user uses the video playing application, the video playing application uploads the stop time and stop position of each tv play played by the user in the video playing application to the server.
Based on this, the last stop-off time and stop-off position of all the television plays that have been played to the user can be obtained according to the identification information of the user, and then at least one history playing television play of the user is determined in all the television plays that have been played to the user, so as to obtain the last stop-off time and stop-off position of each history playing television play in the at least one history playing television play.
It should be noted that the concept of the last stop time and stop position of the history playing tv play is the same as that of the last stop time and stop position of the audio book in the above, and therefore, the description thereof is omitted. The representation of the last stop position of a history play series has already been explained above and is therefore not described in detail here.
Step 805, the server determines the stop-air time length of at least one historical play television play according to the last stop-air time of at least one historical play television play and the initiation time of the continuous play request. Namely, the stop-play duration of each historical play television play is respectively determined according to the latest stop-play duration of each historical play television play and the initiation time of the continuous play request. Since the principle of this step has already been explained above, it is not described here in detail.
Step 806, the server determines a resume position of at least one of the historically played dramas according to the stop time duration of at least one of the historically played dramas and the latest stop position of at least one of the historically played dramas, wherein the resume position of each of the historically played dramas is before the latest stop position thereof.
In the embodiment of the present application, the resume position of each of the at least one history play tv episode may be determined in the four ways of determining the resume position of the audio book.
For each of the historically played dramas, determining the resume position of the historically played drama includes the following four ways.
The method comprises the steps of firstly, determining a play-back position of a historical playing television play according to the play-back time length of the historical playing television play and the latest play-back position of the historical playing television play and the logic information of the historical playing television play, wherein the play-back position of the historical playing television play is the position of a logic point in the logic information.
It should be noted that, since the logic information of the history playing drama, the obtaining manner thereof, and the specific implementation principle of the first manner have been described above, no further description is given here.
And secondly, determining the memory rate of the user to the historical playing television series according to the stop-off duration of the historical playing television series, determining the resume position of the historical playing television series according to the memory rate of the user to the historical playing television series and the latest stop-off position of the historical playing television series and combining the logic information of the historical playing television series, wherein the resume position of the historical playing television series is the position of one logic point in the logic information.
It should be noted that, because the logic information and the obtaining manner of the history play drama, the determination manner of the user for the memory rate of the history play drama, and the specific implementation principle of the second manner have been described above, they are not described herein again.
Thirdly, analyzing the historical playing television play in advance to obtain the starting point of each statement in the historical playing television play and the sequence of the starting points of the statements. And determining a corresponding jump strategy according to the stop-play duration of the history playing television play, namely determining a jump step length, wherein the jump step length is used for indicating the number of forward jumping sentences. And determining the starting point of the sentence corresponding to the last stop-broadcast position of the history broadcast television play according to the last stop-broadcast position of the history broadcast television play and the position of the starting point of each sentence. And determining the continuous playing position of the historical playing television play according to the starting point of the sentence corresponding to the latest stop playing position of the historical playing television play and the corresponding jump strategy. Since the principle of the above process has been explained above, it is not described herein in detail.
Fourthly, analyzing the historical playing television play in advance to obtain the starting point of each statement in the historical playing television play and the sequence of the starting points of the statements. Determining the memory rate of the user to the historical playing television play according to the stop-play duration of the historical playing television play, and determining a corresponding jump strategy according to the memory rate of the user to the historical playing television play. And determining the starting point of the sentence corresponding to the last stop-broadcast position of the history broadcast television play according to the last stop-broadcast position of the history broadcast television play and the position of the starting point of each sentence. And determining the continuous playing position of the historical playing television play according to the starting point of the sentence corresponding to the latest stop playing position of the historical playing television play and the corresponding jump strategy. Since the principle of the above process has been explained above, it is not described herein in detail.
It should be noted that the effect achieved by each of the above modes is the same as that achieved by the above audio book portion, and therefore, the detailed description thereof is omitted here.
In step 807, the server sends the resume position of each of the at least one historically played dramas and the play data corresponding to the resume position of each of the at least one historically played dramas to the tablet computer. The play data corresponding to the resume position of one of the historically played dramas may be, for example, a video play file to which the resume position of the historically played drama belongs.
Step 808, the tablet computer receives the resume position of each of the at least one historically played dramas and the play data corresponding to the resume position of each of the at least one historically played dramas. In addition, the tablet computer can also receive play data corresponding to the last stop-off position of each of the at least one historically played dramas and the last stop-off position of each of the at least one historically played dramas, and cache the received data.
Step 809, selecting one historical playing drama from at least one historical playing drama as a target drama by the user, and clicking a resume control of the target drama.
Step 810, in response to the operation of clicking the target drama, the tablet pc starts playing corresponding playing data according to the resume position of the target drama, or if receiving playing data corresponding to the last stop position of each of the at least one historically played drama and the last stop position of each of the at least one historically played drama, and in order to provide a better resume experience for the user, may display an inquiry dialog box in the interactive interface, the inquiry dialog box being used to inquire whether the user resumes playing according to the resume position or the stop position, if the user clicks the positive control, resume playing is performed according to the resume position, and if the user clicks the negative control, resume playing is performed according to the stop position. And if the user does not click the positive control and the negative control within the preset time, continuing playing according to the continuous playing position after the preset time. It should be noted that the query dialog box has already been described above, and therefore, the description thereof is omitted here.
As can be seen from the above, the stop-air time of each historically played drama is determined according to the initiation time of the resume request and the latest stop-air time of each historically played drama, and the resume position of each historically played drama is determined according to the stop-air time of each historically played drama and the latest stop-air position of each historically played drama, so that the determination of the resume position of the historically played drama is more flexible, and thus diverse play requirements can be better met. In addition, after the video playing application is opened, the resume request can be sent to the server, so that the server responds to the resume request to determine the resume position of each historical playing televised series, and sends the resume position of each historical playing televised series to the tablet personal computer.
It should be noted that, the principle of determining the continuous playing position of the history playing object (e.g., the variety program) of other video classes is as described above, and therefore, the description thereof is omitted here.
In other embodiments of the present application, the history playing object may also be an audio-class or text-class history playing object. Since the principle of determining such a history play object is as described above, it will not be described herein.
In summary, the present application provides a resume position determining method, which is applied to a server, and as shown in fig. 9, the method includes the following steps:
Specifically, the following two modes are adopted, wherein:
firstly, logic information of at least one part of historical playing objects is acquired in advance, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point; then, according to the stop-play duration of at least one part of historical play objects, the latest stop-play position of at least one part of historical play objects and the logic information of at least one part of historical play objects, the continuous play position of at least one part of historical play objects is determined; and the continuous playing position of each historical playing object in at least one part of the historical playing objects is the position of one logic point in the logic information.
Secondly, logic information of at least one part of historical playing objects is acquired in advance, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point; determining the memory rate of the user to at least one part of historical playing objects according to the off-air time of at least one part of historical playing objects, and specifically determining the memory rate of the user to at least one part of historical playing objects according to the off-air time of at least one part of historical playing objects and by combining a memory rate curve; determining the continuous playing position of at least one part of historical playing objects according to the memory rate of the user to at least one part of historical playing objects, the latest stop playing position of at least one part of historical playing objects and the logic information of at least one part of historical playing objects; and the continuous playing position of each historical playing object in at least one part of the historical playing objects is the position of one logic point in the logic information.
It should be noted that the two manners described above are merely exemplary and are not intended to limit the present application, and for example, the resume position may also be determined by the other two manners shown above.
Next, the principle of acquiring the logical information of the history playback object will be explained for each history playback object.
If the type of the historical playing object is audio, acquiring voice information from the audio data of the historical playing object; based on the recognition of the human voice information, a position of each of the plurality of logical points of the history play object and a logical relationship between the plurality of logical points are determined. If the history playing object is a vocal book, the plurality of logic points of the history playing object comprise a chapter starting point, a paragraph starting point and a sentence starting point of the vocal book.
If the type of the historical playing object is a video and the video data of the historical playing object does not include subtitle data, acquiring audio data from the video data of the historical playing object; acquiring human voice information from audio data; based on the identification of the human voice information, a position of each of the plurality of logical points of the history play object and a logical relationship between the plurality of logical points are determined.
If the type of the historical playing object is a video and the video data of the historical playing object comprises subtitle data, acquiring the subtitle data from the video data of the historical playing object; based on the identification of the subtitle data, a position of each of the plurality of logical points of the historical play object and a logical relationship between the plurality of logical points are determined.
If the type of the historical playing object is a text, identifying keywords in the text data of the historical playing object; based on the analysis of the keywords, the position of each of the plurality of logical points of the history playing object and the logical relationship between the plurality of logical points are determined.
It should be noted that the implementation principle and details of the above process have been described above, and therefore are not described herein again.
In addition, the continuous playing position of the historical playing object is in front of the stop playing position of the historical playing object, so that when the historical playing object is played to a user according to the continuous playing position of the historical playing object, the user can review the previous content, and the user experience is improved.
Fig. 10 is a fourth flowchart of the resume position determining method according to the embodiment of the present application, and in fig. 10, the method for determining the resume position is described by taking the user terminal as a mobile phone, the media application as a text reading application, and the playing object as a novel. It should be noted that, in the resume position determining method shown in fig. 10, after the user opens the text reading application and determines the novel to be played in the text reading application, the mobile phone sends a resume request to the server, and the server receives and determines the resume position of the novel to be played in response to the resume request.
As shown in fig. 10, the process of resuming the location determination includes the following steps:
1001, clicking an icon of a text reading application in a mobile phone interface by a user, and opening the text reading application;
step 1002, the mobile phone displays an interactive interface of the text reading application.
Step 1003, the user selects a novel from the interactive interface of the text reading application as the novel to be played, and clicks the continuous playing control of the novel to be played. The novel to be played is one of the novels that the user has read in the text reading application. It should be noted that, since the type of the playback object here is text, the playback here can be understood as display.
Step 1004, the mobile phone responds to the operation of clicking the continuous playing control of the novel to be played, and generates a continuous playing request, wherein the continuous playing request carries the identification information of the user, the identification information of the novel to be played and the initiation time of the continuous playing request.
In the embodiment of the application, after the user clicks the continuous playing control of the novel to be played, the mobile phone obtains the identification information of the user and the identification information of the novel to be played, determines the current time as the initiation time of the continuous playing request, and then generates the continuous playing request according to the identification information of the user, the identification information of the novel to be played and the initiation time of the continuous playing request.
Step 1005, the mobile phone sends the continuous playing request to the server.
Step 1006, the server receives the continuous playing request, and determines the latest stop time and stop position of the novel to be played according to the identification information of the user and the identification information of the novel to be played carried by the continuous playing request.
In the embodiment of the present application, if the user uses the text reading application, the text reading application uploads the stop time and stop position of each novel read by the user in the text reading application to the server.
Based on the above, the latest stop time and stop position of the novel to be played can be obtained according to the identification information of the user and the identification information of the novel to be played. The latest stop-play time of the novel to be played is the time when the user stops playing the novel to be played the latest time, and the latest stop-play position of the novel to be played is the playing progress of the novel to be played when the user stops playing the novel to be played the latest time.
Since the representation of the stop-and-play position of the text type playing object has been described above, it is not described herein again.
Step 1007, the server calculates the stop-air time of the novel to be played according to the latest stop-air time of the novel to be played and the initiation time of the continuous playing request. Since the principle of this step has already been explained above, it is not described here in detail.
Step 1008, the server determines a continuous playing position of the novel to be played according to the stop playing time of the novel to be played and the latest stop playing position of the novel to be played, wherein the continuous playing position of the novel to be played is before the latest stop playing position.
In the embodiment of the present application, the resume position of the novel to be played can be determined in four ways as described in step 206. The specific mode is as follows:
the first method comprises the steps of determining a continuous playing position of a novel to be played according to the stop playing time of the novel to be played and the latest stop playing position of the novel to be played and the logic information of the novel to be played, wherein the logic information of the novel to be played is used for indicating the logic relationship among a plurality of logic points of the novel to be played and the position of each logic point. The continuous playing position of the novel to be played is the position of one logic point in the logic information.
It should be noted that, since the logic information of the novel to be played, the obtaining manner thereof, and the specific execution process of the first manner have been described above, they are not described herein again.
And secondly, determining the memory rate of the novel to be played by the user according to the stop-play duration of the novel to be played, determining the continuous playing position of the novel to be played by the user according to the memory rate of the novel to be played by the user and the latest stop-play position of the novel to be played, and combining the logic information of the novel to be played, wherein the continuous playing position of the novel to be played is the position of one logic point in the logic information.
It should be noted that, because the logic information of the novel to be played and the obtaining manner thereof, the determining manner of the memory rate of the novel to be played by the user, and the specific implementation principle of the second manner have been described above, they are not described herein again.
And thirdly, analyzing the novel to be played in advance to obtain the starting point of each sentence in the novel to be played and the sequence of the starting points of the sentences. And determining a corresponding jump strategy according to the stop-play duration of the novel to be played, namely determining a jump step length, wherein the jump step length is used for indicating the number of forward-jumping sentences. And determining the starting point of the sentence corresponding to the latest stop-broadcasting position of the novel to be played according to the latest stop-broadcasting position of the novel to be played and the position of the starting point of each sentence. And determining the continuous playing position of the novel to be played according to the starting point of the sentence corresponding to the latest stop playing position of the novel to be played and the corresponding jump strategy. Since the principle of the above process has been explained above, it is not described herein in detail.
And fourthly, analyzing the novel to be played in advance to obtain the starting point of each sentence in the novel to be played and the sequence between the starting points of the sentences. Determining the memory rate of the user to the novel to be played according to the stop playing time of the novel to be played, and determining a corresponding skip strategy according to the memory rate of the user to the novel to be played. And determining the starting point of the sentence corresponding to the latest stop-broadcasting position of the novel to be played according to the latest stop-broadcasting position of the novel to be played and the position of the starting point of each sentence. And determining the continuous playing position of the novel to be played according to the starting point of the sentence corresponding to the latest stop playing position of the novel to be played and the corresponding jump strategy. Since the principle of the above process has been explained above, it is not described herein in detail.
It should be noted that the effect achieved by each of the above modes is the same as that achieved by the above audio book portion, and therefore, the detailed description thereof is omitted here.
And step 1009, the server sends the resume position of the novel to be played and the playing data corresponding to the resume position of the novel to be played to the mobile phone. In addition, the server can also send the playing data corresponding to the latest stop-air position of the novel to be played and the latest stop-air position of the novel to be played to the mobile phone.
Step 1010, the mobile phone receives the resume position of the novel to be played and the play data corresponding to the resume position of the novel to be played. In addition, the mobile phone can also receive the latest stop-play position of the novel to be played and the playing data corresponding to the latest stop-play position of the novel to be played, and cache the received data.
Step 1011, the mobile phone starts playing the corresponding playing data according to the resume position of the novel to be played, or if receiving the stop position of the novel to be played and the playing data corresponding to the stop position of the novel to be played, and in order to provide better resume experience for the user, an inquiry dialog box can be displayed in the interactive interface, the inquiry dialog box is used for inquiring whether the user resumes playing according to the resume position or the stop position, if the user clicks the positive control, resuming playing according to the resume position, and if the user clicks the negative control, resuming playing according to the stop position. And if the user does not click the positive control and the negative control within the preset time, continuing playing according to the continuous playing position after the preset time. It should be noted that the query dialog box has already been described above, and therefore, the description thereof is omitted here.
In addition, the continuous playing position is in front of the stop playing position, so that when the novel to be played is played to the user according to the continuous playing position, the user can review the previous content, the user experience is improved, in addition, the continuous playing position of the novel to be played is only determined, the calculated amount is reduced, and the calculation efficiency is improved.
It should be noted that, the principle of determining the playback position of an object to be played (e.g., a paper) of another text class is as described above, and therefore, the description is omitted here.
In other embodiments of the present application, the playback object may also be an audio-class or video-class playback object. The principle of determining the playback position of such a playback object is the same as that of the playback position of the text playback object, and therefore, the description thereof is omitted here.
In summary, the present application provides a resume position determining method, which is applied to a server, and as shown in fig. 11, the method includes the following steps:
And 1104, determining a resume position of the object to be played according to the stop time of the object to be played and the latest stop position of the object to be played, wherein the resume position of the object to be played is before the latest stop position of the object to be played.
Specifically, the following two modes are adopted, wherein:
first, logic information of an object to be played is obtained in advance, where the logic information of the object to be played is used to indicate a logic relationship among a plurality of logic points of the object to be played and a position of each logic point. And then, determining a continuous playing position of the object to be played according to the stop playing time of the object to be played and the latest stop playing position of the object to be played and the logic information of the object to be played, wherein the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
Secondly, logic information of an object to be played is obtained in advance, wherein the logic information of the object to be played is used for indicating a logic relation among a plurality of logic points of the object to be played and the position of each logic point, then, the memory rate of the object to be played of a user is determined according to the stop playing time of the object to be played of the user, and specifically, the memory rate of the object to be played of the user is determined according to the stop playing time of the object to be played of the user and by combining a memory rate curve; determining a continuous playing position of the object to be played according to the memory rate of the object to be played and the latest stop playing position of the object to be played by the user and the logic information of the object to be played; and the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
It should be noted that the two manners described above are merely exemplary and are not intended to limit the present application, and for example, the resume position may also be determined by the other two manners shown above.
Next, a principle of acquiring logical information of an object to be played will be explained.
If the type of the object to be played is audio, acquiring voice information from audio data of the object to be played; based on the recognition of the human voice information, the position of each logical point in the plurality of logical points of the object to be played and the logical relationship between the plurality of logical points are determined. If the object to be played is a voiced book, the plurality of logical points of the object to be played comprise a chapter starting point, a paragraph starting point and a sentence starting point of the voiced book.
If the type of the object to be played is a video and the video data of the object to be played does not include subtitle data, acquiring audio data from the video data of the object to be played; acquiring human voice information from audio data; based on the recognition of the human voice information, the position of each logical point in the plurality of logical points of the object to be played and the logical relationship between the plurality of logical points are determined.
If the type of the object to be played is a video and the video data of the object to be played comprises subtitle data, acquiring the subtitle data from the video data of the object to be played; based on the identification of the subtitle data, the position of each logical point in the plurality of logical points of the object to be played and the logical relationship between the plurality of logical points are determined.
If the type of the object to be played is a text, identifying keywords in text data of the object to be played; and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the analysis of the keywords.
It should be noted that the implementation principle and details of the above process have been described above, and therefore are not described herein again.
To sum up, the stop-air time of the object to be played is determined according to the initiation time of the continuous-air request and the latest stop-air time of the object to be played, and the continuous-air position of the object to be played is determined according to the stop-air time of the object to be played and the latest stop-air position of the object to be played, so that the determination of the continuous-air position of the novel to be played is more flexible, and the playing requirement of diversity is better met.
Fig. 12 is a schematic structural diagram of an electronic device according to an embodiment of the present application. As shown in fig. 12, the components of electronic device 1200 may include, but are not limited to: the at least one processor 1210, the at least one memory 1220, the at least one data storage 1270, and the bus 1230 connecting the various system components including the memory 1220, the processor 1210, and the data storage 1270.
The memory 1220 stores a computer program, which can be executed by the processor 1210, so that the processor 1210 executes the steps executed by the server in the present specification.
The memory 1220 may include volatile memory units, such as a random access memory unit (RAM)12201 and/or a cache memory unit 12202, and may further include a read only memory unit (ROM) 12203.
A data storage 1270 for storing data required and generated by the above-mentioned services during execution, such as logic information, stop-and-go position, stop-and-go time, stop-and-go duration, resume position, etc.
The electronic device 1200 may also communicate with one or more external devices 1260 (e.g., keyboard, pointing device, bluetooth device, etc.) via input/output (I/O) interfaces 1240. The electronic device 1200 may also communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the internet) through the network adapter 1250. As shown, the network adapter 1250 communicates with the other modules of the electronic device 1200 via the bus 1230. It should be appreciated that although not shown, other hardware and/or software modules may be used in conjunction with the electronic device 1200, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
The present application also provides a computer-readable storage medium comprising a computer program which, when executed on a computer, causes the computer to perform any one of the methods described in the method embodiments above.
The present application also provides a computer program for performing any one of the methods described in the above method embodiments when the computer program is executed by a computer.
The present application further provides a chip, which includes a processor and a memory, where the memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute any one of the methods described in the above method embodiments.
Further, the chip may also include a memory and a communication interface. The communication interface may be an input/output interface, a pin or an input/output circuit, etc.
In implementation, the steps of the above method embodiments may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software. The processor may be a general purpose processor, a Digital Signal Processor (DSP), an application-specific integrated circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, or discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in the embodiments of the present application may be directly implemented by a hardware encoding processor, or implemented by a combination of hardware and software modules in the encoding processor. The software module may be located in ram, flash memory, rom, prom, or eprom, registers, etc. storage media as is well known in the art. The storage medium is located in a memory, and a processor reads information in the memory and completes the steps of the method in combination with hardware of the processor.
The memory referred to in the various embodiments above may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a read-only memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an electrically Erasable EPROM (EEPROM), or a flash memory. Volatile memory can be Random Access Memory (RAM), which acts as external cache memory. By way of example, but not limitation, many forms of RAM are available, such as Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), Synchronous Dynamic Random Access Memory (SDRAM), double data rate SDRAM, enhanced SDRAM, SLDRAM, Synchronous Link DRAM (SLDRAM), and direct rambus RAM (DR RAM). It should be noted that the memory of the systems and methods described herein is intended to comprise, without being limited to, these and any other suitable types of memory.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application or portions thereof that substantially contribute to the prior art may be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (personal computer, server, network device, or the like) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above description is only for the specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present application, and shall be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.
Claims (28)
1. A resume position determination method, comprising:
receiving a continuous playing request, wherein the continuous playing request carries identification information of a user and initiation time of the continuous playing request;
determining the latest stop time and stop position of at least one part of historical playing objects of the user according to the identification information of the user;
determining the off-air time of at least one part of historical playing objects according to the latest off-air time and the initiation time of at least one part of historical playing objects;
determining the continuous playing position of at least one part of historical playing objects according to the stop playing time of at least one part of historical playing objects and the latest stop playing position of at least one part of historical playing objects, wherein:
the resume position of each of the at least a portion of the historical play objects is prior to its last stop position.
2. The method of claim 1, further comprising:
acquiring logic information of at least one part of historical playing objects, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point;
the determining the resume position of the at least a part of the historical playing object comprises:
determining the continuous playing position of at least one part of historical playing objects according to the stop playing duration of at least one part of historical playing objects and the latest stop playing position of at least one part of historical playing objects and the logic information of at least one part of historical playing objects;
and the continuous playing position of each historical playing object in the at least part of the historical playing objects is the position of one logic point in the logic information.
3. The method of claim 1,
the determining the resume position of the at least a part of the historical playing object comprises:
determining the memory rate of the user to the at least one part of historical playing objects according to the off-air time of the at least one part of historical playing objects;
and determining the continuous playing position of the at least one part of the historical playing object according to the memory rate of the user to the at least one part of the historical playing object and the latest stop playing position of the at least one part of the historical playing object.
4. The method of claim 1, further comprising:
acquiring logic information of at least one part of historical playing objects, wherein the logic information of each historical playing object in the at least one part of historical playing objects is used for indicating the logic relationship among a plurality of logic points of the historical playing objects and the position of each logic point;
the determining the resume position of the at least a part of the historical playing object comprises:
determining the memory rate of the user to the at least one part of historical playing objects according to the off-air time of the at least one part of historical playing objects;
determining a continuous playing position of at least one part of historical playing objects according to the memory rate of the user to the at least one part of historical playing objects, the latest stop playing position of the at least one part of historical playing objects and the logic information of the at least one part of historical playing objects;
and the continuous playing position of each historical playing object in the at least part of the historical playing objects is the position of one logic point in the logic information.
5. The method of claim 3 or 4, wherein the determining the user's memory rate of the at least a portion of the historical playback objects comprises:
and determining the memory rate of the user to the at least one part of the historical playing objects according to the off-air time length of the at least one part of the historical playing objects and a memory rate curve.
6. The method according to any one of claims 2-5, wherein the type of the history playing object is audio;
for each history playing object, acquiring the logic information of the history playing object comprises:
acquiring voice information from the audio data of the historical playing object;
based on the identification of the voice information, determining a position of each of a plurality of logical points of the history playing object and a logical relationship between the plurality of logical points.
7. The method according to any one of claims 2 to 5, wherein the history playing object is a vocal book; the plurality of logical points of the history playing object include a chapter start point, a paragraph start point, and a sentence start point of the audio book.
8. The method according to any one of claims 2 to 5, wherein the type of the history playing object is video;
for each history playing object, acquiring the logic information of the history playing object comprises:
acquiring audio data from the video data of the historical playing object;
acquiring human voice information from the audio data;
based on the identification of the voice information, determining a position of each of a plurality of logical points of the history playing object and a logical relationship between the plurality of logical points.
9. The method according to any one of claims 2 to 5, wherein the type of the history playing object is video;
for each history playing object, acquiring the logic information of the history playing object comprises:
acquiring subtitle data from the video data of the historical playing object;
based on the identification of the subtitle data, determining a position of each of a plurality of logical points of the historical play object and a logical relationship between the plurality of logical points.
10. The method according to any one of claims 2 to 5, wherein the type of the history playing object is text;
for each history playing object, acquiring the logic information of the history playing object comprises:
identifying keywords in the text data of the historical playing object;
based on the analysis of the keywords, determining the position of each logical point in the plurality of logical points of the historical playing object and the logical relationship between the plurality of logical points.
11. The method according to any one of claims 1 to 10, wherein the at least one part of the history playing object is a playing object played to the user last time.
12. The method according to any one of claims 1 to 10, further comprising:
and sending the resume position of the at least part of the historical playing object and playing data which is used for the user terminal to cache and corresponds to the resume position of the at least part of the historical playing object to the user terminal.
13. A resume position determination method, comprising:
receiving a continuous playing request, wherein the continuous playing request carries identification information of a user, identification information of an object to be played and initiation time of the continuous playing request;
determining the latest stop time and stop position of the object to be played according to the identification information of the object to be played and the identification information of the user;
determining the broadcasting stop duration of the object to be played according to the broadcasting stop time and the initiating time;
and determining the continuous playing position of the object to be played according to the stop playing time length and the stop playing position, wherein the continuous playing position of the object to be played is before the stop playing position.
14. The method of claim 13, further comprising:
acquiring logic information of the object to be played, wherein the logic information of the object to be played is used for indicating the logic relationship among a plurality of logic points of the object to be played and the position of each logic point;
the determining the continuous playing position of the object to be played comprises:
determining the continuous playing position of the object to be played according to the off-air time length and the off-air position and by combining the logic information of the object to be played;
and the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
15. The method of claim 13,
the determining the continuous playing position of the object to be played comprises:
determining the memory rate of the user to the object to be played according to the off-air time of the object to be played;
and determining the continuous playing position of the object to be played according to the memory rate of the user to the object to be played and the latest stop playing position of the object to be played.
16. The method of claim 13, further comprising:
acquiring logic information of the object to be played, wherein the logic information of the object to be played is used for indicating the logic relationship among a plurality of logic points of the object to be played and the position of each logic point;
the determining the continuous playing position of the object to be played comprises:
determining the memory rate of the user to the object to be played according to the off-air time;
determining a continuous playing position of the object to be played according to the memory rate and the stop playing position and by combining the logic information of the object to be played;
and the continuous playing position of the object to be played is the position of a logic point in the logic information of the object to be played.
17. The method according to claim 15 or 16, wherein the determining the user's memory rate of the object to be played according to the off-air time period comprises:
and determining the memory rate of the user to the object to be played according to the off-air time and by combining a memory rate curve.
18. The method according to any one of claims 13-17, wherein the type of the object to be played is audio;
the acquiring the logic information of the object to be played comprises:
acquiring voice information from the audio data of the object to be played;
and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the voice information.
19. The method according to any one of claims 13 to 17, wherein the object to be played is a vocal book; the plurality of logic points of the object to be played comprise a chapter starting point, a paragraph starting point and a sentence starting point of the audio book.
20. The method according to any one of claims 13 to 17, wherein the type of the object to be played is a video;
the acquiring the logic information of the object to be played comprises:
acquiring audio data from the video data of the object to be played;
acquiring human voice information from the audio data;
and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the voice information.
21. The method according to any one of claims 13 to 17, wherein the type of the object to be played is a video;
the acquiring the logic information of the object to be played comprises:
acquiring subtitle data from the video data of the object to be played;
and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the identification of the caption data.
22. The method according to any one of claims 13 to 17, wherein the type of the object to be played is text;
the acquiring the logic information of the object to be played comprises:
identifying keywords in the text data of the object to be played;
and determining the position of each logic point in the plurality of logic points of the object to be played and the logic relation among the plurality of logic points based on the analysis of the keywords.
23. The method of any one of claims 13 to 22, further comprising:
and sending the resume position and play data corresponding to the resume position and used for caching by the user terminal to the user terminal.
24. An electronic device, comprising:
the communication module is used for communicating with other equipment;
one or more storage memories for storing one or more computer programs;
one or more processors configured to execute the one or more computer programs to cause the electronic device to perform the method of any of claims 1-12 or 13-23.
25. A continuous play system, comprising: a user terminal and an electronic device according to claim 24;
the user terminal is configured to send a resume request to the electronic device.
26. A computer readable storage medium comprising a computer program which, when executed on a computer, causes the computer to perform the method of any of claims 1 to 12 or 13 to 23.
27. A computer program for performing the method of any one of claims 1 to 12 or 13 to 23 when executed by a computer.
28. A chip comprising a processor and a memory, the memory for storing a computer program, the processor for invoking and running the computer program stored in the memory to perform the method of any one of claims 1-12 or 13-23.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010617722.7A CN113868445A (en) | 2020-06-30 | 2020-06-30 | Continuous playing position determining method and continuous playing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010617722.7A CN113868445A (en) | 2020-06-30 | 2020-06-30 | Continuous playing position determining method and continuous playing system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113868445A true CN113868445A (en) | 2021-12-31 |
Family
ID=78981820
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010617722.7A Pending CN113868445A (en) | 2020-06-30 | 2020-06-30 | Continuous playing position determining method and continuous playing system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113868445A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114979769A (en) * | 2022-06-01 | 2022-08-30 | 山东福生佳信科技股份有限公司 | Video continuous playing progress management system and method |
-
2020
- 2020-06-30 CN CN202010617722.7A patent/CN113868445A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114979769A (en) * | 2022-06-01 | 2022-08-30 | 山东福生佳信科技股份有限公司 | Video continuous playing progress management system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7335062B2 (en) | Voice service providing method and apparatus | |
US11302337B2 (en) | Voiceprint recognition method and apparatus | |
US9824150B2 (en) | Systems and methods for providing information discovery and retrieval | |
US9438850B2 (en) | Determining importance of scenes based upon closed captioning data | |
CN106971009B (en) | Voice database generation method and device, storage medium and electronic equipment | |
CN117194609A (en) | Providing command bundle suggestions for automated assistants | |
US9190049B2 (en) | Generating personalized audio programs from text content | |
CN109979450B (en) | Information processing method and device and electronic equipment | |
US12086503B2 (en) | Audio segment recommendation | |
CN111177542B (en) | Introduction information generation method and device, electronic equipment and storage medium | |
US12026354B2 (en) | Video generation | |
CN109710799B (en) | Voice interaction method, medium, device and computing equipment | |
CN112987996B (en) | Information display method, information display device, electronic equipment and computer readable storage medium | |
CN105244027A (en) | Method of generating homophonic text and system thereof | |
US11775070B2 (en) | Vibration control method and system for computer device | |
US20200112755A1 (en) | Providing relevant and authentic channel content to users based on user persona and interest | |
CN113868445A (en) | Continuous playing position determining method and continuous playing system | |
CN115687807A (en) | Information display method, device, terminal and storage medium | |
CN112989102A (en) | Audio playing control method and device, storage medium and terminal equipment | |
JP7166370B2 (en) | Methods, systems, and computer readable recording media for improving speech recognition rates for audio recordings | |
CN109994000B (en) | Reading accompanying method, electronic equipment and computer storage medium | |
CN112017487A (en) | Flat Flash learning system based on artificial intelligence | |
CN110287365B (en) | Data processing method and electronic equipment | |
CN116405713A (en) | Audio recommendation method, device, medium and computing equipment | |
CN118042235A (en) | Service information pushing method and device, computer equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |