CN115150633A - Processing method for live broadcast reading, computer equipment and storage medium - Google Patents

Processing method for live broadcast reading, computer equipment and storage medium Download PDF

Info

Publication number
CN115150633A
CN115150633A CN202210776573.8A CN202210776573A CN115150633A CN 115150633 A CN115150633 A CN 115150633A CN 202210776573 A CN202210776573 A CN 202210776573A CN 115150633 A CN115150633 A CN 115150633A
Authority
CN
China
Prior art keywords
text content
anchor
audio
preset text
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210776573.8A
Other languages
Chinese (zh)
Inventor
曾家乐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Cubesili Information Technology Co Ltd
Original Assignee
Guangzhou Cubesili Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Cubesili Information Technology Co Ltd filed Critical Guangzhou Cubesili Information Technology Co Ltd
Priority to CN202210776573.8A priority Critical patent/CN115150633A/en
Publication of CN115150633A publication Critical patent/CN115150633A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • H04N21/4316Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The application discloses a processing method for live broadcast reading, computer equipment and a storage medium, wherein the method comprises the following steps: receiving an audio stream uploaded by a main broadcast terminal of a main broadcast in the process of live broadcast reading of preset text content; matching the audio stream with the preset text content to determine an audio segment matched with the preset text content; and at least sending the text content segment corresponding to the audio segment to a spectator terminal in a live broadcast room, so that the spectator terminal at least displays the text content segment corresponding to the audio segment on a live broadcast interface of the spectator terminal. Through the mode, the interactive function in the live broadcast reading process can be enriched.

Description

Processing method for live broadcast reading, computer equipment and storage medium
Technical Field
The present application relates to the field of live broadcast technologies, and in particular, to a live broadcast reading processing method, a computer device, and a storage medium.
Background
With the rapid development of the live broadcast industry, live broadcast becomes an important way for people to entertain through the internet. In the live broadcasting process, the anchor performs at the anchor terminal, and a user can watch the performance of the anchor at an audience terminal and can interact with the anchor through the audience terminal.
In a live broadcast of a reading channel, a main broadcast can read various books, and a user can listen to the books read by the main broadcast in the live broadcast. However, in the live broadcasting process, the anchor only simply reads the preset text content and interacts with the user, so that the interactive function in the live broadcasting reading process is lacked, and the experience of the user in the live broadcasting watching process is not favorably improved.
Disclosure of Invention
The technical problem mainly solved by the application is to provide a live broadcast reading processing method, computer equipment and a storage medium, and the interaction function in the live broadcast reading process can be enriched.
In order to solve the technical problem, the application adopts a technical scheme that: a processing method of live broadcast reading is provided, and the method comprises the following steps: receiving an audio stream uploaded by a main broadcast terminal of a main broadcast in the process of live broadcast reading of preset text content; matching the audio stream with the preset text content to determine an audio segment matched with the preset text content; and at least sending the text content segment corresponding to the audio segment to a viewer terminal in a live broadcast room, so that the viewer terminal at least displays the text content segment corresponding to the audio segment on a live broadcast interface of the viewer terminal.
In order to solve the technical problem, the other technical scheme adopted by the application is as follows: a processing method for live broadcast reading is provided, and the method comprises the following steps: receiving a text content segment corresponding to the audio segment sent by the server; the audio clip is determined after the server receives an audio stream uploaded by the anchor terminal in the process of live-broadcasting reading of the preset text content, the audio stream is matched with the preset text content, and the audio stream is matched with the text content; and at least displaying the text content segment corresponding to the audio segment on the live interface.
In order to solve the above technical problem, another technical solution adopted by the present application is: a processing method of live broadcast reading is provided, and the method comprises the following steps: displaying preset text content appointed for reading by a main broadcast in the live broadcast on a live broadcast interface; acquiring an audio stream generated by a main broadcast in the process of live-broadcasting and reading preset text content; and sending the audio stream to a server so that the server matches the audio stream with the preset text content to determine an audio clip matched with the preset text content, and at least sending the text content clip corresponding to the audio clip to a viewer terminal in a live broadcast room, so that the viewer terminal at least displays the text content clip corresponding to the audio clip on a live broadcast interface of the viewer terminal.
In order to solve the above technical problem, another technical solution adopted by the present application is: there is provided a computer device comprising: a processor, memory, and communication circuitry; the memory and the communication circuit are coupled to the processor, the memory stores a computer program, and the processor can execute the computer program to implement the processing method for live broadcast reading as provided in the above application.
In order to solve the technical scheme, the other technical scheme adopted by the application is as follows: there is provided a computer-readable storage medium storing a computer program executable by a processor to implement a processing method of live reading as provided in the above-mentioned application.
The beneficial effect of this application is: different from the prior art, because the anchor needs to upload the live audio stream or video stream to the server in the live broadcasting process, the received audio stream or video stream is sent to the viewer terminal corresponding to the viewer through the server, so that the viewer can watch the live content of the anchor, in the live broadcasting reading process of the anchor, the server can receive the audio stream uploaded by the anchor terminal of the anchor in the live broadcasting reading process of the preset text content, the audio stream is matched with the preset text content to determine the audio segment matched with the preset text content, at least the text content segment corresponding to the audio segment is sent to the viewer terminal in the live broadcasting room, so that the viewer terminal at least displays the text content segment corresponding to the audio segment on the live broadcasting interface of the viewer terminal, so that the user can listen to the anchor in the live broadcasting room to read the preset text content, at least the text content segment matched with the content read by the anchor can be simultaneously watched on the display interface, the live broadcasting reading process of the anchor can be more synchronously with the anchor, further the interaction function in the live broadcasting reading process can be enriched, and the live broadcasting reading experience in the live broadcasting process is favorable for improving the live broadcasting reading experience of the live broadcasting process of the live broadcasting.
Drawings
Fig. 1 is a schematic system composition diagram of an embodiment of a live broadcast system of the present application;
FIG. 2 is a flowchart illustrating a first embodiment of a processing method for live reading according to the present application;
FIG. 3 is a timing diagram illustrating a first embodiment of a processing method for live reading according to the present application;
fig. 4 is a first schematic view of a live interface of a viewer terminal in the processing method for live reading of the present application;
FIG. 5 is a process of live reading of the present application a flow diagram of a second embodiment of the method;
FIG. 6 is a second schematic view of a live interface of a viewer's terminal in the processing method for live reading of the present application;
FIG. 7 is a flowchart illustrating a processing method for live reading according to a third embodiment of the present application;
FIG. 8 is a schematic circuit diagram of an embodiment of a computer apparatus of the present application;
FIG. 9 is a schematic circuit diagram of an embodiment of a computer-readable storage medium according to the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
With the rapid development of the live broadcast industry, live broadcast becomes an important way for people to entertain through the internet. In the live broadcasting process, the anchor performs at the anchor terminal, and a user can watch the performance of the anchor at an audience terminal and can interact with the anchor through the audience terminal. The live view of the anchor may be presented in a personal live room. In order to enrich the live broadcast form, a live broadcast reading channel is arranged, in a live broadcast room of the live broadcast reading channel, a main broadcast can select books to read, audiences in the live broadcast room can watch and listen to the contents read by the main broadcast, and in the live broadcast process, the main broadcast and the audiences can also interact through public screens and other forms.
The inventor finds that, in the process of live broadcast reading of the anchor, the anchor simply reads the preset text content and interacts with the user, and the user can only obtain the content read by the anchor by listening to the audio sent by the anchor in the client terminal, so that the situation that the actual reading content of the anchor cannot be accurately obtained may exist, and meanwhile, the user is not beneficial to being intensively attentively listened to, so that the interactive function in the process of live broadcast reading is lacked, and the user is quickly interested in the reading of the anchor and quits the live broadcast room. And when the user misses a part of content read by the anchor for some reasons, the missed content cannot be checked, which is not beneficial to improving the experience of the user in the live broadcast watching process. In addition, in the process of live broadcast reading of the anchor, the anchor may interact with other users in the live broadcast room, such as answering user questions and the like, so that the reading of the anchor is intended, and the experience of the user in the live broadcast watching process is also reduced. To solve the above technical problems, the present application proposes the following embodiments.
As shown in fig. 1, a live system 1 described in the present embodiment of the live system may include a server 10, an anchor terminal 20, and a viewer terminal 30. The anchor terminal 20 and the viewer terminal 30 may be electronic terminals, and specifically, the anchor terminal 20 and the viewer terminal 30 are electronic terminals installed with corresponding client programs, that is, client terminals. The electronic terminal can be a mobile terminal, a computer, a server or other terminals, the mobile terminal can be a mobile phone, a notebook computer, a tablet computer, an intelligent wearable device and the like, and the computer can be a desktop computer and the like.
The server 10 may pull the live data stream from the anchor terminal 20 and may push the obtained live data stream to the viewer terminal 30 after performing corresponding processing. The viewer terminal 30 can view the live broadcast process of the anchor or guest after acquiring the live broadcast data stream. The mixing of the live data streams may occur at least one of the server 10, the anchor terminal 20 and the viewer terminal 30. Video or voice connections may be made between the anchor terminal 20 and the anchor terminal 20, and between the anchor terminal 20 and the viewer terminals 30. In the video microphone, the microphone connecting party may push the live data stream including the video stream to the server 10, and further push the corresponding live data to the corresponding microphone connecting party and the viewer terminal 30. The anchor terminal 20 and the viewer terminal 30 can display to the respective live pictures in the live room. Specifically, the server 10 may be, for example, a server cluster, and may be used not only to collect and push a live data stream, but also to process a service request and related matters, such as storing and processing data related to a service generated in a live broadcast process, for example, to process virtual gift presentation, virtual coin charging and consumption, public screen information transceiving, authentication and authentication, microphone attachment, and automatic identification of sensitive words/pictures.
Of course, the anchor terminal 20 and the viewer terminal 30 are relative, and the terminal in the live broadcasting process is the anchor terminal 20, and the terminal in the live broadcasting watching process is the viewer terminal 30.
The server 10 may be preset with a live reading template corresponding to a live reading channel, and before the live reading channel is played, the anchor may select the live reading template to play through the anchor terminal 20, so that the server 10 may respond to a selection operation of the anchor selecting the live reading template at the anchor terminal 20, and send a configuration related to the live reading template to the anchor terminal 20. Specifically, the configuration related to the live reading template may be a configuration corresponding to the processing method described in the first embodiment of the processing method for live reading in the present application.
As shown in fig. 2, the first embodiment of the processing method for live reading of the present application may use the server 10 as an execution subject. The present embodiment may include the following steps: s100: and receiving an audio stream uploaded by the anchor terminal of the anchor in the process of live broadcast reading of the preset text content. S200: and matching the audio stream with the preset text content to determine the audio segment matched with the preset text content. S300: and at least sending the text content segment corresponding to the audio segment to a viewer terminal in a live broadcast room, so that the viewer terminal at least displays the text content segment corresponding to the audio segment on a live broadcast interface of the viewer terminal.
The method comprises the steps that a main broadcast uploads live audio streams or video streams to a server in a live broadcast process, the received audio streams or video streams are sent to audience terminals corresponding to audiences through the server, so that the audiences can watch live broadcast contents of the main broadcast, in the live broadcast reading process of the main broadcast, the server can receive the audio streams uploaded by the main broadcast terminals of the main broadcast in the live broadcast reading process of preset text contents, the audio streams are matched with the preset text contents to determine audio segments matched with the preset text contents, at least text content segments corresponding to the audio segments are sent to the audience terminals in a live broadcast room, so that the audience terminals at least display the text content segments corresponding to the audio segments on a live broadcast interface, the user can listen to the main broadcast in the live broadcast room to read the preset text contents, at least the text content segments matched with the contents read by the main broadcast can be watched on the display interface at the same time, the user can watch the text content segments matched with the contents read by the main broadcast more synchronously in the live broadcast reading process of the main broadcast, further enrich interaction in the live broadcast reading process, and be beneficial to improving the viewing sense of the main broadcast in the live broadcast reading process.
The method described in this embodiment can be applied to a scene in which a live broadcast is read in a live broadcast room, and as shown in fig. 3, the following describes the embodiment in detail with the server 10 as an execution subject.
S100: and receiving an audio stream uploaded by the anchor terminal of the anchor in the process of live-broadcasting and reading the preset text content.
The preset text content may be a text content designated to be read by the anchor in the live broadcast reading. The audio stream may be sound data generated by the anchor terminal 20 during live broadcast reading of the preset text content, and during live broadcast reading of the anchor, the audio stream may be uploaded to the server 10 through the anchor terminal 20, and the server 10 may receive the audio stream uploaded by the anchor terminal 20 and then send the audio stream to the audience terminals 30 corresponding to all users in the current live broadcast room, so that the users can listen to the audio stream uploaded by the anchor through the audience terminals 30.
In one implementation, S100 may include the following steps before:
s110: and receiving the related information of the text content which is appointed to be read by the anchor broadcast in the live broadcast and uploaded by the anchor broadcast terminal.
The related information may be related information of a book, and the anchor may input, in the anchor terminal 20 before the anchor is started, related information of text content that the anchor specifies reading this time, that is, if the anchor is to read a book this time, the anchor may input, in the anchor terminal 20 before the anchor is started, related information of the book, such as a title of the book, an author, and the like, the anchor terminal 20 may upload the received related information of the book to the server 10, and the server 10 may receive the related information of the text content uploaded by the anchor terminal 20.
S120: and acquiring preset text content based on the related information, and sending the preview content of the preset text content to the anchor terminal for confirmation.
After receiving the related information uploaded by the anchor terminal, the server 10 may obtain the preset text content based on the related information, and may send the preview content of the preset text content to the anchor for confirmation, so that the server 10 may obtain the correct preset text content corresponding to the related information uploaded by the anchor, and may start live reading of the preset text content.
In one implementation, for how to obtain the preset text content based on the related information, reference may be made to the following steps included in S120:
s121: and acquiring a chapter catalog of the corresponding book from the network based on the relevant information, and sending the chapter catalog to the anchor terminal so as to enable the anchor to select chapters in the chapter catalog.
The server 10, upon receiving the related information uploaded by the anchor terminal 20, may obtain a chapter list of the corresponding book from the network based on the related information. Specifically, the server 10 may obtain the chapter directory corresponding to the book corresponding to the related information from the browser, the book website, or the novel website. After acquiring the chapter list of the corresponding book, the server 10 may send the chapter list to the corresponding anchor terminal 20, so that the anchor selects a chapter to be read from the chapter list at the anchor terminal 20. For example, if the anchor nail uploads the related information including the name of a book to the corresponding anchor terminal 20, the server 10 acquires the chapter list of the book a from a certain novel website for 5 chapters, and the server 10 may send the chapter list of the book to the anchor terminal 20 corresponding to the anchor nail, so that the anchor nail can select a chapter to be read from the received chapter list.
S122: and responding to the selection operation of the anchor terminal for selecting at least one chapter in the chapter catalog, and acquiring the text content corresponding to the at least one chapter from the network as preset text content.
The preset text content may be a text content to be read by the anchor in the live reading, that is, after the anchor terminal 20 receives the chapter list of the book corresponding to the related information acquired by the server 10, the anchor may select at least one chapter as a content to be read. Specifically, after the anchor terminal 20 performs a selection operation of selecting at least one chapter in the chapter list, the server 10 may obtain, as the preset text content, the text content corresponding to the at least one chapter from the network in response to the selection operation of the anchor terminal 20, where the obtained text content corresponding to the at least one chapter corresponds to the selection operation of the anchor terminal 20 for selecting the at least one chapter. For example, after the anchor terminal 20 receives the list of 5 chapters of the book a acquired by the server 10, the anchor a selects the first chapter and the second chapter through the anchor terminal 20, and the server 10 may acquire the text contents of the first chapter and the second chapter of the book a from the network as the preset text contents to be read by the anchor a in the live broadcast.
In one implementation, for how to send the preview content of the preset text content to the anchor terminal for confirmation, reference may be made to the following steps included in S120:
s123: and sending an acquisition success notification of the preview content carrying the preset text content to the anchor terminal, so that the anchor terminal can confirm the text content corresponding to at least one chapter based on the acquisition success notification.
After acquiring the preset text content to be read by the anchor live broadcast, the server 10 may send an acquisition success notification to the anchor terminal 20, so that the anchor terminal 20 can confirm the acquired content based on the acquisition success notification. Specifically, the acquisition success notification may carry the acquired preview content of the preset text content, and after receiving the acquisition notification, the anchor terminal 20 may confirm the preview content of the preset text content carried in the acquisition notification, that is, confirm the text content corresponding to at least one chapter acquired by the server 10 based on the selection operation of the anchor terminal 20 selecting at least one chapter in the chapter list.
In one implementation, the following steps may be included after S120:
s124 the method comprises the following steps: and in response to the confirmation operation that the preview content of the preset text content is confirmed to be wrong by the anchor terminal, re-executing the text content corresponding to at least one chapter acquired from the network.
If the anchor performs a confirmation operation of confirming that the preview content of the preset text content is incorrect at the anchor terminal 20, the server 10 may re-perform, in response to the confirmation operation, acquiring the text content corresponding to at least one chapter from the network to obtain the correct text content.
In one implementation, after S124, the following steps may be included:
s125: and if the text content corresponding to the at least one chapter cannot be obtained, sending acquisition failure information to the anchor terminal to inform the anchor terminal of uploading the text content corresponding to the at least one chapter.
If the server 10 does not acquire the text content corresponding to at least one chapter corresponding to the selection operation of the anchor terminal 20 for selecting at least one chapter in the chapter list from the network, the server 10 may send acquisition failure information to the anchor terminal 20 to notify the anchor terminal 20 of uploading the text content corresponding to at least one chapter. Specifically, when the server 10 does not acquire the text content of the corresponding at least one chapter, the acquisition failure information may be sent to the anchor terminal 20, and the anchor terminal 20 may upload the text content corresponding to the at least one chapter to the server 10 based on the acquisition failure information. In the process of uploading the text content corresponding to at least one chapter by the anchor terminal 20, the anchor terminal 20 may locally acquire the text content corresponding to at least one chapter, or may intercept the text content corresponding to at least one chapter, and then upload the acquired text content corresponding to at least one chapter to the server 10 in chapters, so that the server 10 can receive the text content corresponding to at least one chapter to serve as the preset text content. Specifically, after receiving the text content corresponding to at least one chapter uploaded by the anchor terminal 20, the server 10 may check the received text content corresponding to at least one chapter. After the review is passed, the server 10 may display the received text content corresponding to at least one chapter as preview content on the anchor terminal 20 corresponding to the anchor.
S130: and responding to the confirmation operation that the preview content of the preset text content is confirmed to be correct by the anchor terminal, and sending the preset text content to the anchor terminal.
If the anchor performs a confirmation operation at the anchor terminal 20 to confirm that the preview content of the preset text content is correct, the server 10 may transmit the preset text content to the anchor terminal 20 in response to the confirmation operation, so that the anchor terminal 20 can display the preset text content, and the anchor can view and read the preset text content through the anchor terminal 20 for live broadcasting.
S200: and matching the audio stream with the preset text content to determine the audio segment matched with the preset text content.
An audio clip may refer to a clip in an audio stream that matches a preset text content. When the anchor is played, the server 10 may match the audio stream uploaded by the anchor terminal with the preset text content acquired by the server 10, and determine a segment in the audio stream that matches the preset text content as an audio segment.
In one implementation, for how to match the audio stream with the preset text content to determine the audio segment matching with the preset text content, reference may be made to the following steps included in S200:
s210: and converting the audio stream into characters through a preset character recognition model.
The preset character recognition model may be a model preset in the server 10 for converting the received audio stream into characters. The server 10, after receiving the audio stream uploaded by the anchor terminal 20, may input the audio stream into a preset character recognition model, convert the audio stream into characters through the preset character recognition model, and output the converted characters.
S220: and performing character matching on the converted characters and the preset text content, and calculating the character matching degree of the characters and the preset text content.
The server 10 may perform character matching on the converted characters output by the preset character recognition model and the preset text content. Specifically, in the process of performing character matching, matching may be performed by calculating a character matching degree of the converted characters and the preset text content.
S230: and judging whether the character matching degree is greater than or equal to a first preset threshold value or not.
The first preset threshold may be a word matching degree threshold preset in the server 10 and used for determining whether the converted words are matched with the preset text content. And judging whether the characters obtained by conversion are matched with the preset text content or not by judging whether the calculated character matching degree of the characters obtained by conversion and the preset text content is greater than or equal to a first preset threshold value or not, and further determining the audio frequency segment matched with the preset text content.
S240: and if the character matching degree is greater than or equal to a first preset threshold value, determining the audio frequency segment with the character matching degree greater than or equal to the first preset threshold value.
If the matching degree of the converted characters with the preset text content is greater than or equal to a first preset threshold, the audio segment with the character matching degree greater than or equal to the first preset threshold can be determined as the audio segment to be determined, that is, the segment in the audio stream corresponding to the converted characters is the audio segment matched with the preset text content.
Specifically, in the process of determining whether the character matching degree is greater than or equal to the first preset threshold, the determination may be performed sentence by sentence. That is to say, each sentence in the converted characters is respectively matched with the preset text content, and if the matching degree of the characters after the sentence conversion and the preset text content is greater than or equal to the first preset threshold, it can be determined that the audio corresponding to the sentence is an audio segment matched with the preset text content. By judging each sentence in the characters obtained by conversion, a plurality of matched audio clips can be obtained, and the complete audio clip can be obtained by splicing the plurality of matched audio clips.
For example, if the character matching degree of a certain sentence of the converted characters and a corresponding sentence of the preset text content is greater than the first preset threshold N, the sentence of the converted characters may be considered to be matched with the sentence of the preset text content, that is, the sentence of the converted characters and the sentence of the preset text content are the same sentence, so that it may be determined that the audio corresponding to the sentence of the converted characters is an audio segment matched with the preset text content.
In one implementation, after S230, the following steps may be included:
s250: and if the character matching degree is smaller than a first preset threshold value, performing semantic analysis on the converted characters through a preset neural network, performing semantic matching on the characters and preset text contents, and calculating the semantic matching degree of the characters and the preset text contents.
In the live broadcasting process, the situation that the content which the anchor wants to read is the same as the preset text content, but the content actually read by the anchor may be different from the preset text content, the character matching degree of the characters obtained by the conversion of the preset character recognition model and the preset text content is separately judged, and the situation that the judgment is inaccurate may occur. Therefore, in this embodiment, if the matching degree between the converted text and the preset text content is smaller than the first preset threshold, the semantic analysis may be performed on the converted text through the preset neural network, the semantic matching may be performed on the converted text and the preset text content, and the semantic matching degree between the text and the preset text content may be calculated. The preset neural network may be a network preset in the server 10 and used for semantic analysis, specifically, the preset neural network may be a neural network NLP model, characters obtained through conversion by the preset character recognition model may be input into the preset neural network, the converted characters may be subjected to semantic matching with preset text contents by the preset neural network, and the semantic matching degree between the converted characters and the preset text contents may be calculated and output by the preset neural network.
S260: and judging whether the semantic matching degree is greater than or equal to a second preset threshold value.
The second preset threshold may be a semantic matching degree threshold preset in the server 10 and used for determining whether the semantics of the converted characters and the preset text content are matched. The server 10 may compare the semantic matching degree of the converted characters output by the preset neural network and the preset text content with a second preset threshold, determine whether the semantics of the converted characters and the preset text content are matched, and further determine the audio segment matched with the preset text content.
S270: and if the semantic matching degree of the converted characters and the preset text content is greater than or equal to a second preset threshold, determining the audio clip with the semantic matching degree greater than or equal to the second preset threshold.
If the semantic matching degree of the converted characters and the preset text content is greater than or equal to a second preset threshold, the audio segment with the semantic matching degree greater than or equal to the second preset threshold can be determined as the audio segment to be determined, that is, the segment in the audio stream corresponding to the converted characters is the audio segment matched with the preset text content.
Specifically, in the process of determining whether the semantic matching degree is greater than or equal to the second preset threshold, the determination may also be performed sentence by sentence. That is to say, each sentence in the converted characters is respectively matched with the preset text content, and if the semantic matching degree of the converted characters of the sentence with the preset text content is greater than or equal to a second preset threshold, it can be determined that the audio corresponding to the sentence is an audio segment matched with the preset text content. By judging each sentence in the characters obtained by conversion, a plurality of matched audio clips can be obtained, and the complete audio clip can be obtained by splicing the plurality of matched audio clips.
For example, if the semantic matching degree between a certain sentence of the converted characters and a corresponding sentence of the preset text content is greater than the second preset threshold M, the sentence of the converted characters may be considered to be matched with the sentence of the preset text content, that is, the sentence of the converted characters and the sentence of the preset text content are semantically or actually the same sentence, so that the audio corresponding to the sentence of the converted characters may be determined to be an audio segment matched with the preset text content.
If the semantic matching degree of the converted characters and the preset text content is smaller than a second preset threshold, the converted characters and the preset text content are considered to be not matched, and the content spoken by the anchor at the moment can be considered not to be reading the preset text content, that is, the audio corresponding to the converted characters can be spoken when the anchor interacts with the audience at the moment.
In one implementation, after S270, the following steps may be included:
s281: and acquiring the audio clip and storing the audio clip.
After determining the audio segment matched with the preset text content, the server 10 may obtain the audio segment, and store the audio segment in the server 10, so that the server 10 can only keep recorded and broadcast audio for reading the preset text content in the anchor live broadcast process based on the stored audio segment, so that viewers can play the recorded and broadcast audio to listen to the audio only including the anchor read preset text content, and do not need to spend a long time to listen to the audio unrelated to reading the preset text content.
S282: and after the anchor reads the text content, synthesizing all audio segments acquired in the process of reading the preset text content by the anchor according to the time sequence to obtain recorded and broadcast audio corresponding to the preset text content.
After the anchor reads the text content, or after the anchor downloads, the server 10 may splice and synthesize all audio segments obtained when the anchor reads the preset text content in the live broadcast process according to a time sequence, so as to obtain recorded and broadcast audio corresponding to the preset text.
S283: and responding to the playing operation of the audience on the corresponding audience terminal aiming at the playing button in the page of the preset text content, and sending the recorded and broadcast audio to the audience terminal so as to synchronously play the recorded and broadcast audio while displaying the preset text content on the live interface of the audience terminal.
As shown in fig. 4, since the audience can watch the preset text content in the audience terminal, in order to enable the audience to listen to the sound of the anchor reading while watching the preset text content, the server 10 can respond to the playing operation of the audience on the corresponding audience terminal 30 for the playing button 5 in the page of the preset text content, and send the recorded and broadcast audio to the audience terminal 30, so that the recorded and broadcast audio can be synchronously played while showing the preset text content in the live interface 2 of the audience terminal 30, and the experience of the audience in the process of watching the preset text content is improved.
In one implementation, S200 may further include the following steps:
s291: and matching the audio stream with the preset text content to determine an audio segment matched with the preset text content and a second audio segment not matched with the preset text content.
The second audio clip may be an audio clip corresponding to a text whose text content is not matched with the text content in terms of text and semantics, where the text is obtained by converting the audio stream through the preset text recognition model. After receiving the audio stream uploaded by the anchor terminal 20, the server 10 may match the audio stream with the preset text content, so as to determine an audio segment matched with the preset text content, and a second audio segment not matched with the preset text content, so as to better distinguish the audio of the anchor in the live broadcast reading process, actually read the audio of the preset text content and the audio of the non-reading preset text content, and further, in the process of synthesizing the recorded broadcast audio, the second audio segment may be excluded, so as to obtain the recorded broadcast audio only including reading the preset text content.
S292: and when the second audio clip is determined, sending reading timing information to the anchor terminal so as to display the read time length corresponding to all matched audio clips and the interruption time length corresponding to the current second audio clip on a live interface of the anchor terminal.
The reading timing information may be for live broadcasting to the anchor and timing information when various time is consumed in the reading process. When the second audio clip is determined, the server 10 may send reading timing information to the anchor terminal 20, so as to display the read durations corresponding to all matched audio clips and the interruption duration corresponding to the current second clip on a live interface of the anchor terminal 20. Specifically, after receiving the reading timing information sent by the server 10, the anchor terminal 20 may stop timing by a first timer used by the anchor terminal 20 to count the read time length and start timing by a second timer used by the anchor terminal 20 to count the interruption time length, and when the server 10 determines that the anchor reads an audio clip of the preset text content, the first timer in the anchor terminal 20 may start to continue timing, and the second timer may reset to restart timing. The effective reading duration of reading the preset text content in the live broadcast reading process of the anchor can be counted through the first timer, the duration of reading the preset text content at the anchor terminal can be counted through the second timer, and the anchor can be prevented from stopping reading for a long time due to neglecting the duration, so that poor watching experience is caused to audiences. The first timer and the second timer in the anchor terminal 20 are triggered to time by reading the timing information, so that the anchor can better grasp the rhythm in the live broadcast room, and better watching experience is brought to audiences. After the anchor finishes live reading or downloading, the server 10 may count the total duration of the anchor in the process of reading this live, the duration of reading the preset text content, and the total duration data for interacting with the audience based on the first timer and the second timer, and send the counted data to the anchor terminal 20 after the anchor finishes live reading or downloading, so that the received duration data can be displayed on the live interface of the anchor terminal 20, so that the anchor can clearly know the time consumption in the process of live, and further the anchor can improve the arrangement and grasp of the live time, and better viewing experience is provided for the audience.
S300: and at least sending the preset text content segment corresponding to the audio segment to a spectator terminal in a live broadcast room, so that the spectator terminal at least displays the preset text content segment corresponding to the audio segment on a live broadcast interface of the spectator terminal.
The preset text content segment may be a segment in the preset text content corresponding to the audio segment. After determining the audio frequency band matching with the preset text content, the server 10 may send at least the preset text content segment corresponding to the audio segment to the audience terminal 30 in the live broadcast room, so that the audience terminal 30 may display the preset text content segment corresponding to the audio segment on its live broadcast interface. Since a second audio clip that does not read the preset text content may exist in the process of determining the audio clip, and the audio clip may not be a complete clip for reading the preset text content, in the process of sending the preset text content clip corresponding to the audio clip to the audience terminal 30, the preset text content clip corresponding to the currently identified audio clip may be sent, so that the preset text content clip corresponding to the determined audio clip can be displayed on the live interface 2 of the audience terminal 30.
In one implementation, S300 may include the steps of:
s310: and responding to the viewing operation of the audience on the corresponding audience terminal aiming at the displayed preset text content segment by the audience, and sending at least part of the preset text content segment which is read by the anchor in the preset text content to the audience terminal corresponding to the audience so that the audience terminal can display at least part of the preset text content segment which is read by the anchor.
In the process of live broadcast reading of the anchor broadcast, a preset text content segment can be displayed on the live broadcast interface 2 of the audience terminal 30 corresponding to an audience in the live broadcast room, and specifically, a display column 3 in the live broadcast interface 2 of the audience terminal 30 can display one sentence in the preset text content currently read by the anchor broadcast and a previous sentence in the preset text content currently read by the anchor broadcast. The viewer may view at least a part of the preset content segments that have been read by the anchor in the preset text content by clicking on the preset text content segments displayed in the display bar 3.
When the audience clicks the preset text content segment displayed in the display column 3 of the audience terminal 30, the server 10 may respond to the viewing operation performed by the audience at the corresponding audience terminal for the displayed preset text content segment, and may send the preset text content segment that has been already read by the anchor in the chapter where the preset text content segment that is currently read by the anchor is located to the audience terminal 30 corresponding to the audience, so that the audience can view the preset text content segment that has been already read by the anchor in the chapter corresponding to the preset text content segment that is currently read by the anchor in the preset popup window 4 in the live interface 2 of the audience terminal 30, and thus the audience can view the missed part in the live viewing process, which is beneficial to improving the interactive function in the live viewing process, and thus improving the convenience of the user in the using process.
In one implementation, as to how to transmit at least the preset text content segment corresponding to the audio segment to the viewer terminal 30 in the live broadcast room, reference may be made to the following steps included in S310:
s311: and sending the preset text content segment corresponding to the audio segment and the preset text content segment corresponding to at least the previous audio segment to the audience terminal so that the audience terminal displays the content corresponding to the audio segment and part of the preset text content corresponding to at least the previous audio segment on a live broadcast interface of the audience terminal.
In the process of sending at least the preset text content segment corresponding to the audio segment to the viewer terminal 30 in the live broadcast room, the preset text content segment corresponding to the audio segment and the preset text content segment corresponding to at least the previous audio segment may be sent to the viewer terminal 30, so that the viewer terminal 30 displays the content corresponding to the audio segment and the part of the preset text content corresponding to at least the previous audio segment on the display bar 3 in the live broadcast interface of the viewer terminal 30. Specifically, one sentence in the preset text content currently read by the anchor and the previous sentence in the preset text content currently read by the anchor may be transmitted to the viewer terminal 30, so that the one sentence in the preset text content currently read by the anchor and the previous sentence in the preset text content currently read by the anchor are displayed in the display column 3 of the viewer terminal 30.
As shown in fig. 5, the second embodiment of the processing method for live reading of the present application may use the viewer terminal 30 as an execution subject. The present embodiment may include the following steps:
s400: and receiving a preset text content segment corresponding to the audio segment sent by the server.
The audio clip is determined by the server 10 after receiving the audio stream uploaded by the anchor terminal 20 in the process of live reading the preset text content, matching the audio stream with the preset text content, and matching the audio stream with the preset text content. For how to determine the audio clip, reference may be made to the description of S200 in the first embodiment of the processing method for live broadcast reading in the present application, and details are not repeated here.
S500: and at least displaying a preset text content segment corresponding to the audio segment on the live broadcast interface.
After receiving the preset text content segment sent by the server 10, the viewer terminal 30 may display at least the preset text content segment corresponding to the audio segment on the live interface 2 of the viewer terminal 30, so that the viewer can view the preset text content segment corresponding to the audio segment currently read by the anchor at the viewer terminal 30. Specifically, when the viewer clicks the preset text content segment displayed in the display bar 3 broadcasted on the live broadcast interface 2 of the viewer terminal 30, the preset text content segment that has been read by the anchor of the chapter corresponding to the currently displayed content may be displayed in a pop-up window form on the live broadcast interface. Through showing the preset text content segment which is read by the anchor of the chapter corresponding to the currently displayed content, the audience can more clearly know the content which is missed in the watching and listening process, and the experience of the audience in the live broadcasting process is favorably improved.
Specifically, as shown in fig. 4 and fig. 6, fig. 4 is the live interface 2 before the viewer clicks the show button 6, and fig. 6 is the live interface 2 after the viewer clicks the show button 6. The live interface 2 of the viewer terminal 30 may pop up a preset popup window 4 when the viewer clicks the preset text content segment displayed in the display bar 3, so as to display the read preset text content segment of the anchor of the chapter corresponding to the currently displayed content. Specifically, a display button 6 for displaying all chapter directories of the book selected by the anchor broadcast in the live broadcast reading process can be arranged on the preset popup window 4, when the audience clicks the display button 6 at the audience terminal 30, all chapter directories of the book selected by the anchor broadcast in the live broadcast reading process can be displayed on the preset popup window 4 in a list form, and when the audience selects a certain chapter in the chapter directories, text content corresponding to the chapter can be displayed in a skip mode on a live broadcast interface.
While the preset text content segment that has been read by the anchor of the chapter corresponding to the currently displayed content is displayed on the live interface 2 of the audience terminal 30, a play button 5 may be provided on the live interface 2 of the audience terminal 30, for playing the recorded audio of the anchor reading the current chapter in the live reading process. For the play button 5 and the recorded and played audio, reference may be made to the description related to the first embodiment of the processing method for live broadcast reading in the present application, and details are not described herein again.
When all chapter directories of the book selected by the anchor in the current live broadcast reading are displayed on the preset popup window 4 in a list form, a corresponding play button 5 can be set for each chapter in the chapter directories, and if the anchor reads the chapter in the current or previous live broadcast reading, the play button 5 of the chapter can be displayed, so that the audience can play recorded and broadcast audio of the anchor corresponding to the chapter by clicking the play button 5.
As shown in fig. 7, the third embodiment of the processing method for live reading in the present application may use the anchor terminal 20 as an execution main body. The present embodiment may include the steps of:
s600: and displaying the preset text content appointed for reading by the main broadcast on the live broadcast interface.
The anchor terminal 20 may display the preset text content that the anchor designates reading in the live broadcast on the live broadcast interface, and for how to anchor the preset text content that the anchor designates reading in the live broadcast on the live broadcast interface of the anchor terminal 20, reference may be made to the relevant description in the first embodiment of the processing method for live broadcast reading in the present application, which is not described herein again.
S700: and acquiring an audio stream generated by the anchor in the process of live broadcast reading of the preset text content.
The anchor terminal 20 may collect the sound of the anchor during the live broadcast reading process of the anchor, so as to form an audio stream generated during the live broadcast reading process of the anchor for the preset text content.
S800: and sending the audio stream to a server so that the server matches the audio stream with the preset text content to determine an audio clip matched with the preset text content, and at least sending the preset text content clip corresponding to the audio clip to a viewer terminal in a live broadcast room, so that the viewer terminal at least displays the preset text content clip corresponding to the audio clip on a live broadcast interface of the viewer terminal.
After acquiring the audio stream, the anchor terminal 20 may send the audio stream to the server 10, so that the server 10 can match the audio stream with the preset text content to determine an audio segment matching the preset text content. For how to determine the audio segment matched with the preset text content, reference may be made to the related description in the first embodiment of the processing method for live broadcast reading in the present application, and details are not described here again.
Meanwhile, the anchor terminal 20 can at least send the preset text content segment corresponding to the audio segment to the audience terminal 30 in the live broadcast room, so that the audience terminal 30 at least displays the preset text content segment corresponding to the audio segment on a live broadcast interface, and audiences can synchronously watch the preset text content segment currently read by the anchor in the process of listening to the anchor and reading the preset text content segment, thereby being beneficial to improving the interactive function in the live broadcast reading process, and further enabling the audiences to more accurately know the content currently read by the anchor in the listening process. As to how to at least send the preset text content segment corresponding to the audio segment to the audience terminal 30 in the live broadcast room, reference may be made to the related description in the first embodiment of the processing method for live broadcast reading in the present application, and details are not described here again.
As shown in fig. 8, the computer device 100 according to the embodiment of the computer device of the present application may be the server 10, the anchor terminal 20 or the viewer terminal 30, and the computer device 100 includes a processor 110, a memory 120 and a communication circuit. Memory 120 and communication circuitry are coupled to processor 110.
The Memory 120 is used for storing computer programs, and may be a RAM (Read-Only Memory), a ROM (Random Access Memory), or other types of storage devices. In particular, the memory may include one or more computer-readable storage media, which may be non-transitory. The memory may also include high speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash memory storage devices. In some embodiments, a non-transitory computer readable storage medium in a memory is used to store at least one program code.
The processor 110 is used for controlling the operation of the computer device 100, and the processor 110 may also be referred to as a Central Processing Unit (CPU). The processor 110 may be an integrated circuit chip having signal processing capabilities. The processor 110 may also be a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components. A general purpose processor may be a microprocessor or the processor 110 may be any conventional processor or the like.
The processor 110 is configured to execute a computer program stored in the memory 120 to implement the processing method of live broadcast reading described in the embodiment of the processing method of live broadcast reading of the present application.
In some embodiments, the computer device 100 may further comprise: a peripheral interface 130 and at least one peripheral. The processor 110, memory 120, and peripheral interface 130 may be connected by buses or signal lines. Various peripheral devices may be connected to peripheral interface 130 via a bus, signal line, or circuit board. Specifically, the peripheral device includes: at least one of radio frequency circuitry 140, display 150, audio circuitry 160, and power supply 170.
The peripheral interface 130 may be used to connect at least one peripheral related to I/O (Input/output) to the processor 110 and the memory 120. In some embodiments, processor 110, memory 120, and peripheral interface 130 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 110, the memory 120, and the peripheral interface 130 may be implemented on a separate chip or circuit board, which is not limited in this embodiment.
The Radio Frequency circuit 140 is used to receive and transmit RF (Radio Frequency) signals, also called electromagnetic signals. The radio frequency circuit 140 communicates with a communication network and other communication devices via electromagnetic signals. The rf circuit 140 converts an electrical signal into an electromagnetic signal to transmit, or converts a received electromagnetic signal into an electrical signal. Optionally, the radio frequency circuit 140 comprises: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and so forth. The radio frequency circuit 140 may communicate with other terminals via at least one wireless communication protocol. The wireless communication protocols include, but are not limited to: the world wide web, metropolitan area networks, intranets, various generations of mobile communication networks (2G, 3G, 4G, and 5G), wireless local area networks, and/or WiFi (Wireless Fidelity) networks. In some embodiments, the rf circuit 140 may further include NFC (Near Field Communication) related circuits, which are not limited in this application.
The display 150 is used to display a UI (User Interface). The UI may include graphics, text, icons, video, and any combination thereof. When the display screen 150 is a touch display screen, the display screen 150 also has the ability to capture touch signals on or over the surface of the display screen 150. The touch signal may be input to the processor 110 as a control signal for processing. At this point, the display 150 may also be used to provide virtual buttons and/or a virtual keyboard, also referred to as soft buttons and/or a soft keyboard. In some embodiments, the display 150 may be one, disposed on the front panel of the computer device 100; in other embodiments, the display screens 150 may be at least two, respectively disposed on different surfaces of the computer device 100 or in a folded design; in other embodiments, the display 150 may be a flexible display disposed on a curved surface or a folded surface of the computer device 100. Even further, the display 150 may be arranged in a non-rectangular irregular pattern, i.e., a shaped screen. The Display 150 may be made of LCD (Liquid Crystal Display), OLED (Organic Light-Emitting Diode), and the like.
Audio circuitry 160 may include a microphone and a speaker. The microphone is used for collecting sound waves of a user and the environment, converting the sound waves into electric signals, and inputting the electric signals to the processor 110 for processing or inputting the electric signals to the radio frequency circuit 140 to realize voice communication. For stereo sound acquisition or noise reduction purposes, the microphones may be multiple and disposed at different locations on the computer device 100. The microphone may also be an array microphone or an omni-directional pick-up microphone. The speaker is used to convert electrical signals from the processor 110 or the radio frequency circuit 140 into sound waves. The loudspeaker can be a traditional film loudspeaker or a piezoelectric ceramic loudspeaker. When the speaker is a piezoelectric ceramic speaker, the speaker can be used for purposes such as converting an electric signal into a sound wave audible to a human being, or converting an electric signal into a sound wave inaudible to a human being to measure a distance. In some embodiments, audio circuitry 160 may also include a headphone jack.
The power supply 170 is used to power the various components in the computer device 100. The power source 170 may be alternating current, direct current, disposable or rechargeable. When power source 170 comprises a rechargeable battery, the rechargeable battery may be a wired rechargeable battery or a wireless rechargeable battery. The wired rechargeable battery is a battery charged through a wired line, and the wireless rechargeable battery is a battery charged through a wireless coil. The rechargeable battery may also be used to support fast charge technology.
For detailed explanation of functions and execution processes of each functional module or component in the embodiment of the electronic terminal of the present application, reference may be made to the explanation in the embodiment of the live broadcast reading processing method of the present application, and details are not described here again.
In the several embodiments provided in the present application, it should be understood that the disclosed computer device 100 and recording and playing method can be implemented in other ways. For example, the embodiments of the computer device 100 described above are merely illustrative, and for example, a module or a unit may be divided into only one logical function, and may be implemented in other ways, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
Referring to fig. 9, the above-described integrated unit, if implemented in the form of a software functional unit and sold or used as a separate product, may be stored in a computer-readable storage medium 200. Based on such understanding, the technical solution of the present application may be substantially implemented or contributed by the prior art, or all or part of the technical solution may be embodied in a software product, which is stored in a storage medium and includes instructions/computer programs for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor (processor) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media such as a U disk, a portable hard disk, a read only memory, a random access memory, a magnetic disk or an optical disk, and electronic terminals such as a computer, a mobile phone, a notebook computer, a tablet computer, a camera, etc. having the storage medium.
The description of the execution process of the program data in the computer-readable storage medium may refer to the description of the embodiment of the live broadcast reading processing method in the present application, and is not repeated here.
The above description is only an example of the present application and is not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings, or which are directly or indirectly applied to other related technical fields, are intended to be included within the scope of the present application.

Claims (12)

1. A processing method for live broadcast reading is characterized by comprising the following steps:
preset for live reading of anchor terminal receiving anchor audio streams uploaded during the course of the text content;
matching the audio stream with the preset text content to determine an audio segment matched with the preset text content;
and at least sending the preset text content segment corresponding to the audio segment to a spectator terminal in a live broadcast room, so that the spectator terminal at least displays the preset text content segment corresponding to the audio segment on a live broadcast interface of the spectator terminal.
2. The method of claim 1,
after at least sending the preset text content segment corresponding to the audio segment to a viewer terminal in a live broadcast room, the method comprises the following steps:
responding to a viewing operation of a viewer on the viewer terminal corresponding to the viewer terminal aiming at the displayed preset text content segment, and sending at least part of the preset text content segment read by the anchor in the preset text content to the viewer terminal corresponding to the viewer, so that the viewer terminal can display at least part of the preset text content segment read by the anchor.
3. The method according to claim 1 or 2,
the sending at least the preset text content segment corresponding to the audio segment to the audience terminal in the live broadcast room comprises:
and sending the preset text content segment corresponding to the audio segment and the preset text content segment corresponding to at least the previous audio segment to the audience terminal so that the audience terminal displays the content corresponding to the audio segment and part of the preset text content corresponding to the at least previous audio segment on a live broadcast interface of the audience terminal.
4. The method of claim 1,
the matching the audio stream with the preset text content to determine the audio segment matched with the preset text content includes:
matching the audio stream with the preset text content to determine an audio clip matched with the preset text content and a second audio clip not matched with the preset text content;
and when the second audio clip is determined, sending reading timing information to the anchor terminal so as to display the read duration corresponding to all the matched audio clips and the current interruption duration corresponding to the second audio clip on a live interface of the anchor terminal.
5. The method of claim 4,
when the second audio clip is determined, sending reading timing information to the anchor terminal to display the read duration corresponding to each audio clip and the interruption duration corresponding to the second audio clip on a live interface of the anchor terminal, including:
when the second audio clip is determined, sending an interrupt mark to the anchor terminal, so that the anchor terminal can pause a preset first timer based on the interrupt mark and start a preset second timer; the preset first timer is used for calculating the read time length of the text content read by the anchor in the live broadcasting process, and the preset second timer is used for calculating the interruption time length of the text content read by the anchor in the live broadcasting process, which corresponds to the current second audio clip.
6. The method of claim 1,
after the matching the audio stream with the preset text content to determine the audio segment matching with the text content, the method includes:
acquiring the audio clip and storing the audio clip;
and after the anchor reads the text content, synthesizing all the audio segments acquired in the process of reading the preset text content by the anchor according to a time sequence to obtain recorded and broadcast audio corresponding to the preset text content.
7. The method of claim 1,
before the receiving an audio stream uploaded by a anchor terminal of an anchor in a live broadcasting process, the method includes:
receiving related information of text content which is uploaded by the anchor terminal and is appointed to be read by the anchor in the live broadcast;
acquiring the preset text content based on the related information, and sending the preview content of the preset text content to the anchor terminal for confirmation;
and responding to the confirmation operation that the main broadcasting terminal confirms that the preview content of the preset text content is correct, and sending the preset text content to the main broadcasting terminal.
8. The method of claim 7, wherein:
the sending the preview content of the preset text content to the anchor terminal for confirmation comprises:
and sending an acquisition success notification of the preview content carrying the preset text content to the anchor terminal, so that the anchor terminal can confirm the text content corresponding to at least one chapter based on the acquisition success notification.
9. A processing method for live broadcast reading is characterized by comprising the following steps:
receiving a preset text content segment corresponding to the audio segment sent by the server; the audio clip is determined by matching the audio stream with the preset text content after the audio stream uploaded by the server in the process of live broadcast reading of the preset text content by the anchor terminal is received by the server, and matching the audio stream with the preset text content;
and at least displaying a preset text content segment corresponding to the audio segment on a live broadcast interface.
10. A processing method for live broadcast reading is characterized by comprising the following steps:
displaying preset text contents appointed to be read by the anchor in the live broadcast on a live broadcast interface;
acquiring an audio stream generated by the anchor in the process of live-broadcasting and reading the preset text content;
and sending the audio stream to a server, so that the server matches the audio stream with the preset text content to determine an audio clip matched with the preset text content, and at least sending the preset text content clip corresponding to the audio clip to a viewer terminal in a live broadcast room, so that the viewer terminal at least displays the preset text content clip corresponding to the audio clip on a live broadcast interface of the viewer terminal.
11. A computer device, comprising: a processor, memory, and communication circuitry; the memory and the communication circuit are coupled to the processor, the memory stores a computer program, and the processor can execute the computer program to realize the processing method of live reading according to any one of claims 1-10.
12. A computer-readable storage medium, in which a computer program is stored, the computer program being executable by a processor to implement a method of processing live readings as claimed in any one of claims 1 to 10.
CN202210776573.8A 2022-06-30 2022-06-30 Processing method for live broadcast reading, computer equipment and storage medium Pending CN115150633A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210776573.8A CN115150633A (en) 2022-06-30 2022-06-30 Processing method for live broadcast reading, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210776573.8A CN115150633A (en) 2022-06-30 2022-06-30 Processing method for live broadcast reading, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115150633A true CN115150633A (en) 2022-10-04

Family

ID=83409953

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210776573.8A Pending CN115150633A (en) 2022-06-30 2022-06-30 Processing method for live broadcast reading, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115150633A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107357494A (en) * 2017-07-05 2017-11-17 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
US20190294630A1 (en) * 2018-03-23 2019-09-26 nedl.com, Inc. Real-time audio stream search and presentation system
CN110460872A (en) * 2019-09-05 2019-11-15 腾讯科技(深圳)有限公司 Information display method, device, equipment and the storage medium of net cast
CN112162680A (en) * 2020-09-24 2021-01-01 掌阅科技股份有限公司 Correlation method of reading service and live broadcast service, computing device and storage medium
CN112423089A (en) * 2020-11-19 2021-02-26 维沃移动通信有限公司 Live broadcasting method and device
CN112740327A (en) * 2018-08-27 2021-04-30 谷歌有限责任公司 Algorithmic determination of story reader reading interruption
CN113096635A (en) * 2021-03-31 2021-07-09 北京字节跳动网络技术有限公司 Audio and text synchronization method, device, equipment and medium
CN113852835A (en) * 2021-09-22 2021-12-28 北京百度网讯科技有限公司 Live broadcast audio processing method and device, electronic equipment and storage medium

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107357494A (en) * 2017-07-05 2017-11-17 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
US20190294630A1 (en) * 2018-03-23 2019-09-26 nedl.com, Inc. Real-time audio stream search and presentation system
CN112740327A (en) * 2018-08-27 2021-04-30 谷歌有限责任公司 Algorithmic determination of story reader reading interruption
CN110460872A (en) * 2019-09-05 2019-11-15 腾讯科技(深圳)有限公司 Information display method, device, equipment and the storage medium of net cast
CN112162680A (en) * 2020-09-24 2021-01-01 掌阅科技股份有限公司 Correlation method of reading service and live broadcast service, computing device and storage medium
CN112423089A (en) * 2020-11-19 2021-02-26 维沃移动通信有限公司 Live broadcasting method and device
CN113096635A (en) * 2021-03-31 2021-07-09 北京字节跳动网络技术有限公司 Audio and text synchronization method, device, equipment and medium
CN113852835A (en) * 2021-09-22 2021-12-28 北京百度网讯科技有限公司 Live broadcast audio processing method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109286852B (en) Competition method and device for live broadcast room
CN110267067B (en) Live broadcast room recommendation method, device, equipment and storage medium
WO2020010818A1 (en) Video capturing method and apparatus, terminal, server and storage medium
GB2589506A (en) Method and apparatus for selecting background music for video capture, terminal device, and medium
US20210258619A1 (en) Method for processing live streaming clips and apparatus, electronic device and computer storage medium
CN109729372B (en) Live broadcast room switching method, device, terminal, server and storage medium
JP2017517828A (en) Audio information identification method and apparatus
JP2014131113A (en) Reproducer, reproducing method, and recording medium
EP1860807A2 (en) Apparatus and method for receiving digital multimedia broadcast in electronic device
CN109327707B (en) Method, device and storage medium for transferring virtual resources
CN110418152B (en) Method and device for carrying out live broadcast prompt
US9491401B2 (en) Video call method and electronic device supporting the method
CN110290392B (en) Live broadcast information display method, device, equipment and storage medium
CN104159139A (en) Method and device of multimedia synchronization
CN110087148A (en) A kind of video sharing method, apparatus, electronic equipment and storage medium
CN110798327B (en) Message processing method, device and storage medium
CN106375846B (en) The processing method and processing device of live audio
CN113596516B (en) Method, system, equipment and storage medium for chorus of microphone and microphone
CN115150633A (en) Processing method for live broadcast reading, computer equipment and storage medium
CN110808021A (en) Audio playing method, device, terminal and storage medium
CN116132699A (en) Live interaction method, computer equipment and storage medium
CN115065836A (en) Live broadcast room switching display processing method, server, electronic terminal and storage medium
TW201116060A (en) Portable electronic apparatus and channel-switching method of the same
CN114125476A (en) Display processing method of display interface, electronic device and storage medium
CN113127678A (en) Audio processing method, device, terminal and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination