WO2022005618A1 - Processing video and audio streaming data - Google Patents

Processing video and audio streaming data

Info

Publication number
WO2022005618A1
Authority
WO
WIPO (PCT)
Prior art keywords
viewer
audio data
video data
processed streaming
streaming
Prior art date
Application number
PCT/US2021/031565
Other languages
French (fr)
Inventor
Gideon Eden
Original Assignee
Gideon Eden
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US16/915,741 (US11310554B2)
Application filed by Gideon Eden
Publication of WO2022005618A1

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43074Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of additional data with content streams on the same device, e.g. of EPG data or interactive icon with a TV program
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Definitions

  • Various embodiments disclosed herein relate to processing video and audio streaming data.
  • Live events constitute a widely broadcast portion of television programming, including news, political events, sporting events, parades, concerts, and the like. Live events are often accompanied by audio data along with the video data, such as direct audio recordings from the events, and commentary generated by television broadcasting personnel. As television communication grows internationally, viewers are able to watch live events that originate from all over the world. However, the accompanying audio may not always be desirable to the viewer, or meet the viewer’s expectations. If a viewer wishes to view a news station or local event of a foreign country, translation may be necessary, but is not always offered by the television broadcasting providers.
  • Because the audio data and the video data are transferred via various independent communication channels, there is usually an inherent delay between the streaming of the audio data and the video data.
  • the disclosure describes a method of generating and presenting digital streaming data to a viewer or to a group of viewers.
  • the digital streaming data may comprise streaming video data depicted from a live event and streaming audio data generated by a commentator.
  • the streaming audio data may be related to the live event.
  • Methods of generating and presenting digital streaming data may include presenting the streaming video data to the commentator and the viewer.
  • a video presentation timing to the commentator may differ from a video presentation timing to the viewer.
  • Methods may also include generating streaming audio data related to the live event.
  • methods may include transferring the streaming audio data via an Internet to the viewer.
  • An audio presentation timing to the commentator may differ from an audio presentation timing to the viewer.
  • methods may include providing each of the viewers with synchronization means to synchronize the streaming video data with the streaming audio data.
  • the synchronization means may include at least one of electronic circuitry that may delay an introduction of the streaming audio data received by the viewer, a software delay that may delay the introduction of the streaming audio data received by the viewer, and a video controlled buffer that may delay the introduction of the streaming video data to the viewer.
  • the video controlled buffer may comprise electronic memory that may be connected between a television data channel that may provide the streaming video data and a commercial television unit.
  • the video controlled buffer may provide the viewer with a capability to interactively delay the streaming video data to the commercial television unit.
  • the streaming video data may be delayed by the viewer selectively stopping and restarting the flow of the streaming video data at different times until the streaming video data and the streaming audio data are substantially synchronized.
  • methods may include transferring the streaming audio data via the Internet.
  • the transferring may be carried out via a web site to which both the commentator and the viewer are logged in.
  • the streaming audio data may be received by the viewer by at least one of a mainframe computer, a personal computer, a tablet, and a smart phone.
  • the video controlled buffer may comprise a software buffering program.
  • the software buffering program may be embedded in a processor that may receive the streaming video data from a television channel, and provide a time delayed video signal controlled by the viewer, and present delayed video data to a display unit of the processor.
  • the processor may also be used to delay the audio data.
  • the processor may comprise at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
  • Methods may include the viewer receiving the streaming audio data and the streaming video data.
  • the streaming audio data received by the user may precede the streaming video data received by the user.
  • the viewer may interactively delay the presentation of the streaming audio data by incrementing the delay time until synchronization is achieved.
  • Methods may further include the viewer receiving the streaming audio data and the streaming video data.
  • the streaming video data received by the user may precede the streaming audio data received by the user.
  • the viewer may interactively delay the presentation of the streaming video data by incrementing the delay time until synchronization is achieved.
  • the streaming video data received by the viewer may precede the streaming audio data received by the viewer.
  • Methods may also include the viewer applying a first step in which the streaming video data is delayed sufficiently to cause the streaming audio data to precede the streaming video data.
  • Methods may further include the viewer applying a secondary step of delaying the audio data to fine-tune the desired synchronization.
  • the system may enable the viewers to communicate with the commentator and among themselves.
  • the system may provide text message functionality related to the live event via the Internet.
  • the viewers may communicate with the commentator regarding the live event via the web site.
  • the disclosure also includes a method of generating and presenting non-processed streaming audio data and non-processed streaming video data to a first viewer and a second viewer, the non-processed streaming video data comprising video data depicted from a live event, the non-processed streaming audio data comprising audio data related to the live event and generated by a commentator.
  • the method comprises presenting, via a commercial television display unit, the non-processed streaming video data to the commentator, the first viewer, and the second viewer, wherein a video presentation timing to the commentator may differ from the video presentation timing to at least one of the first viewer and the second viewer, and the video presentation timing to the first viewer may differ from the video presentation timing to the second viewer.
  • the method includes generating the non-processed streaming audio data related to the live event by the commentator and transferring the non-processed streaming audio data via an Internet to the first viewer and the second viewer.
  • the method includes providing the first viewer and the second viewer with a capability to synchronize a video presentation time of the non-processed streaming video data with an audio presentation time of the non-processed streaming audio data, wherein the first viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a first time, and the second viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a second time that is different from the first time.
  • the commercial television display unit may be capable of displaying an image based upon at least one of: a wireless electromagnetic video signal, a video signal transmitted via a cable, a video signal received via a satellite dish, and video data received via a streaming internet.
  • the commercial television display may comprise a personal computer display capable of displaying an image based upon streaming data, a tablet display capable of displaying an image based upon streaming data, and a smart phone display capable of displaying an image based upon streaming data.
  • the capability to synchronize comprises at least one of electronic circuitry that delays an introduction of the non-processed streaming audio data received by the first viewer and the second viewer, a software delay that delays the introduction of the non-processed streaming audio data received by the first viewer and the second viewer, and a video controlled buffer that delays an introduction of the non-processed streaming video data to the first viewer and the second viewer.
  • the video controlled buffer may comprise electronic memory connected between a commercial television channel that provides the non-processed streaming video data and the commercial television unit, thereby providing at least one of the first viewer and the second viewer with a capability to interactively delay the non-processed streaming video data to the commercial television unit, by selectively stopping and restarting a flow of the non-processed streaming video data at different times until the non-processed streaming video data and the non-processed streaming audio data are substantially synchronized.
  • transferring the non-processed streaming audio data via the Internet is carried out via at least one web site to which both the commentator and at least one of the first viewer and the second viewer are logged in.
  • the non-processed streaming audio data may be received by at least one of the commentator, the first viewer and the second viewer by at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
  • the video controlled buffer comprises a software buffering program embedded in a processor that receives the non-processed streaming video data from a television channel, provides a time delay controlled by at least one of the first viewer and the second viewer, and presents delayed video data to a display unit linked to the processor.
  • the processor is also used to delay the audio data.
  • the processor may comprise at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
  • the non-processed streaming audio data received by at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
  • the non-processed streaming video data received by at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of a video presentation of the non-processed streaming video data until synchronization is achieved.
  • the non-processed streaming video data received by at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer applies a first step in which the non-processed streaming video data is delayed sufficiently to cause the non-processed streaming audio data to precede the non-processed streaming video data.
  • the non-processed streaming audio data received by at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
  • the method may comprise enabling the first viewer and the second viewer to communicate with the commentator and with each other during an occurrence of the live event by providing text messaging functionality via the Internet between at least two of the first viewer, the second viewer and the commentator.
  • the method includes facilitating transmission and reception of a text message to at least two of the first viewer, the second viewer, and the commentator.
  • methods may comprise determining an individual time difference between a preceding audio presentation and the related video presentation to any of the first viewer and the second viewer whose received audio data precedes the related video data.
  • the method includes delaying the reception of the text message by an amount of time substantially equal to the individual time difference, to any of the first viewer and the second viewer whose received audio data precedes the related video data.
  • the method further comprises synchronizing video data of a televised event of each of the first viewer and the second viewer with an audio data of the commentator, and determining the individual time difference, and delaying reception of the text message to any of the first viewer and the second viewer whose received audio data precedes a received video data.
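  • The text message delay described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function name `text_message_delay_ms`, the use of millisecond arrival times, and the example viewer values are all assumptions made here. A viewer whose audio precedes the video gets a message delay equal to the individual time difference; any other viewer gets no delay.

```python
def text_message_delay_ms(audio_arrival_ms: int, video_arrival_ms: int) -> int:
    """Delay to apply to a text message for one viewer (hypothetical helper).

    The individual time difference is taken here as video arrival minus
    audio arrival. It is positive only when the audio precedes the video.
    """
    difference_ms = video_arrival_ms - audio_arrival_ms
    # Only viewers whose audio precedes the video have their messages delayed.
    return difference_ms if difference_ms > 0 else 0


# Illustrative (made-up) arrival times for two viewers, in milliseconds.
viewers = {
    "first_viewer": {"audio": 1_000, "video": 4_500},   # audio leads by 3.5 s
    "second_viewer": {"audio": 6_000, "video": 2_000},  # video leads
}
delays = {
    name: text_message_delay_ms(t["audio"], t["video"])
    for name, t in viewers.items()
}
```

  • In this sketch the first viewer's messages would be held back by 3,500 ms so that they do not run ahead of the picture, while the second viewer's messages would be delivered without added delay.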
  • the method further includes adjusting the time delay, via a buffer control unit receiving input from a user.
  • the buffer control unit may be capable of continuously incrementing and decrementing the time delay to any desired value until synchronization is achieved.
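  • One way to picture the buffer control unit is as a small object that holds the current time delay and moves it up or down in fixed steps. The sketch below is an assumption-laden illustration: the class name `BufferControlUnit`, the 10 ms default step, and the choice to clamp the delay at zero are not taken from the disclosure.

```python
class BufferControlUnit:
    """Hypothetical model of a user-operated delay control."""

    def __init__(self, step_ms: int = 10, delay_ms: int = 0):
        if step_ms <= 0:
            raise ValueError("step_ms must be positive")
        if delay_ms < 0:
            raise ValueError("delay_ms must not be negative")
        self.step_ms = step_ms
        self.delay_ms = delay_ms

    def increment(self) -> int:
        """Lengthen the delay by one step and return the new value."""
        self.delay_ms += self.step_ms
        return self.delay_ms

    def decrement(self) -> int:
        """Shorten the delay by one step, never going below zero (assumption)."""
        self.delay_ms = max(0, self.delay_ms - self.step_ms)
        return self.delay_ms


control = BufferControlUnit(step_ms=10)
for _ in range(5):
    control.increment()      # user holds the "more delay" input
control.decrement()          # user backs off one step after overshooting
```

  • A user would repeat such increments and decrements until the picture and the commentary appear synchronized.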
  • Figure 1 illustrates a schematic of a method of using a data streaming system, according to some embodiments.
  • Figure 2 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 3 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 4 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 5 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 6 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 7 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 8 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
  • Figure 9 illustrates a buffer control unit, according to some embodiments.
  • an additional audio channel may be generated by a commentator while watching a live event on a television monitor linked to a commercial television channel.
  • the commentator may create audio data by recording vocal commentary in real time while watching the event.
  • Such audio data may be transferred via the Internet to multiple viewers. Each viewer may watch the live event via a television monitor while listening to the audio data of the desired commentator via the Internet.
  • FIG. 1 illustrates an embodiment of the present invention.
  • a system 1 may incorporate streaming video data depicted from a live scene 2 generated by a television camera 3, such as a commercial broadcasting apparatus.
  • a combined streaming video and streaming audio data may be distributed via numerous television channels 4 and distribution lines 5 to viewers that may watch the live event on commercial television monitors 9.
  • the channels and distribution lines that communicate the live event may be quite complicated and may include satellites, optical and electrical transmission lines, digital processors, etc. Consequently, different viewers nationwide and internationally may not receive the combined streaming data at the same instance, as each channel may have differing intrinsic delays.
  • the viewers 8 include numerous viewers (viewer1, viewer2, ..., viewerN), each of whom may receive the combined streaming video and audio data with a different delay time; these delay times can differ by up to tens of seconds.
  • the same combined video/audio data may also be delivered in real time to a designated commentator 7 via a distribution line 6 and presented on a screen monitor 11.
  • the distribution line 6 may also have an inherent delay different from that of each of the distribution lines 5.
  • vocal comments may be generated in real time by the commentator 7 watching the live event 2 on the television monitor 11.
  • the commentator 7 may record vocal comments with the aid of a microphone 14.
  • An audio signal of the comments may be transmitted in real time to a processor 12, which may be linked to an Internet 13 via a channel 15.
  • the processor may be any device that may be connected to the internet capable of transferring digital data, such as a main frame computer, a desktop computer, laptop computer, tablet, smart phone, and the like.
  • Each of the viewers 8 may have a viewer processor 10 linked to the Internet via Internet channels 16, such as an Ethernet cable and a Wi-Fi connection.
  • the processor may be capable of delivering the vocal commentary generated by the commentator 7 in real time to each of the viewers.
  • the audio commentary received by each of the individual viewers may either precede or lag behind the video signal presented on the television monitor. Even a small time difference (such as a single second) may be unacceptable to a viewer. Moreover, delays of tens of seconds are possible, which makes watching video data from one source and listening to audio data from a separate source a frustrating endeavor and renders useless those systems that provide only general synchronization of audio data with video data according to the timing of the live event.
  • the first mechanism may be a video delay means 17.
  • the video delay means 17 may receive streaming video data from the television channel 5 and provide delayed streaming data to the television monitor 9.
  • Such video delay means may comprise a memory buffer on which the viewer may store the streaming video data, and release the data to the monitor after a specific delay time.
  • the streaming data may be temporarily stored in a storage device, such as memory (RAM), disk drive, hard drive, and the like.
  • Such a mechanism may be implemented when the video data precedes the audio data.
  • while this device may provide video delay functionality, its inherently low-resolution incremental delay time units may hinder the system. The high-resolution incremental delays (10-100 milliseconds) that precise synchronization may require are generally difficult to generate.
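  • The store-and-release behavior of the video delay means can be sketched in software as a first-in, first-out buffer. This is only an illustration under stated assumptions: the class name `VideoDelayBuffer`, the millisecond timestamps supplied by the caller, and the string stand-ins for video frames are hypothetical and do not describe the disclosed hardware.

```python
from collections import deque


class VideoDelayBuffer:
    """Hypothetical FIFO that releases each frame a fixed time after arrival."""

    def __init__(self, delay_ms: int):
        self.delay_ms = delay_ms
        self._frames = deque()  # (arrival_ms, frame) pairs in arrival order

    def push(self, arrival_ms: int, frame) -> None:
        """Store a frame together with the time at which it arrived."""
        self._frames.append((arrival_ms, frame))

    def pop_ready(self, now_ms: int) -> list:
        """Return, oldest first, every frame whose delay time has elapsed."""
        ready = []
        while self._frames and self._frames[0][0] + self.delay_ms <= now_ms:
            ready.append(self._frames.popleft()[1])
        return ready


buffer = VideoDelayBuffer(delay_ms=300)
buffer.push(0, "frame-0")
buffer.push(100, "frame-1")
early = buffer.pop_ready(250)   # nothing has waited 300 ms yet
first = buffer.pop_ready(300)   # frame-0 is released
second = buffer.pop_ready(450)  # frame-1 is released
```

  • Changing `delay_ms` between calls would change how long the remaining frames are held, which is one way a viewer-controlled delay could be modeled.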
  • the audio data may precede the video data.
  • the system may provide audio delay means.
  • the audio data may be delayed by utilizing a software module in the viewer’s processor 10.
  • the software module may generate an audio data delay in high resolution increments (10-100 milliseconds) until synchronization is achieved.
  • Figure 2 illustrates a flow chart of process 100 that may be utilized to obtain synchronization.
  • the video data and the audio data may be received by a viewer at different times, as shown in block 101.
  • the timing of the video data may be described as TIMING(video) and the timing of the audio data may be described as TIMING(audio).
  • Block 102 shows a calculation that may be implemented to find the difference between the timing of the video data and the timing of the audio data.
  • the delay may be zero, indicating that a delay between the video data and the audio data does not exist and thus the audio data and the video data are synchronized.
  • a delay may exist and the process 100 may determine which signal precedes the other, as shown in block 105.
  • a positive delay time may be determined when the video data precedes the audio data.
  • an incremental delay Delta may be generated by the video delay means, as indicated by block 106.
  • the delay time is recalculated at block 102.
  • the delay may then be calculated to be zero, meaning that the audio data and the video data are synchronized and no further action is required.
  • the process 100 may determine that the audio data precedes the video data, as shown at block 107.
  • a small incremental time Delta may be generated by the audio delay means.
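  • The loop of process 100 can be sketched as follows. The sketch rests on several assumptions that are not stated in the disclosure: the delay is computed as TIMING(audio) minus TIMING(video) so that a positive value means the video precedes the audio, times are integer milliseconds, the function name `synchronize` is hypothetical, and the loop stops once the remaining difference is within half of one increment Delta rather than exactly zero.

```python
def synchronize(timing_video_ms: int, timing_audio_ms: int, delta_ms: int = 50):
    """Return (video_delay_ms, audio_delay_ms) found by stepping in Delta units."""
    if delta_ms <= 0:
        raise ValueError("delta_ms must be positive")
    video_delay_ms = 0
    audio_delay_ms = 0
    tolerance_ms = delta_ms // 2  # assumption: "synchronized" means within Delta/2

    while True:
        # Block 102: recompute the difference with the delays applied so far.
        delay = (timing_audio_ms + audio_delay_ms) - (
            timing_video_ms + video_delay_ms
        )
        if abs(delay) <= tolerance_ms:
            return video_delay_ms, audio_delay_ms  # streams are synchronized
        if delay > 0:
            # Video precedes audio: the video delay means adds Delta (block 106).
            video_delay_ms += delta_ms
        else:
            # Audio precedes video (block 107): the audio delay means adds Delta.
            audio_delay_ms += delta_ms


video_leads = synchronize(timing_video_ms=1_000, timing_audio_ms=3_500, delta_ms=50)
audio_leads = synchronize(timing_video_ms=4_000, timing_audio_ms=1_000, delta_ms=100)
```

  • In the first call the video is held back in 50 ms steps until it matches the audio; in the second the audio is held back in 100 ms steps. A smaller Delta gives finer synchronization at the cost of more iterations.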
  • the system 1 may provide communication between each of the viewers 8 and the commentator 7 during the duration of the live event 2.
  • the communication lines 15 and 16 can be bi-directional enabling each viewer to provide feedback to the commentator via the Internet 13.
  • the user 18 can generate text messages on the computer 10, which may appear in real time upon the computer screen 12 of the commentator, as well as upon the computer screen 10 of each of the viewers.
  • the disclosure also includes methods of generating and presenting digital streaming data to a viewer or a group of viewers.
  • the digital streaming data may comprise streaming video data depicted from a live event and streaming audio data generated by a commentator. It should be appreciated that the streaming audio data may be related to the live event.
  • methods may include presenting the streaming video data to the commentator and the viewer (at step 300).
  • a video presentation timing to the commentator may differ from a video presentation timing to the viewer.
  • the timing of video presentation may differ based on a variety of factors, such as the geographical location of the viewer and the commentator. For example, a commentator may be located in a city where the live event is taking place, and a viewer may be located in a city across a country from the city in which the live event is taking place. As such, inherent delays in broadcasting the live event may take place with the viewer relative to the commentator.
  • methods may include generating streaming audio data related to the live event (at step 302).
  • the commentator may generate the streaming audio data.
  • the streaming audio data may comprise vocal commentary, which may be related to the live event, participants in the live event, location of the live event, and the like. Such vocal commentary may be desirable to viewers who want a particular perspective of the live event that may be offered by the commentator.
  • the live event may comprise a local college basketball game.
  • Video broadcasting of the basketball game may include vocal commentary relating to the game, but viewers may desire commentary relating to the players, such as the status of players, how a player’s skill level has improved, a player’s background, and referee’s decisions.
  • methods may further include transferring the streaming audio data via an Internet to the viewer (at step 304).
  • An audio presentation timing to the commentator may differ from an audio presentation timing to the viewer.
  • inherent delays in presentation to the viewer may occur based on a variety of factors, such as location, internet speed, audio data streaming source and the internet servers. For example, the viewer may have a slow Internet communication speed, which may thus prevent the audio data from being presented to the viewer at the same time that the commentator is transferring the audio data.
  • Methods may also include providing the viewer with synchronization means to synchronize the streaming video data with the streaming audio data (at step 306).
  • the synchronization means may comprise at least one of electronic circuitry that may delay an introduction of the streaming audio data received by the viewer, a software delay that may delay the introduction of the streaming audio data received by the viewer, and a video controlled buffer that may delay an introduction of the streaming video data to the viewer.
  • the video controlled buffer may include electronic memory that may be connected between a television data channel that provides the streaming video data and a commercial television unit.
  • methods of generating and presenting digital streaming data to a viewer may include connecting the video controlled buffer to the television data channel and the commercial television unit (at step 400).
  • the video controlled buffer may provide the viewer with the capability to delay the streaming video data to the commercial television unit.
  • Methods may also include receiving streaming video data by the television data channel (at step 402).
  • methods may include receiving audio data via an Internet (at step 404).
  • the viewer may receive the streaming video data and the audio data.
  • the timing of the video data and the timing of the audio data may differ.
  • methods of utilizing the video controlled buffer may further include determining that the video data arrives before the audio data (at step 406).
  • methods may include delaying the video data by stopping and starting the flow of the streaming video data (at step 408).
  • the video controlled buffer may receive streaming video data from the commercial television unit and provide delayed streaming data to a monitor of the television.
  • the video controlled buffer may comprise a memory buffer in which a viewer may store the streaming video data and release the data to the television monitor after a specific delay time.
  • the video data may be temporarily stored in a storage device (e.g., RAM, disk drive) until the video data and the audio data are substantially synchronized.
  • the viewer may selectively stop and restart the flow of the streaming video data at different times until the streaming video data and the streaming audio data are substantially synchronized.
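  • Stopping and restarting the flow can be modeled as accumulating delay: every interval during which the flow is stopped adds its length to the total video delay. The sketch below is illustrative only, and the class name `StopStartVideoBuffer`, its method names, and the caller-supplied millisecond clock values are assumptions rather than part of the disclosure.

```python
class StopStartVideoBuffer:
    """Hypothetical tracker of the delay built up by stop/restart actions."""

    def __init__(self):
        self.delay_ms = 0            # total delay accumulated so far
        self._stopped_at_ms = None   # time of the current stop, if any

    def stop(self, now_ms: int) -> None:
        """Viewer stops the flow; incoming video keeps filling the buffer."""
        if self._stopped_at_ms is None:
            self._stopped_at_ms = now_ms

    def restart(self, now_ms: int) -> int:
        """Viewer restarts the flow; the stopped interval is added to the delay."""
        if self._stopped_at_ms is not None:
            self.delay_ms += now_ms - self._stopped_at_ms
            self._stopped_at_ms = None
        return self.delay_ms


video = StopStartVideoBuffer()
video.stop(1_000)
after_first = video.restart(1_400)   # coarse adjustment: 400 ms of delay
video.stop(2_000)
after_second = video.restart(2_100)  # further adjustment: 100 ms more
```

  • Because each stop can only add delay, this model fits the case in which the video data precedes the audio data.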
  • methods may include downloading or executing a software audio buffering program (at step 500).
  • the software buffering program may be downloaded to a remote computing device of the viewer.
  • the remote computing device may include at least one of a laptop computer, desktop computer, tablet, smart phone, and the like.
  • methods may include logging in to a website on the remote computing device (at step 502). At least one viewer and the commentator may log in to the website. The commentator may generate the audio data and transfer the audio data via the Internet. The transfer of the audio data from the commentator to the viewer may be carried out via the website.
  • the website may enable the viewer to communicate with the commentator. Communication may be achieved by providing text message functionality via the Internet. Methods may also include receiving the audio data on the remote computing device (at step 504). The audio data may be presented to the viewer, via the website to which the viewer is logged in, on the speaker of the remote computing device.
  • methods may include receiving, by the remote computing device, the video data from the television channel (at step 506).
  • the software buffering program of the remote computing device may receive the streaming video data from the television channel.
  • the software buffering program may provide a time delay that may be controlled by the viewer.
  • methods may include determining that the audio data precedes the video data (at step 508).
  • methods may include delaying the presentation of the audio data (at step 510).
  • the viewer may interactively delay the presentation of the streaming audio data.
  • Methods may further include adding increments of time to the audio data until the audio data and the video data are synchronized (at step 512).
  • methods may include presenting the video data and the delayed audio data (at step 514).
  • the video data and the delayed audio data may be presented to the viewer by the remote computing device.
  • the video data may be presented by the commercial television unit and the delayed audio data may be presented by the remote computing device. It should be appreciated that the video data and the delayed audio data may be presented to the viewer via different devices or a single device.
  • methods may include downloading the software buffering program (at step 600).
  • the video controlled buffer may comprise the software buffering program.
  • the software buffering program may be downloaded to the remote computing device of the viewer.
  • methods may include logging in to the website on the remote computing device (at step 602). At least one viewer and the commentator may log in to the website. The audio data may be transferred from the commentator to the viewer via the Internet, and may be carried out by the website. As such, methods may include receiving the audio data on the remote computing device (at step 604). The viewer may receive the audio data by presenting the audio data via the website on a speaker linked to the remote computing device.
  • methods may include receiving, by the remote computing device, the video data from the television channel (at step 606).
  • the software buffering program may receive the streaming video data from the television channel.
  • the software buffering program may provide a time delay controlled by the viewer. Methods may thus also include determining that the video data precedes the audio data (at step 608).
  • methods may include delaying the presentation of the video data (at step 610).
  • the viewer may interactively delay the presentation of the streaming video data.
  • Methods may further include adding increments of time to the video data until the video data and the audio data are synchronized (at step 612).
  • methods may include presenting the audio data and the delayed video data (at step 614).
  • the audio data and the delayed video data may be presented to the viewer by the remote computing device.
  • the delayed video data may be presented by the commercial television unit and the audio data may be presented by the remote computing device. It should be appreciated that the delayed video data and the audio data may be presented to the viewer via different devices or a single device.
  • methods may include receiving the audio data and the video data (at step 700).
  • the audio data may be received by the viewer via the website.
  • Methods may also include determining that the video data precedes the audio data (at step 702).
  • methods may include delaying the video data (at step 704).
  • the viewer may delay the video data by applying a first step of delaying the streaming video data.
  • the viewer may delay the streaming video data until the video data is delayed sufficiently to cause the streaming audio data to precede the streaming video data.
  • Methods may thus include determining that the audio data precedes the video data (at step 706).
  • Methods may further include fine tuning a delay in the audio data to synchronize the audio data and the video data (at step 708).
  • the audio data may thus be delayed to fine tune the desired synchronization.
  • This disclosure includes a method of synchronizing video data depicted from a live event, with audio data generated by a commentator.
  • the commentator and a plurality of viewers may watch the event in real time on displays capable of streaming video and audio data.
  • the commentator and the plurality of viewers receive the data on commercial television units, via commercial television channels. It should be appreciated that the data may be received and watched by the commentator and plurality of viewers via any system capable of streaming video and audio data, such as a streaming application.
  • the commentator may generate audio data that is transferred via the Internet to each one of the viewers. Each one of the viewers may have the ability to synchronize the video data with the audio data.
  • the commentator and the plurality of viewers may communicate among themselves by generating and receiving text messages, via any type of computing device, such as a smart phone, tablet, personal computer, any type of wearable device, any type of device capable of sending and receiving data, and the like.
  • the method includes enabling the first viewer and the second viewer to communicate with the commentator and with each other during an occurrence of the live event by providing text messaging functionality (via the Internet, WiFi, 3G, 4G, 5G, LTE, Bluetooth, Bluetooth low energy (“BLE”), Z-Wave, NFC, RFID, SigFox, DigiMesh, MiWi, Weightless, Thread, ZigBee, and the like) between at least two of the first viewer, the second viewer and the commentator (at step 800).
  • the method includes facilitating transmission and reception of a text message to at least two of the first viewer, the second viewer, and the commentator (at step 802).
  • methods may include synchronizing video data of a televised event of each of the first viewer and the second viewer with an audio data of the commentator (at step 804).
  • Systems and methods described herein may thereby delay receiving the text message, until at least one of the audio data and video data has been received, to thereby prevent spoilers from occurring. Accordingly, methods may include determining an individual time difference between a preceding audio presentation and the related video presentation to any of the first viewer and the second viewer whose received audio data precedes the related video data (at step 806).
  • the method includes delaying the reception of the text message to at least one of the first viewer, the second viewer, and the commentator whose received audio data precedes the related video data (at step 808). In some embodiments, receiving the text message is delayed by an amount of time substantially equal to the individual time difference (at step 810).
  • Figure 9 illustrates the concept of a buffer control unit that can be used to determine the delay time for either video streaming data or audio streaming data.
  • the buffer control unit may be implemented by either hardware circuit component (linear or rotational potentiometer) or a software module.
  • the buffer control unit is characterized by a linear (or rotational) slider 900 with a linear track 902 in which a knob 901 can be moved horizontally.
  • the knob 901 may be moved by the operator’s fingers.
  • the knob 901 may be moved by a mouse by clicking on the knob and dragging it horizontally. As shown in Figure 9, the left side corresponds to zero delay or no delay, and the rightmost side corresponds to 60 seconds of delay.
  • this configuration may allow overshooting of the delay time followed by a correction of moving the knob backwards. As such, the user does not have to precisely measure the delay time since the user can interactively and precisely adjust the location until synchronization is accomplished.
  • section headings and subheadings provided herein are nonlimiting.
  • the section headings and subheadings do not represent or limit the full scope of the embodiments described in the sections to which the headings and subheadings pertain.
  • a section titled “Topic 1” may include embodiments that do not pertain to Topic 1 and embodiments described in other sections may apply to and be combined with embodiments described within the “Topic 1” section.
  • routines, processes, methods, and algorithms described in the preceding sections may be embodied in, and fully or partially automated by, code modules executed by one or more computers, computer processors, or machines configured to execute computer instructions.
  • the code modules may be stored on any type of non-transitory computer-readable storage medium or tangible computer storage device, such as hard drives, solid state memory, flash memory, optical disc, and/or the like.
  • the processes and algorithms may be implemented partially or wholly in application-specific circuitry.
  • the results of the disclosed processes and process steps may be stored, persistently or otherwise, in any type of non-transitory computer storage such as, e.g., volatile or non-volatile storage.
  • A, B, and/or C can be replaced with A, B, and C written in one sentence and A, B, or C written in another sentence.
  • A, B, and/or C means that some embodiments can include A and B, some embodiments can include A and C, some embodiments can include B and C, some embodiments can only include A, some embodiments can include only B, some embodiments can include only C, and some embodiments include A, B, and C.
  • the term “and/or” is used to avoid unnecessary redundancy.

Abstract

The disclosure includes a method of generating and presenting non-processed streaming audio data and non-processed streaming video data to a first viewer and a second viewer. The method may include presenting, via a commercial television display unit, the non-processed streaming video data to at least one of the commentator, the first viewer, and the second viewer. The method may include generating the non-processed streaming audio data related to the live event by the commentator, and transferring the non-processed streaming audio data via an Internet to the first viewer and the second viewer. Also, the method may include providing the first viewer and the second viewer with a capability to synchronize a video presentation time of the non-processed streaming video data with an audio presentation time of the non-processed streaming audio data.

Description

PROCESSING VIDEO AND AUDIO STREAMING DATA
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of U.S. Provisional Patent Application No. 63/101,724; filed May 11, 2020; and entitled METHOD OF PROCESSING VIDEO AND AUDIO STREAMING DATA; the entire contents of which are incorporated herein.
This application claims the benefit of U.S. Nonprovisional Patent Application No. 16/915,741; filed June 29, 2020; and entitled PROCESSING VIDEO AND AUDIO STREAMING DATA; the entire contents of which are incorporated herein.
BACKGROUND
Field
Various embodiments disclosed herein relate to processing video and audio streaming data.
Description of Related Art
Traditional commercial television broadcasting provides viewers with content composed of both video streaming data and audio streaming data. Live events constitute a widely broadcasted portion of television programming, including news, political events, sporting events, parades, concerts, and the like. Live events are often accompanied by audio data along with the video data, such as direct audio recordings from the events, and commentary generated by television broadcasting personnel. As television communication grows internationally, viewers are able to watch live events that originate from all over the world. However, the accompanying audio may not always be desirable to the viewer, or meet the viewer’s expectations. If a viewer wishes to view a news station or local event of a foreign country, translation may be necessary, but not always offered by the television broadcasting providers. While live events, such as a parade or concert, may be accompanied by audio commentary, a viewer may wish to hear the observations of a person participating in the festivities. As well, a viewer watching a sporting event may be more interested in commentary from a local fan community than the commentary provided by the broadcasting television station. Such fans may resort to listening to local radio stations while watching the event in order to obtain the information they desire. With this solution, as the audio data and the video data are transferred via various independent communication channels, there is usually an inherent delay between the streaming of the audio data and the video data.
The operation of communication satellites and the digital processing mechanisms in servers, which may be different paths for the commentator and the viewer, often results in a non-synchronized video and audio data presentation. Thus, there is a need for a more effective way to synchronize audio data and video data received from different communication channels in order to provide alignment between the video streaming data and the audio streaming data.
SUMMARY
The disclosure describes a method of generating and presenting digital streaming data to a viewer or to a group of viewers. The digital streaming data may comprise streaming video data depicted from a live event and streaming audio data generated by a commentator. The streaming audio data may be related to the live event.
Methods of generating and presenting digital streaming data may include presenting the streaming video data to the commentator and the viewer. A video presentation timing to the commentator may differ from a video presentation timing to the viewer. Methods may also include generating streaming audio data related to the live event. As well, methods may include transferring the streaming audio data via an Internet to the viewer. An audio presentation timing to the commentator may differ from an audio presentation timing to the viewer. Furthermore, methods may include providing each of the viewers with synchronization means to synchronize the streaming video data with the streaming audio data.
The synchronization means may include at least one of electronic circuitry that may delay an introduction of the streaming audio data received by the viewer, a software delay that may delay the introduction of the streaming audio data received by the viewer, and a video controlled buffer that may delay the introduction of the streaming video data to the viewer. The video controlled buffer may comprise electronic memory that may be connected between a television data channel that may provide the streaming video data and a commercial television unit. The video controlled buffer may provide the viewer with a capability to interactively delay the streaming video data to the commercial television unit. The streaming video data may be delayed by the viewer selectively stopping and restarting the flow of the streaming video data at different times until the streaming video data and the streaming audio data are substantially synchronized.
In some embodiments, methods may include transferring the streaming audio data via the Internet. The transferring may be carried out via a web site to which both the commentator and the viewer are logged in. The streaming audio data may be received by the viewer by at least one of a mainframe computer, a personal computer, a tablet, and a smart phone.
In several embodiments, the video controlled buffer may comprise a software buffering program. The software buffering program may be embedded in a processor that may receive the streaming video data from a television channel, and provide a time delayed video signal controlled by the viewer, and present delayed video data to a display unit of the processor. In some embodiments, the processor may also be used to delay the audio data. The processor may comprise at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
Methods may include the viewer receiving the streaming audio data and the streaming video data. The streaming audio data received by the user may precede the streaming video data received by the user. The viewer may interactively delay the presentation of the streaming audio data by incrementing the delay time until synchronization is achieved.
Methods may further include the viewer receiving the streaming audio data and the streaming video data. The streaming video data received by the user may precede the streaming audio data received by the user. The viewer may interactively delay the presentation of the streaming video data by incrementing the delay time until synchronization is achieved. In some embodiments, the streaming video data received by the viewer may precede the streaming audio data received by the viewer. Methods may also include the viewer applying a first step in which the streaming video data is delayed sufficiently to cause the streaming audio data to precede the streaming video data. Methods may further include the viewer applying a secondary step of delaying the audio data to fine tune the desired synchronization.
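The two-step procedure above (a coarse video delay that overshoots, followed by a fine audio delay) can be sketched as follows. This is a minimal illustration only, not the disclosed implementation: the function name `two_step_sync`, the step sizes, and the use of a known `video_lead_ms` value are assumptions made for the example. In practice the viewer judges the offset by eye and ear rather than from a measured number.

```python
def two_step_sync(video_lead_ms, coarse_step_ms=1000, fine_step_ms=50):
    """Return (video_delay_ms, audio_delay_ms) for the two-step procedure.

    video_lead_ms is a hypothetical, known offset by which the video
    precedes the audio; all values are integer milliseconds.
    """
    # Step 1: add coarse video delay until the video no longer precedes the audio.
    video_delay_ms = 0
    while video_lead_ms - video_delay_ms > 0:
        video_delay_ms += coarse_step_ms

    # After the overshoot, the audio precedes the video by this amount.
    audio_lead_ms = video_delay_ms - video_lead_ms

    # Step 2: add fine audio delay until less than one fine step remains.
    audio_delay_ms = 0
    while audio_lead_ms - audio_delay_ms >= fine_step_ms:
        audio_delay_ms += fine_step_ms

    return video_delay_ms, audio_delay_ms


if __name__ == "__main__":
    video_delay, audio_delay = two_step_sync(3420)
    print(video_delay, audio_delay)
```

With a 3420 ms video lead and the assumed step sizes, the coarse step overshoots to 4000 ms of video delay, and the fine step then adds audio delay until the remaining offset is smaller than one fine increment.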
In some embodiments, the system may enable the viewers to communicate with the commentator and among themselves. The system may provide text message functionality related to the live event via the Internet. The viewers may communicate with the commentator regarding the live event via the web site.
The disclosure also includes a method of generating and presenting non-processed streaming audio data and non-processed streaming video data to a first viewer and a second viewer, the non-processed streaming video data comprising video data depicted from a live event, the non-processed streaming audio data comprising audio data related to the live event and generated by a commentator. In some embodiments, the method comprises presenting, via a commercial television display unit, the non-processed streaming video data to the commentator, the first viewer, and the second viewer, wherein a video presentation timing to the commentator may differ from the video presentation timing to at least one of the first viewer and the second viewer, and the video presentation timing to the first viewer may differ from the video presentation timing to the second viewer. In some embodiments, the method includes generating the non-processed streaming audio data related to the live event by the commentator and transferring the non-processed streaming audio data via an Internet to the first viewer and the second viewer. As well, in some embodiments, the method includes providing the first viewer and the second viewer with a capability to synchronize a video presentation time of the non-processed streaming video data with an audio presentation time of the non-processed streaming audio data, wherein the first viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a first time, and the second viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a second time that is different from the first time.
The commercial television display unit may be capable of displaying an image based upon at least one of: a wireless electromagnetic video signal, a video signal transmitted via a cable, a video signal received via a satellite dish, and video data received via a streaming internet. The commercial television display may comprise a personal computer display capable of displaying an image based upon streaming data, a tablet display capable of displaying an image based upon streaming data, and a smart phone display capable of displaying an image based upon streaming data.
In some embodiments, the capability to synchronize comprises at least one of electronic circuitry that delays an introduction of the non-processed streaming audio data received by the first viewer and the second viewer, a software delay that delays the introduction of the non-processed streaming audio data received by the first viewer and the second viewer, and a video controlled buffer that delays an introduction of the non-processed streaming video data to the first viewer and the second viewer. The video controlled buffer may comprise electronic memory connected between a commercial television channel that provides the non-processed streaming video data and the commercial television unit, thereby providing at least one of the first viewer and the second viewer with a capability to interactively delay the non-processed streaming video data to the commercial television unit, by selectively stopping and restarting a flow of the non-processed streaming video data at different times until the non-processed streaming video data and the non-processed streaming audio data are substantially synchronized.
In some embodiments, transferring the non-processed streaming audio data via the Internet is carried out via at least one web site to which both the commentator and at least one of the first viewer and the second viewer are logged in.
The non-processed streaming audio data may be received by at least one of the commentator, the first viewer and the second viewer by at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
In some embodiments, the video controlled buffer comprises a software buffering program embedded in a processor that receives the non-processed streaming video data from a television channel, provides a time delay controlled by at least one of the first viewer and the second viewer, and presents delayed video data to a display unit linked to the processor. In some embodiments, the processor is also used to delay the audio data. The processor may comprise at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
In some embodiments, the non-processed streaming audio data received by at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
In some embodiments, the non-processed streaming video data received by at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of a video presentation of the non-processed streaming video data until synchronization is achieved. In some embodiments, the non-processed streaming video data received by at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer applies a first step in which the non-processed streaming video data is delayed sufficiently to cause the non-processed streaming audio data to precede the non-processed streaming video data.
In some embodiments, the non-processed streaming audio data received by at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by at least one of the first viewer and the second viewer, and at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
In some embodiments, the method may comprise enabling the first viewer and the second viewer to communicate with the commentator and with each other during an occurrence of the live event by providing text messaging functionality via the Internet between at least two of the first viewer, the second viewer and the commentator. In some embodiments, the method includes facilitating transmission and reception of a text message to at least two of the first viewer, the second viewer, and the commentator. In some embodiments, methods may comprise determining an individual time difference between a preceding audio presentation and the related video presentation to any of the first viewer and the second viewer whose received audio data precedes the related video data.
Additionally, in some embodiments, the method includes delaying the reception of the text message by an amount of time substantially equal to the individual time difference, to any of the first viewer and the second viewer whose received audio data precedes the related video data. As well, in some embodiments, the method further comprises synchronizing video data of a televised event of each of the first viewer and the second viewer with an audio data of the commentator, and determining the individual time difference, and delaying reception of the text message to any of the first viewer and the second viewer whose received audio data precedes a received video data.
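The spoiler-prevention delay described above can be sketched as follows. This is an illustrative example under stated assumptions: the function name `message_release_time` and its parameters are hypothetical, and the two presentation times are assumed to refer to the same moment of the live event as presented to one viewer.

```python
def message_release_time(arrival_s, audio_time_s, video_time_s):
    """Return the time (in seconds) at which a text message may be shown.

    audio_time_s and video_time_s are the hypothetical times at which the
    same moment of the event is presented to this viewer as audio and as video.
    """
    # Individual time difference: positive when the audio precedes the video.
    time_difference_s = video_time_s - audio_time_s
    if time_difference_s > 0:
        # Hold the message back by the individual time difference.
        return arrival_s + time_difference_s
    # Audio does not precede video for this viewer, so no extra delay is added.
    return arrival_s


if __name__ == "__main__":
    # A viewer whose audio runs 4.5 seconds ahead of the video.
    print(message_release_time(100.0, audio_time_s=10.0, video_time_s=14.5))
```

Each viewer would have an individual time difference, so the same message could be released at different times to different viewers.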
In some embodiments, the method further includes adjusting the time delay, via a buffer control unit receiving input from a user. The buffer control unit may be capable of continuously incrementing and decrementing the time delay to any desired value until synchronization is achieved.
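As a rough sketch of how a software buffer control unit might translate a slider position into a time delay, the example below maps a knob position along a track to a delay value. The function name `slider_to_delay`, the pixel-based track length, and the linear mapping are assumptions for illustration; the 0 to 60 second range follows the description of Figure 9.

```python
MAX_DELAY_S = 60.0  # rightmost knob position, per the description of Figure 9


def slider_to_delay(position, track_length, max_delay_s=MAX_DELAY_S):
    """Map a knob position (0..track_length) linearly to a delay in seconds."""
    # Clamp so that dragging past either end of the track stays in range.
    clamped = min(max(position, 0), track_length)
    return max_delay_s * clamped / track_length


if __name__ == "__main__":
    # Hypothetical 300-pixel track: overshoot to 35 s, then correct back to 30 s.
    print(slider_to_delay(175, 300))
    print(slider_to_delay(150, 300))
```

Because the mapping is continuous, the user can overshoot and then move the knob backwards without measuring the delay in advance.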
BRIEF DESCRIPTION OF THE DRAWINGS
These and other features, aspects, and advantages are described below with reference to the drawings, which are intended to illustrate, but not to limit, the invention. In the drawings, like reference characters denote corresponding features consistently throughout similar embodiments.
Figure 1 illustrates a schematic of a method of using a data streaming system, according to some embodiments.
Figure 2 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 3 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 4 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 5 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 6 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 7 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 8 illustrates a flowchart of a method of using a data streaming system, according to some embodiments.
Figure 9 illustrates a buffer control unit, according to some embodiments.
DETAILED DESCRIPTION
Although certain embodiments and examples are disclosed below, inventive subject matter extends beyond the specifically disclosed embodiments to other alternative embodiments and/or uses, and to modifications and equivalents thereof. Thus, the scope of the claims appended hereto is not limited by any of the particular embodiments described below. For example, in any method or process disclosed herein, the acts or operations of the method or process may be performed in any suitable sequence and are not necessarily limited to any particular disclosed sequence. Various operations may be described as multiple discrete operations in turn, in a manner that may be helpful in understanding certain embodiments; however, the order of description should not be construed to imply that these operations are order dependent. Additionally, the structures, systems, and/or devices described herein may be embodied as integrated components or as separate components.
For purposes of comparing various embodiments, certain aspects and advantages of these embodiments are described. Not necessarily all such aspects or advantages are achieved by any particular embodiment. Thus, for example, various embodiments may be carried out in a manner that achieves or optimizes one advantage or group of advantages as taught herein without necessarily achieving other aspects or advantages as may also be taught or suggested herein.
SYNCHRONIZATION EMBODIMENTS
Commercial television broadcasting provides viewers with TV programs composed of video and audio streaming data. Live events such as news, political events and sport events make up a class of widely broadcasted television programming. The broadcasting stations typically provide audio data along with the video data. The audio data often include direct audio recordings from the live events and commentary generated by TV broadcasting personnel related to the live events. With the enormous growth of international television communications, many viewers watch live events transmitted via satellites in real time. With this global broadcasting, viewers can only listen to the audio information provided by the broadcasting personnel, which may not fulfill the viewers’ expectations. For example, international news can be viewed in many countries, not all of which speak the language of the broadcasting station. Thus, online translation of speech may be desired for foreign viewers, as well as local commentary relating to live events. As well, live sporting events only offer the commentary of the TV broadcasting station, while local fans are often more interested in commentary of their close fan community rather than that of the station. Such fans sometimes resort to listening to local broadcasting radio station while watching the live TV broadcast program in order to obtain the information and commentary they desire.
In the method of the current invention, an additional audio channel may be generated by a commentator while watching a live event on a television monitor linked to a commercial television channel. The commentator may create audio data by recording vocal commentary in real time while watching the event. Such audio data may be transferred via the Internet to multiple viewers. Each viewer may watch the live event via a television monitor while listening to the audio data of the desired commentator via the Internet.
However, the implementation of the proposed method is by no means trivial. The major difficulty lies in the presence of inherent delays between the video data and audio data in transferring the data via different communication channels. Operations of communication satellites, as well as digital processing mechanisms in servers (which may be different for the commentator and each of the viewers), usually result in non-synchronized video and audio data presentation. Consequently, there is a need for synchronization means at each viewer station that will provide synchronization of the video streaming data and the audio streaming data, whether the audio data precedes the video data, or the video data precedes the audio data.
Figure 1 illustrates an embodiment of the present invention. A system 1 may incorporate streaming video data depicted from a live scene 2 generated by a television camera 3, such as a commercial broadcasting apparatus. A combined streaming video and streaming audio data may be distributed via numerous television channels 4 and distribution lines 5 to viewers that may watch the live event on commercial television monitors 9. The channels and distribution lines that communicate the live event may be quite complicated and may include satellites, optical and electrical transmission lines, digital processors, etc. Consequently, different viewers nationwide and internationally may not receive the combined streaming data at the same instant, as each channel may have differing intrinsic delays. These delays are not typically significant enough to prevent the perception of “real time” coverage (unless the viewer simultaneously views the video data via a television channel and listens to radio broadcasting of the live event, which may be received ahead of the video data). In Figure 1, the viewers 8 include numerous viewers (viewer1, viewer2, ..., viewerN), each of whom may receive the streaming combined video and audio data with differing delay times, which can vary up to tens of seconds apart. The same combined video/audio data may also be presented in real time to a designated commentator 7 via a distribution line 6 and presented on a screen monitor 11. The distribution line 6 may also have an inherent delay different than each of the distribution lines 5.
In several embodiments, vocal comments may be generated in real time by the commentator 7 watching the live event 2 on the television monitor 11. As the event progresses, the commentator 7 may record vocal comments with the aid of a microphone 14. An audio signal of the comments may be transmitted in real time to a processor 12, which may be linked to an Internet 13 via a channel 15. The processor may be any device that may be connected to the internet capable of transferring digital data, such as a mainframe computer, a desktop computer, laptop computer, tablet, smart phone, and the like. Each of the viewers 8 may have a viewer processor 10 linked to the Internet via Internet channels 16, such as an Ethernet cable or a Wi-Fi connection. The processor may be capable of delivering the vocal commentary generated by the commentator 7 in real time to each of the viewers.
Due to the different inherent delay times of the video channels, the audio commentary received by each of the individual viewers may precede or lag behind the video signal presented on the television monitor. Even a small time difference (such as a single second) may be unacceptable to a viewer. However, delays of tens of seconds are possible, which makes watching video data from one source and listening to audio data from a separate source a frustrating endeavor and renders useless those systems that provide only general synchronization of audio data with video data according to timing of the live event.
In order to overcome such synchronization issues, two additional delay mechanisms are implemented in the system. The first mechanism may be a video delay means 17. The video delay means 17 may receive streaming video data from the television channel 5 and provide delayed streaming data to the television monitor 9. Such video delay means may comprise a memory buffer in which the viewer may store the streaming video data, and from which the data may be released to the monitor after a specific delay time. Internally, the streaming data may be temporarily stored in a storage device, such as memory (RAM), disk drive, hard drive, and the like. Such a mechanism may be implemented when the video data precedes the audio data. Additionally, while this device may provide video delay functionality, its inherently low resolution incremental delay time units may limit the system. The high resolution incremental delays (10-100 milliseconds) that precise synchronization may require are generally difficult to generate.
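The store-and-release behavior of such a memory buffer can be sketched as follows. This is a minimal illustration only, not the disclosed implementation: the class name `DelayBuffer`, its methods, and the use of caller-supplied timestamps in seconds are all assumptions made for the example.

```python
from collections import deque


class DelayBuffer:
    """Hypothetical buffer: releases each item `delay_s` seconds after it arrived."""

    def __init__(self, delay_s: float) -> None:
        self.delay_s = delay_s
        self._queue = deque()  # (arrival_time, item) pairs in arrival order

    def push(self, arrival_time: float, item) -> None:
        """Store an incoming item (e.g. a video frame) with its arrival time."""
        self._queue.append((arrival_time, item))

    def pop_ready(self, now: float) -> list:
        """Return, in order, every stored item whose delay has elapsed by `now`."""
        ready = []
        while self._queue and now - self._queue[0][0] >= self.delay_s:
            ready.append(self._queue.popleft()[1])
        return ready


buf = DelayBuffer(delay_s=2.0)
buf.push(0.0, "frame-0")
buf.push(1.0, "frame-1")
early = buf.pop_ready(1.5)   # nothing has waited 2 seconds yet
first = buf.pop_ready(2.0)   # frame-0 has now waited exactly 2 seconds
second = buf.pop_ready(3.5)  # frame-1 is released afterwards
```

Because items leave in the order they arrived, changing `delay_s` while the stream is running only shifts the release time of items still held in the queue.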
In some embodiments, the audio data may precede the video data. As such, the system may provide audio delay means. The audio data may be delayed by utilizing a software module in the viewer’s processor 10. The software module may generate an audio data delay in high resolution increments (10-100 milliseconds) until synchronization is achieved.
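As a rough illustration of a software audio delay, the sketch below converts a delay in milliseconds into a number of samples and prepends that much silence to a block of audio. The function names, the 48 kHz default sample rate, and the representation of audio as a list of floats are assumptions for the example and are not taken from the disclosure.

```python
from __future__ import annotations


def delay_in_samples(delay_ms: int, sample_rate_hz: int = 48_000) -> int:
    """Number of audio samples that span `delay_ms` at the given sample rate."""
    return delay_ms * sample_rate_hz // 1000


def apply_delay(samples: list[float], delay_ms: int,
                sample_rate_hz: int = 48_000) -> list[float]:
    """Delay the audio by prepending silence (zeros) of the requested length."""
    silence = [0.0] * delay_in_samples(delay_ms, sample_rate_hz)
    return silence + samples


ten_ms = delay_in_samples(10)                     # one 10 ms increment at 48 kHz
delayed = apply_delay([1.0, 2.0], 1, 2000)        # 1 ms at 2 kHz is 2 samples
```

At typical audio sample rates a 10 millisecond step corresponds to a few hundred samples, which is why fine increments are straightforward to produce in software.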
Figure 2 illustrates a flow chart of process 100 that may be utilized to obtain synchronization. The video data and the audio data may be received by a viewer at different times, as shown in block 101. The timing of the video data may be described as TIMING(video) and the timing of the audio data may be described as TIMING(audio). Block 102 shows a calculation that may be implemented to find the difference between the timing of the video data and the timing of the audio data.
In some instances, the delay may be zero, indicating that a delay between the video data and the audio data does not exist and thus the audio data and the video data are synchronized. In some instances, a delay may exist and the process 100 may determine which signal precedes the other, as shown in block 105. A positive delay time may be determined when the video data precedes the audio data. In response, an incremental delay Delta may be generated by the video delay means, as indicated by block 106. When the video data has been delayed, the delay time is recalculated at block 102. The delay may then be calculated to be zero, meaning that the audio data and the video data are synchronized and no further action is required. In some instances, the process 100 may determine that the audio data precedes the video data, as shown at block 107. In response, a small incremental time Delta is generated by the audio delay means.
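The loop of process 100 can be sketched as below. Several details are assumptions made for illustration: the delay is taken as TIMING(audio) minus TIMING(video) so that it is positive when the video precedes the audio, times are integer milliseconds, and the loop stops once the residual offset is within half of one Delta rather than exactly zero, since a fixed increment cannot always reach zero.

```python
def synchronize(timing_video_ms: int, timing_audio_ms: int,
                delta_ms: int = 10) -> tuple:
    """Return (video_delay_ms, audio_delay_ms) added to align the two streams."""
    video_delay_ms = 0
    audio_delay_ms = 0
    while True:
        # Recalculate the delay (block 102), including delays applied so far.
        delay_ms = ((timing_audio_ms + audio_delay_ms)
                    - (timing_video_ms + video_delay_ms))
        if abs(delay_ms) <= delta_ms // 2:
            return video_delay_ms, audio_delay_ms  # treated as synchronized
        if delay_ms > 0:
            video_delay_ms += delta_ms  # video precedes audio: delay the video
        else:
            audio_delay_ms += delta_ms  # audio precedes video: delay the audio


video_first = synchronize(1000, 1030)  # video arrives 30 ms before audio
audio_first = synchronize(5000, 4975)  # audio arrives 25 ms before video
```

In the second call the residual offset after two increments is 5 milliseconds, which falls inside the assumed tolerance, so the loop stops without oscillating between the two delay means.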
In several embodiments, the system 1 may provide communication between each of the viewers 8 and the commentator 7 during the duration of the live event 2. As shown in Figure 1, the communication lines 15 and 16 can be bi-directional enabling each viewer to provide feedback to the commentator via the Internet 13. For example, the user 18 can generate text messages on the computer 10 which may appear in real time upon the computer screen 12 of the commentator, as well as upon each of the viewer’s computer screen 10.
METHOD EMBODIMENTS
The disclosure also includes methods of generating and presenting digital streaming data to a viewer or a group of viewers. The digital streaming data may comprise streaming video data depicted from a live event and streaming audio data generated by a commentator. It should be appreciated that the streaming audio data may be related to the live event.
As shown in Fig. 3, methods may include presenting the streaming video data to the commentator and the viewer (at step 300). A video presentation timing to the commentator may differ from a video presentation timing to the viewer. The timing of video presentation may differ based on a variety of factors, such as the geographical location of the viewer and the commentator. For example, a commentator may be located in a city where the live event is taking place, and a viewer may be located in a city across a country from the city in which the live event is taking place. As such, inherent delays in broadcasting the live event may take place with the viewer relative to the commentator.
In some embodiments, methods may include generating streaming audio data related to the live event (at step 302). The commentator may generate the streaming audio data. The streaming audio data may comprise vocal commentary, which may be related to the live event, participants in the live event, location of the live event, and the like. Such vocal commentary may be desirable to viewers who want a particular perspective of the live event that may be offered by the commentator. For example, the live event may comprise a local college basketball game. Video broadcasting of the basketball game may include vocal commentary relating to the game, but viewers may desire commentary relating to the players, such as the status of players, how a player’s skill level has improved, a player’s background, and referee’s decisions.
With added reference to Figure 3, methods may further include transferring the streaming audio data via an Internet to the viewer (at step 304). An audio presentation timing to the commentator may differ from an audio presentation timing to the viewer. As the commentator transfers the audio data, inherent delays in presentation to the viewer may occur based on a variety of factors, such as location, internet speed, audio data streaming source and the internet servers. For example, the viewer may have a slow Internet communication speed, which may thus prevent the audio data from being presented to the viewer at the same time that the commentator is transferring the audio data. Methods may also include providing the viewer with synchronization means to synchronize the streaming video data with the streaming audio data (at step 306).
In some embodiments, the synchronization means may comprise at least one of electronic circuitry that may delay an introduction of the streaming audio data received by the viewer, a software delay that may delay the introduction of the streaming audio data received by the viewer, and a video controlled buffer that may delay an introduction of the streaming video data to the viewer. The video controlled buffer may include electronic memory that may be connected between a television data channel that provides the streaming video data and a commercial television unit.
Referring now to Figure 4, methods of generating and presenting digital streaming data to a viewer may include connecting the video controlled buffer to the television data channel and the commercial television unit (at step 400). The video controlled buffer may provide the viewer with the capability to delay the streaming video data to the commercial television unit.
Methods may also include receiving streaming video data by the television data channel (at step 402). As well, methods may include receiving audio data via an Internet (at step 404). The viewer may receive the streaming video data and the audio data. However, the timing of the video data and the timing of the audio data may differ. As such, methods of utilizing the video controlled buffer may further include determining that the video data arrives before the audio data (at step 406). In response, methods may include delaying the video data by stopping and starting the flow of the streaming video data (at step 408). The video controlled buffer may receive streaming video data from the commercial television unit and provide delayed streaming data to a monitor of the television. The video controlled buffer may comprise a memory buffer in which a viewer may store the streaming video data and release the data to the television monitor after a specific delay time. The video data may be temporarily stored in a storage device (e.g., RAM, disk drive) until the video data and the audio data are substantially synchronized. The viewer may selectively stop and restart the flow of the streaming video data at different times until the streaming video data and the streaming audio data are substantially synchronized.
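The effect of stopping and restarting the flow can be illustrated with a small bookkeeping sketch: each interval during which the flow is stopped adds its length to the total video delay. The class name `StopStartDelay` and the use of caller-supplied timestamps in seconds are hypothetical and serve only to show the arithmetic.

```python
class StopStartDelay:
    """Hypothetical tracker of the delay accumulated by stop/restart actions."""

    def __init__(self) -> None:
        self.total_delay_s = 0.0
        self._stopped_at = None  # time at which the flow was last stopped

    def stop(self, now: float) -> None:
        """Stop the flow; ignored if the flow is already stopped."""
        if self._stopped_at is None:
            self._stopped_at = now

    def restart(self, now: float) -> None:
        """Restart the flow and add the stopped interval to the total delay."""
        if self._stopped_at is not None:
            self.total_delay_s += now - self._stopped_at
            self._stopped_at = None


control = StopStartDelay()
control.stop(10.0)
control.restart(14.0)   # first pause adds 4.0 seconds
control.stop(20.0)
control.restart(21.5)   # second pause adds 1.5 seconds
```

Since each stop can only add delay, this form of control moves in one direction: an overshoot cannot be undone by further stopping.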
With reference to Figure 5, methods may include downloading or executing a software audio buffering program (at step 500). The software buffering program may be downloaded to a remote computing device of the viewer. The remote computing device may include at least one of a laptop computer, desktop computer, tablet, smart phone, and the like.
In some embodiments, methods may include logging in to a website on the remote computing device (at step 502). At least one viewer and the commentator may log in to the website. The commentator may generate the audio data and transfer the audio data via the Internet. The transfer of the audio data from the commentator to the viewer may be carried out via the website. In some embodiments, the website may enable the viewer to communicate with the commentator. Communication may be achieved by providing text message functionality via the Internet. Methods may also include receiving the audio data on the remote computing device (at step 504). The audio data may be presented to the viewer, via the website to which the viewer is logged in, on the speaker of the remote computing device.
In some embodiments, methods may include receiving, by the remote computing device, the video data from the television channel (at step 506). The software buffering program of the remote computing device may receive the streaming video data from the television channel. The software buffering program may provide a time delay that may be controlled by the viewer. As such, methods may include determining that the audio data precedes the video data (at step 508).
With added reference to Figure 5, methods may include delaying the presentation of the audio data (at step 510). The viewer may interactively delay the presentation of the streaming audio data. Methods may further include adding increments of time to the audio data until the audio data and the video data are synchronized (at step 512). Finally, methods may include presenting the video data and the delayed audio data (at step 514). The video data and the delayed audio data may be presented to the viewer by the remote computing device. In some embodiments, the video data may be presented by the commercial television unit and the delayed audio data may be presented by the remote computing device. It should be appreciated that the video data and the delayed audio data may be presented to the viewer via different devices or a single device.
As shown in Figure 6, methods may include downloading the software buffering program (at step 600). The video controlled buffer may comprise the software buffering program. The software buffering program may be downloaded to the remote computing device of the viewer.
In some embodiments, methods may include logging in to the website on the remote computing device (at step 602). At least one viewer and the commentator may log in to the website. The audio data may be transferred from the commentator to the viewer via the Internet, and may be carried out by the website. As such, methods may include receiving the audio data on the remote computing device (at step 604). The viewer may receive the audio data by presenting the audio data via the website on a speaker linked to the remote computing device.
In some embodiments, methods may include receiving, by the remote computing device, the video data from the television channel (at step 606). The software buffering program may receive the streaming video data from the television channel. The software buffering program may provide a time delay controlled by the viewer. Methods may thus also include determining that the video data precedes the audio data (at step 608).
With continued reference to Figure 6, methods may include delaying the presentation of the video data (at step 610). The viewer may interactively delay the presentation of the streaming video data. Methods may further include adding increments of time to the video data until the video data and the audio data are synchronized (at step 612). Finally, methods may include presenting the audio data and the delayed video data (at step 614). The audio data and the delayed video data may be presented to the viewer by the remote computing device. In some embodiments, the delayed video data may be presented by the commercial television unit and the audio data may be presented by the remote computing device. It should be appreciated that the delayed video data and the audio data may be presented to the viewer via different devices or a single device.
Referring now to Figure 7, methods may include receiving the audio data and the video data (at step 700). The audio data may be received by the viewer via the website. Methods may also include determining that the video data precedes the audio data (at step 702).
In some embodiments, methods may include delaying the video data (at step 704). The viewer may delay the video data by applying a first step of delaying the streaming video data. The viewer may delay the streaming video data until the video data is delayed sufficiently to cause the streaming audio data to precede the streaming video data. Methods may thus include determining that the audio data precedes the video data (at step 706). Methods may further include fine tuning a delay in the audio data to synchronize the audio data and the video data (at step 708). The audio data may thus be delayed in fine increments until the desired synchronization is achieved.
TEXT MESSAGING EMBODIMENTS
This disclosure includes a method of synchronizing video data depicted from a live event, with audio data generated by a commentator. The commentator and a plurality of viewers may watch the event in real time on displays capable of streaming video and audio data. In some embodiments, the commentator and the plurality of viewers receive the data on commercial television units, via commercial television channels. It should be appreciated that the data may be received and watched by the commentator and plurality of viewers via any system capable of streaming video and audio data, such as a streaming application.
The commentator may generate audio data that is transferred via the Internet to each one of the viewers. Each one of the viewers may have the ability to synchronize the video data with the audio data. In addition, the commentator and the plurality of viewers may communicate among themselves by generating and receiving text messages, via any type of computing device, such as a smart phone, tablet, personal computer, any type of wearable device, any type of device capable of sending and receiving data, and the like.
During live events, such as a sporting event, viewers may communicate with each other by sending text messages via commercial carriers and services such as SMS messaging, WhatsApp, Twitter, and the like. Unfortunately, because of intrinsic latencies associated with streaming video data (such as via commercial television units or streaming applications), a viewer may get a text message announcing a touchdown before seeing it on his or her TV. Accordingly, the disclosure described herein may reduce or eliminate “surprise” or “spoiler” text messages.
Accordingly, as shown in Figure 8, in some embodiments, the method includes enabling the first viewer and the second viewer to communicate with the commentator and with each other during an occurrence of the live event by providing text messaging functionality (via the Internet, WiFi, 3G, 4G, 5G, LTE, Bluetooth, Bluetooth low energy (“BLE”), Z-Wave, NFC, RFID, SigFox, DigiMesh, MiWi, Weightless, Thread, ZigBee, and the like) between at least two of the first viewer, the second viewer and the commentator (at step 800). In some embodiments, the method includes facilitating transmission and reception of a text message to at least two of the first viewer, the second viewer, and the commentator (at step 802). In some embodiments, methods may include synchronizing video data of a televised event of each of the first viewer and the second viewer with an audio data of the commentator (at step 804).
Systems and methods described herein may thereby delay receiving the text message, until at least one of the audio data and video data has been received, to thereby prevent spoilers from occurring. Accordingly, methods may include determining an individual time difference between a preceding audio presentation and the related video presentation to any of the first viewer and the second viewer whose received audio data precedes the related video data (at step 806).
Additionally, in some embodiments, the method includes delaying the reception of the text message to at least one of the first viewer, the second viewer, and the commentator whose received audio data precedes the related video data (at step 808). In some embodiments, receiving the text message is delayed by an amount of time substantially equal to the individual time difference (at step 810).
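A per-viewer message delay of this kind can be sketched as a small holding queue in which each incoming text message becomes deliverable only after the viewer's individual time difference has elapsed. The class name `DelayedInbox`, its methods, and the timestamps in seconds are hypothetical names chosen for the example; the disclosure does not specify a data structure.

```python
import heapq


class DelayedInbox:
    """Hypothetical inbox that holds each message for a viewer-specific time."""

    def __init__(self, time_difference_s: float) -> None:
        self.time_difference_s = time_difference_s
        self._heap = []  # entries are (release_time, sequence_number, text)
        self._seq = 0    # tie-breaker that keeps arrival order stable

    def receive(self, arrival_time: float, text: str) -> None:
        """Accept a message and schedule it for delayed delivery."""
        release_time = arrival_time + self.time_difference_s
        heapq.heappush(self._heap, (release_time, self._seq, text))
        self._seq += 1

    def deliver(self, now: float) -> list:
        """Return every message whose release time has been reached by `now`."""
        out = []
        while self._heap and self._heap[0][0] <= now:
            out.append(heapq.heappop(self._heap)[2])
        return out


inbox = DelayedInbox(time_difference_s=8.0)  # this viewer's video lags by 8 s
inbox.receive(100.0, "Touchdown!")
too_early = inbox.deliver(105.0)  # still held back
on_time = inbox.deliver(108.0)    # released once 8 seconds have passed
```

Each viewer would hold a separate inbox with its own time difference, so the same message can reach different viewers at different moments.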
Figure 9 illustrates the concept of a buffer control unit that can be used to determine the delay time for either video streaming data or audio streaming data. The buffer control unit may be implemented by either a hardware circuit component (a linear or rotational potentiometer) or a software module. The buffer control unit is characterized by a linear (or rotational) slider 900 with a linear track 902 in which a knob 901 can be moved horizontally. In hardware embodiments, the knob 901 may be moved by the operator’s fingers. In software embodiments, the knob 901 may be moved with a mouse by clicking on the knob and dragging it horizontally. As shown in Figure 9, the left side corresponds to zero delay, and the rightmost side corresponds to 60 seconds of delay. Unlike the “stop and release” delay control, this configuration may allow overshooting of the delay time followed by a correction made by moving the knob backwards. As such, the user does not have to precisely measure the delay time, since the user can interactively and precisely adjust the knob location until synchronization is accomplished.
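For a software embodiment, the mapping from knob position to delay time can be sketched as a linear scale from zero at the left end of the track to 60 seconds at the right end. The function name, the pixel units, and the clamping of out-of-range positions are assumptions made for the example.

```python
MAX_DELAY_S = 60.0  # delay at the rightmost end of the track, per Figure 9


def slider_to_delay(position: float, track_length: float) -> float:
    """Map a knob position (0 = left end) to a delay between 0 and 60 seconds."""
    fraction = min(max(position / track_length, 0.0), 1.0)  # clamp to the track
    return fraction * MAX_DELAY_S


left = slider_to_delay(0, 300)       # knob at the left end of a 300-pixel track
middle = slider_to_delay(150, 300)   # knob at the midpoint
beyond = slider_to_delay(400, 300)   # a drag past the right end is clamped
```

Because the mapping works in both directions of knob travel, an overshoot is corrected simply by dragging the knob back, which lowers the returned delay.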
INTERPRETATION
None of the steps described herein is essential or indispensable. Any of the steps can be adjusted or modified. Other or additional steps can be used. Any portion of any of the steps, processes, structures, and/or devices disclosed or illustrated in one embodiment, flowchart, or example in this specification can be combined or used with or instead of any other portion of any of the steps, processes, structures, and/or devices disclosed or illustrated in a different embodiment, flowchart, or example. The embodiments and examples provided herein are not intended to be discrete and separate from each other.
The section headings and subheadings provided herein are nonlimiting. The section headings and subheadings do not represent or limit the full scope of the embodiments described in the sections to which the headings and subheadings pertain. For example, a section titled “Topic 1” may include embodiments that do not pertain to Topic 1 and embodiments described in other sections may apply to and be combined with embodiments described within the “Topic 1” section.
Some of the devices, systems, embodiments, and processes use computers. Each of the routines, processes, methods, and algorithms described in the preceding sections may be embodied in, and fully or partially automated by, code modules executed by one or more computers, computer processors, or machines configured to execute computer instructions. The code modules may be stored on any type of non-transitory computer-readable storage medium or tangible computer storage device, such as hard drives, solid state memory, flash memory, optical disc, and/or the like. The processes and algorithms may be implemented partially or wholly in application-specific circuitry. The results of the disclosed processes and process steps may be stored, persistently or otherwise, in any type of non-transitory computer storage such as, e.g., volatile or non-volatile storage.
The various features and processes described above may be used independently of one another, or may be combined in various ways. All possible combinations and subcombinations are intended to fall within the scope of this disclosure. In addition, certain method, event, state, or process blocks may be omitted in some implementations. The methods, steps, and processes described herein are also not limited to any particular sequence, and the blocks, steps, or states relating thereto can be performed in other sequences that are appropriate. For example, described tasks or events may be performed in an order other than the order specifically disclosed. Multiple steps may be combined in a single block or state. The example tasks or events may be performed in serial, in parallel, or in some other manner. Tasks or events may be added to or removed from the disclosed example embodiments. The example systems and components described herein may be configured differently than described. For example, elements may be added to, removed from, or rearranged compared to the disclosed example embodiments.
Conditional language used herein, such as, among others, "can," "could," "might," "may," “e.g.,” and the like, unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without author input or prompting, whether these features, elements and/or steps are included or are to be performed in any particular embodiment. The terms “comprising,” “including,” “having,” and the like are synonymous and are used inclusively, in an open-ended fashion, and do not exclude additional elements, features, acts, operations and so forth. Also, the term “or” is used in its inclusive sense (and not in its exclusive sense) so that when used, for example, to connect a list of elements, the term “or” means one, some, or all of the elements in the list. Conjunctive language such as the phrase “at least one of X, Y, and Z,” unless specifically stated otherwise, is otherwise understood with the context as used in general to convey that an item, term, etc. may be either X, Y, or Z. Thus, such conjunctive language is not generally intended to imply that certain embodiments require at least one of X, at least one of Y, and at least one of Z to each be present.
The term “and/or” means that “and” applies to some embodiments and “or” applies to some embodiments. Thus, A, B, and/or C can be replaced with A, B, and C written in one sentence and A, B, or C written in another sentence. A, B, and/or C means that some embodiments can include A and B, some embodiments can include A and C, some embodiments can include B and C, some embodiments can only include A, some embodiments can include only B, some embodiments can include only C, and some embodiments include A, B, and C. The term “and/or” is used to avoid unnecessary redundancy.
While certain example embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions disclosed herein. Thus, nothing in the foregoing description is intended to imply that any particular feature, characteristic, step, module, or block is necessary or indispensable. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions, and changes in the form of the methods and systems described herein may be made without departing from the spirit of the inventions disclosed herein.

Claims

WHAT IS CLAIMED IS:
1. A method of generating and presenting non-processed streaming audio data and non-processed streaming video data to a first viewer and a second viewer, the non-processed streaming video data comprising video data depicted from a live event, the non-processed streaming audio data comprising audio data related to the live event and generated by a commentator, the method comprising: presenting, via a commercial television display unit, the non-processed streaming video data to the commentator, the first viewer, and the second viewer, wherein a video presentation timing to the commentator may differ from the video presentation timing to at least one of the first viewer and the second viewer, and the video presentation timing to the first viewer may differ from the video presentation timing to the second viewer; generating the non-processed streaming audio data related to the live event by the commentator; transferring the non-processed streaming audio data via an Internet to the first viewer and the second viewer; and providing the first viewer and the second viewer with a capability to synchronize a video presentation time of the non-processed streaming video data with an audio presentation time of the non-processed streaming audio data, wherein the first viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a first time, and the second viewer can directly synchronize the non-processed streaming video data and the non-processed streaming audio data via a second time that is different from the first time.
2. The method of Claim 1, wherein the commercial television display unit is capable of displaying an image based upon at least one of: a wireless electromagnetic video signal, a video signal transmitted via a cable, a video signal received via a satellite dish, and video data received via a streaming internet.
3. The method of Claim 1, wherein the commercial television display unit comprises a personal computer display capable of displaying an image based upon streaming data, a tablet display capable of displaying an image based upon streaming data, and a smart phone display capable of displaying an image based upon streaming data.
4. The method of Claim 1, wherein the capability to synchronize comprises at least one of electronic circuitry that delays an introduction of the non-processed streaming audio data received by the first viewer and the second viewer, a software delay that delays the introduction of the non-processed streaming audio data received by the first viewer and the second viewer, and a video controlled buffer that delays an introduction of the non-processed streaming video data to the first viewer and the second viewer.
5. The method of Claim 4, wherein the video controlled buffer comprises electronic memory connected between a commercial television channel that provides the non-processed streaming video data and the commercial television display unit, thereby providing the at least one of the first viewer and the second viewer with a capability to interactively delay the non-processed streaming video data to the commercial television display unit, by selectively stopping and restarting a flow of the non-processed streaming video data at different times until the non-processed streaming video data and the non-processed streaming audio data are substantially synchronized.
6. The method of Claim 1, wherein transferring the non-processed streaming audio data via the Internet is carried out via at least one web site to which both the commentator and the at least one of the first viewer and the second viewer are logged in.
7. The method of Claim 1, wherein the non-processed streaming audio data is received by at least one of the commentator, the first viewer and the second viewer by at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
8. The method of Claim 4, wherein the video controlled buffer comprises a software buffering program embedded in a processor that receives the non-processed streaming video data from a television channel, provides a time delay controlled by the at least one of the first viewer and the second viewer, and presents delayed video data to a display unit linked to the processor.
9. The method of Claim 8, wherein the processor is also used to delay the audio data.
10. The method of Claim 9, wherein the processor is at least one of a mainframe computer, a desktop personal computer, a laptop personal computer, a tablet, and a smart phone.
11. The method of Claim 4, wherein the non-processed streaming audio data received by the at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by the at least one of the first viewer and the second viewer by an audio delay time, and the at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
12. The method of Claim 4, wherein the non-processed streaming video data received by the at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by the at least one of the first viewer and the second viewer, and the at least one of the first viewer and the second viewer interactively increases a time delay of a video presentation of the non-processed streaming video data until synchronization is achieved.
13. The method of Claim 4, wherein the non-processed streaming video data received by the at least one of the first viewer and the second viewer precedes the non-processed streaming audio data received by the at least one of the first viewer and the second viewer, and the at least one of the first viewer and the second viewer applies a first step in which the non-processed streaming video data is delayed sufficiently to cause the non-processed streaming audio data to precede the non-processed streaming video data.
14. The method of Claim 13, wherein the non-processed streaming audio data received by the at least one of the first viewer and the second viewer precedes the non-processed streaming video data received by the at least one of the first viewer and the second viewer, and the at least one of the first viewer and the second viewer interactively increases a time delay of an audio presentation of the non-processed streaming audio data until synchronization is achieved.
15. The method of Claim 1, further comprising enabling the first viewer and the second viewer to communicate with the commentator and with each other during an occurrence of the live event by providing text messaging functionality via the Internet between at least two of the first viewer, the second viewer and the commentator.
16. The method of Claim 11, further comprising facilitating transmission and reception of a text message to at least two of the first viewer, the second viewer, and the commentator.
17. The method of Claim 16, further comprising delaying the reception of the text message by an amount of time substantially equal to the audio delay time.
18. The method of Claim 11, further comprising adjusting the time delay, via a buffer control unit receiving input from a user, wherein the buffer control unit is capable of continuously incrementing and decrementing the time delay to any desired value until synchronization is achieved.
19. The method of Claim 12, further comprising adjusting the time delay, via a buffer control unit receiving input from a user, wherein the buffer control unit is capable of continuously incrementing and decrementing the time delay to any desired value until synchronization is achieved.
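By way of non-limiting illustration only, the user-adjustable delay recited in Claims 12 to 14, 18 and 19 might be sketched as a small buffer that holds each received item until a configurable delay has elapsed. The class name `DelayBuffer`, its method names, the millisecond time base and the 50 ms step size are all hypothetical and are not taken from the specification; the sketch makes no assumption about how the claimed buffer control unit is actually implemented.

```python
from collections import deque


class DelayBuffer:
    """Hypothetical buffer that releases each item `delay_ms` after it arrived."""

    def __init__(self, delay_ms: int = 0, step_ms: int = 50) -> None:
        self.delay_ms = delay_ms      # current presentation delay
        self.step_ms = step_ms        # assumed change per user input
        self._queue = deque()         # (arrival_ms, item) pairs, oldest first

    def push(self, arrival_ms: int, item) -> None:
        """Store an item together with the time it was received."""
        self._queue.append((arrival_ms, item))

    def increment(self) -> None:
        """User input: present this stream later."""
        self.delay_ms += self.step_ms

    def decrement(self) -> None:
        """User input: present this stream earlier, never below zero delay."""
        self.delay_ms = max(0, self.delay_ms - self.step_ms)

    def pop_ready(self, now_ms: int) -> list:
        """Return, in arrival order, every item whose delay has elapsed."""
        ready = []
        while self._queue and self._queue[0][0] + self.delay_ms <= now_ms:
            ready.append(self._queue.popleft()[1])
        return ready


# Example: video arrives 200 ms ahead of audio, so the viewer raises the
# video delay in 50 ms steps until the two presentations line up.
video = DelayBuffer()
video.push(0, "frame-0")
for _ in range(4):
    video.increment()
print(video.delay_ms)          # 200
print(video.pop_ready(199))    # []
print(video.pop_ready(200))    # ['frame-0']
```

Under the same assumptions, the text-message delay of Claim 17 could reuse the buffer by constructing it with the audio delay, for example `DelayBuffer(delay_ms=audio.delay_ms)`, so that messages are released substantially in step with the delayed audio.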
PCT/US2021/031565 2020-06-29 2021-05-10 Processing video and audio streaming data WO2022005618A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/915,741 2020-06-29
US16/915,741 US11310554B2 (en) 2018-08-30 2020-06-29 Processing video and audio streaming data

Publications (1)

Publication Number Publication Date
WO2022005618A1 (en) 2022-01-06

Family

ID=79314912

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2021/031565 WO2022005618A1 (en) 2020-06-29 2021-05-10 Processing video and audio streaming data

Country Status (1)

Country Link
WO (1) WO2022005618A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130169869A1 (en) * 2011-12-29 2013-07-04 Thomson Licensing Method for synchronizing media services
US20150121436A1 (en) * 2013-10-25 2015-04-30 Broadcom Corporation Presentation timeline synchronization across audio-video (av) streams
US20200059687A1 (en) * 2018-08-17 2020-02-20 Kiswe Mobile Inc. Live streaming with multiple remote commentators
US20200077128A1 (en) * 2018-08-30 2020-03-05 Gideon Eden Digital streaming data systems and methods
US20200162796A1 (en) * 2017-05-16 2020-05-21 Peter AZUOLAS Systems, apparatus, and methods for scalable low-latency viewing of integrated broadcast commentary and event video streams of live events, and synchronization of event information with viewed streams via multiple internet channels

Similar Documents

Publication Publication Date Title
US11871088B2 (en) Systems, apparatus, and methods for providing event video streams and synchronized event information via multiple Internet channels
US11368732B2 (en) Synchronizing program presentation
US9167278B2 (en) Method and system for automatic content recognition (ACR) based broadcast synchronization
US9876944B2 (en) Apparatus, systems and methods for user controlled synchronization of presented video and audio streams
US11729450B2 (en) Systems and methods for delivery of content via multicast and unicast
US20150271546A1 (en) Synchronized provision of social media content with time-delayed video program events
US10887646B2 (en) Live streaming with multiple remote commentators
JP6809174B2 (en) Synchronization devices, methods, programs and systems
CN112752109B (en) Video playing control method and system
US10750208B2 (en) Processing video and audio streaming data
US20220377394A1 (en) Systems and methods for optimizing a set-top box to retrieve missed content
US11310554B2 (en) Processing video and audio streaming data
GB2563267A (en) Methods and systems for generating a reaction video
WO2022005618A1 (en) Processing video and audio streaming data
US10637904B2 (en) Multimedia streaming service presentation method, related apparatus, and related system
US20220394328A1 (en) Consolidated Watch Parties
US20200077128A1 (en) Digital streaming data systems and methods
KR102051985B1 (en) Synchronization of Media Rendering in Heterogeneous Networking Environments
US11856242B1 (en) Synchronization of content during live video stream

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21833552

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 25/04/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21833552

Country of ref document: EP

Kind code of ref document: A1