KR20140138784A - Improved dash client and receiver with playback rate selection

Info

Publication number: KR20140138784A
Application number: KR1020147027003A
Authority: KR
Inventors: Qiang Gao; Michael George Luby; Yinian Mao; Lorenz Christoph Minder; Kevin Roland Fall
Original assignee: Qualcomm Incorporated
Priority date: 2012-02-27
Filing date: 2013-02-26
Publication date: 2014-12-04
Also published as: EP2820823A1; EP2820819A1; WO2013130475A1; JP2015513840A; US20130227080A1; JP2015511784A; CN104205769B; JP6271445B2; US9386058B2; KR20140130210A; KR20140130211A; EP2820823B1; US20130227122A1; US20130227081A1; KR101699870B1; WO2013130477A1; US9503490B2; CN104205772B; EP2820819B1; US9450997B2

Abstract

A client device presents streaming media and includes a stream manager, a request accelerator, and a source component that is coupled to the stream manager and the request accelerator and determines which requests to make. A rate selection process can make rate decisions so that the buffer is filled when it is low, erratic rate changes are avoided, and the correct steady rate is chosen quickly. Multimedia download strategies can be used for HTTP that allow accurate rate estimates, achieve the link capacity even when network delays and packet loss rates are high, achieve timely delivery of the stream, and achieve relatively steady download rates with little short-term variability. A receiver can use multiple HTTP connections, decompose media requests into smaller chunk requests, synchronize the connections using TCP flow control mechanisms, and request data in bursts. The receiver can also use an HTTP pipelining process to keep the connections busy.

Description

IMPROVED DASH CLIENT AND RECEIVER WITH PLAYBACK RATE SELECTION

Cross-References to Related Applications

This application claims priority to U.S. Provisional Application No. 61/603,569, filed February 27, 2012 and entitled "Improved DASH Client and Receiver with Rate Adaptation and Downloading for Adaptive Video", the contents of which are incorporated herein by reference in their entirety.

DASH refers to "Dynamic Adaptive Streaming over HTTP". Using DASH, a content provider formats content into segments, fragments, representations, adaptations, and the like, together with associated metadata such as MPD files, and stores all of them as files available through a standard HTTP server or a specialized HTTP server. A DASH client is a receiver that obtains the files it needs in order to present the content to a user of the DASH client.

DASH clients face demanding constraints, because users generally want high-quality streaming, without advance notice, in environments where the network is constrained. Improved DASH clients are therefore desirable.

A client device presents streaming media and includes a stream manager for controlling streams, a request accelerator for making network requests for content, a source component coupled to the stream manager and the request accelerator for determining which requests to make, a network connection, and a media player. The request accelerator includes a request data buffer for buffering requests and logic for returning a complete response to each request it is able to answer. The stream manager, the request accelerator, and the source component can be implemented as processor instructions or program code, and the client device can further include program memory, working memory, a processor, and a power source. The client device can also include a display and a user input device. Client tasks can be partitioned among the source component, the stream manager, and the request accelerator so that data is streamed efficiently.

As described herein, in various aspects, the client performs operations such as deciding when to keep a representation and when to switch to another representation, determines which fragments to request, and can ensure that the media player obtains, in most circumstances, enough data to continue the stream without stalling.

A rate selection process can make rate decisions so that (a) the buffer is filled when it is at a low level, (b) the buffer is used to avoid erratic rate changes even when low download rate estimates are observed, and (c) in a steady-rate scenario, the correct steady rate is chosen quickly. Multimedia download strategies are used for HTTP that (a) allow accurate rate estimates, (b) achieve the link capacity even when network delays and packet loss rates are high, (c) achieve timely delivery of the stream, and (d) achieve relatively steady download rates with little short-term variability. To accomplish this, a receiver can use multiple HTTP connections, decompose media requests into smaller chunk requests, synchronize the connections using TCP flow control mechanisms, and request data in bursts. The receiver can also use an HTTP pipelining process to keep the connections busy.

In a receiver that receives media to be played out using a presentation element of the receiver, where playout causes media to be consumed from a presentation buffer at a playback rate and the receiver is configured to select from a plurality of playback rates, a method for selecting the playback rate includes the receiver monitoring the presentation buffer. The presentation buffer stores media data at least between the time the media data is received and the time it is consumed by the presentation element associated with the receiver. The receiver also stores an indication of a buffer level (the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element), determines an estimated download rate, uses the stored indication and the estimated download rate to compute a target playback rate, and selects from among the plurality of playback rates according to the target playback rate.

The selected playback rate is a playback rate that is less than or equal to a predetermined multiplier of the estimated download rate, where the predetermined multiplier is an increasing function of the buffer level. The predetermined multiplier can be an affine linear function of the playback duration of the media data in the presentation buffer, and/or can be less than one when the buffer level of the presentation buffer is below a threshold amount, and/or can be greater than or equal to one when the presentation duration of the media data in the presentation buffer is greater than or equal to a preset maximum amount of presentation time. The predetermined multiplier can be a piecewise linear function of the playback duration of the media data in the presentation buffer. The selected playback rate can be a playback rate that is less than or equal to a predetermined multiplier of the estimated download rate, where the predetermined multiplier is an increasing function of the number of bytes of media data in the presentation buffer. The playback rate is the largest available playback rate, among the plurality of playback rates, that is less than or equal to a proportional factor times the estimated download rate, where the proportional factor is an increasing function of the playback duration of the media data in the presentation buffer divided by an estimate of the reaction time to rate changes.
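The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the threshold values (5 s and 30 s), and the multiplier bounds (0.5 and 1.2) are hypothetical and do not come from the specification, and falling back to the lowest rate when no rate fits under the cap is likewise an assumption.

```python
from typing import Sequence


def buffer_multiplier(buffered_s: float,
                      low_s: float = 5.0,      # hypothetical threshold
                      high_s: float = 30.0,    # hypothetical threshold
                      min_mult: float = 0.5,   # hypothetical bound (< 1)
                      max_mult: float = 1.2    # hypothetical bound (>= 1)
                      ) -> float:
    """Piecewise linear, non-decreasing multiplier of buffered playback time."""
    if buffered_s <= low_s:
        return min_mult
    if buffered_s >= high_s:
        return max_mult
    frac = (buffered_s - low_s) / (high_s - low_s)
    return min_mult + frac * (max_mult - min_mult)


def select_rate(available: Sequence[float],
                est_download: float,
                buffered_s: float) -> float:
    """Largest available rate not exceeding multiplier * estimated download rate."""
    cap = buffer_multiplier(buffered_s) * est_download
    candidates = [r for r in available if r <= cap]
    # Assumption: if every rate exceeds the cap, use the lowest available rate.
    return max(candidates) if candidates else min(available)
```

With these illustrative settings, a nearly empty buffer makes the client pick a rate well below the estimated download rate so the buffer can fill, while a full buffer lets it pick a rate slightly above the estimate.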

The estimate of the reaction time can be an upper bound on the presentation time between switch points of the media data, and/or an average of the presentation time between switch points of the media data, and/or can be greater than or equal to a predetermined constant times an estimated round-trip time ("ERTT").

The receiver can also determine an allowed deviation of the buffer level, use the stored indication of the buffer level and the allowed deviation of the buffer level to compute a target playback rate, and select from among the plurality of playback rates according to the target playback rate.

The playback rate can be selected based on a high proportional factor, a low proportional factor, the download rate estimate, the current playback rate, the buffer level, and an estimate of the reaction time to rate changes. The high proportional factor and the low proportional factor can both be increasing functions and/or piecewise linear functions of the playback duration of the media data in the presentation buffer divided by the estimate of the reaction time to rate changes. The high proportional factor can be greater than or equal to the low proportional factor. The playback rate can be the same as the previous playback rate when the previous playback rate lies between the low proportional factor times the estimated download rate and the high proportional factor times the estimated download rate. When the previous playback rate is above the high proportional factor times the estimated download rate, the playback rate can be chosen as the largest available playback rate that is no greater than the high proportional factor times the estimated download rate, and/or when the previous playback rate is below the low proportional factor times the estimated download rate, the playback rate can be chosen as the largest available playback rate that is no greater than the low proportional factor times the estimated download rate.
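The two-factor rule above keeps the previous rate while it stays inside a band around the estimated download rate and re-selects only when it falls outside. A minimal Python sketch follows; the function names are hypothetical, the two factors are taken as plain inputs rather than being computed from the buffer level and reaction time, and the fallback to the lowest rate when nothing fits under the cap is an assumption.

```python
from typing import Sequence


def largest_at_most(available: Sequence[float], cap: float) -> float:
    """Largest available rate not exceeding cap."""
    candidates = [r for r in available if r <= cap]
    # Assumption: fall back to the lowest rate if every rate exceeds the cap.
    return max(candidates) if candidates else min(available)


def select_rate_with_hysteresis(available: Sequence[float],
                                previous_rate: float,
                                est_download: float,
                                low_factor: float,
                                high_factor: float) -> float:
    """Keep the previous rate inside the band; otherwise re-select."""
    if low_factor > high_factor:
        raise ValueError("low_factor must not exceed high_factor")
    low_cap = low_factor * est_download
    high_cap = high_factor * est_download
    if previous_rate > high_cap:
        # Previous rate is too high for current conditions: step down.
        return largest_at_most(available, high_cap)
    if previous_rate < low_cap:
        # Previous rate is comfortably low: step up, bounded by the low cap.
        return largest_at_most(available, low_cap)
    return previous_rate
```

The band between the two caps is what suppresses erratic switching: small fluctuations in the download rate estimate leave the chosen rate unchanged.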

In one embodiment, a receiver receives media to be played out using a presentation element of the receiver and includes: a presentation interface that consumes media data at a playback rate and provides playback at one of a plurality of playback rates; a presentation buffer, coupled to the presentation interface, that stores media data at least between the time the media data is received and the time it is consumed by the presentation element associated with the receiver; storage for variables relating to presentation buffer capacity, including an indication of a buffer level (the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element); an estimated download rate determiner; and logic for arranging requests according to a selected playback rate, which is determined by using the stored indication and the estimated download rate to compute a target playback rate.

Various elements can be implemented using a computer-readable medium for execution by a processor for controlling data download over a network path between a source and a receiver coupled by the network path. The computer-readable medium can be a non-transitory computer-readable medium.

Other aspects of the invention will be apparent from this description.

FIG. 1 shows various elements, including a DASH client, in a DASH deployment, and shows how a media recording reaches the end user through recording, content preparation, and content delivery stages.
FIG. 2 shows an example architecture of a DASH client with different components, including a stream manager, a request accelerator, a source component, a network connection, and a media player.
FIG. 3 is a timing chart of representation switching processes and comprises FIG. 3A for a backward looking process and FIG. 3B for a forward looking process.
FIG. 4 is a timing chart illustrating the representation switching process for the case where switch points are aligned.
FIG. 5 is a plot illustrating rates over time as managed by a rate estimator, in particular an estimator that is adaptive to buffer level (a pker-type rate estimator).
FIG. 6 is a plot illustrating rate increase versus download time (r-time) when a non-adaptive exponential weighted moving average ("EWMA") filter is used.
FIG. 7 is a plot illustrating rate increase versus playback time (p-time) when a non-adaptive EWMA filter is used.
FIG. 8 is a plot illustrating rate increase versus download time (r-time) when a variable window size weighted moving average ("WMA") filter is used.
FIG. 9 is a plot illustrating rate increase versus playback time (p-time) when a pker-type process is used.
FIG. 10 is a plot illustrating rate decrease versus download time when the pker process of Section 2.1 is used.
FIG. 11 illustrates the behavior of the pker process for sudden increases in rate.
FIG. 12 illustrates the behavior of the pker process for sudden drops in rate.
FIG. 13 illustrates a comparison of a simple (fixed-width) moving window average with an exponentially weighted moving average.
FIG. 14 is a flowchart of a pker rate estimation process.
FIG. 15 illustrates, together with FIG. 16, how the values B and T_fast used by the pker process can be determined from the recorded (Tp, Tr) values.
FIG. 16 illustrates aspects of determining the values.
FIG. 17 illustrates the behavior of a "watermark" fetching process.
FIG. 18 illustrates examples of lambda and mu functions that can be used to select a playback rate.
FIG. 19 shows an example choice of (lambda, mu)-functions using a "conservative" setting.
FIG. 20 shows an example choice of (lambda, mu)-functions using a "moderate" setting.
FIG. 21 shows an example choice of (lambda, mu)-functions using an "aggressive" setting.
FIG. 22 shows an example choice of (lambda, mu)-functions using a process for emulating, to some extent, an MLB process.
FIG. 23 illustrates an example of side-by-side values for the lambda settings.
FIG. 24 illustrates an example of side-by-side values for the mu settings.
FIG. 25 illustrates a process for rate estimation, followed by rate-based rate selection, followed by buffer management-based rate selection.
FIG. 26 illustrates a rate drop without request cancellation.
FIG. 27 illustrates a rate drop with request cancellation.
FIG. 28 is a flowchart illustrating an example request cancellation process.
FIG. 29 illustrates a process for request cancellation detection.
FIG. 30 is a plot of the behavior of fetching with multiple TCP connections but without receive buffer tuning.
FIG. 31 is a plot of other behavior of fetching with multiple TCP connections and with receive buffer tuning.
FIG. 32 is a flowchart of an example request accelerator process.
FIG. 33 illustrates a process for finding the number of subrequests to make for a given fragment request.
FIG. 34 illustrates a process for selecting individual requests, chosen to be disjoint intervals of the source requests, having computed sizes.
FIG. 35 shows an example of time offsets and of the fragment structure for a repair segment determined by the time offsets.
FIG. 36 comprises tables of values that can be used for lambda and mu in rate selection.

As illustrated in FIG. 2, the DASH client described herein includes a stream manager (SM), a request accelerator (RA), a source component (SC), a network connection, and a media player. The DASH client may also include one or more media data buffers. In some embodiments, the RA, the SC, and the media player each have their own data buffers, or logical partitions of one large data buffer. In other embodiments, only the RA has a data buffer, which it uses to buffer requests so that it can return a complete response for every request it is able to answer, and the media player uses whatever data buffer the SC sets up. The SM may have its own (physical or logical) local storage for the metadata it needs to make its decisions.

FIG. 1 shows a DASH deployment with a DASH client.

FIG. 2 shows an example architecture of a DASH client with different components. It should be understood that the SM, RA, SC, and media player may be implemented in hardware, software, or some combination of the two. Thus, where a function is attributed to a component, it may be implemented as processor instructions, program code, or the like, in which case the hardware necessary to execute those instructions (program memory, ROM, RAM, processor, power source, connectors, circuit boards, etc.) is implied. Where network functions are described, it should be understood that a network connection exists and may be wired, optical, wireless, etc.; where user interaction is implied, user interface capability (display, keyboard, touchpad, speakers, microphones, etc.) is also implied.

The DASH client maintains two clocks, or their logical equivalents. One clock is a real-time clock circuit, or software, that indicates the local time on the client; the other is the presentation time, which indicates the presentation time of the media content relative to its start. Herein, real-time clock time is referred to as "r-time", and "p-time" denotes presentation time.

Representations are media streams that encode the same content at different bit rates or with other differences. A user therefore typically needs only one representation, but the client may switch from one representation to another as conditions and/or requirements change. For example, when bandwidth is high, the streaming client can choose a high-quality, high-bitrate representation. When bandwidth decreases, the client can adapt by switching to a lower-quality, lower-bitrate representation.

Switch points (or random access points) are samples in a representation from which decoding of the media samples can start without requiring knowledge of the data that precedes them in the stream. For video representations in particular, not every sample is a random access point, because samples (frames) generally depend on earlier frames. A streaming client that wants to switch representations should start decoding the new representation at a switch point to avoid wasted effort. In some cases, switch points are signaled to the streaming client in the segment index (sidx).

A representation group (often abbreviated simply to "group") is a set of representations among which the client can switch. A media presentation can contain more than one representation group. For example, it may have one representation group for video representations at different bit rates and another representation group for audio at different bit rates. In the DASH standard, a representation group is often also called an adaptation set.

A segment is a file that contains media data for at least a portion of one representation. A fragment is a part of a segment for which a mapping is available from the start p-time of the fragment to the byte range of the fragment within the segment. The term "subsegment" is sometimes used instead of "fragment"; the two can be regarded as equivalent. Some media content is not divided into fragments, and in that case "fragments" may refer to the segments themselves.
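The mapping from a fragment's start p-time to its byte range is what lets a client fetch a single fragment with an HTTP byte-range request. The following Python sketch assumes a hypothetical in-memory index, sorted by start p-time; the `Fragment` type and the helper names are illustrative and are not taken from the specification or from any particular file format.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Fragment:
    start_ptime: float  # start presentation time, in seconds
    first_byte: int     # inclusive offset within the segment file
    last_byte: int      # inclusive offset within the segment file


def find_fragment(index: Sequence[Fragment], ptime: float) -> Fragment:
    """Return the fragment whose start p-time is the latest one <= ptime.

    Simplification: a ptime past the end of the media maps to the last fragment.
    """
    starts = [f.start_ptime for f in index]
    pos = bisect_right(starts, ptime) - 1
    if pos < 0:
        raise ValueError("ptime precedes the first fragment")
    return index[pos]


def range_header(fragment: Fragment) -> str:
    """Value for an HTTP Range request header covering the fragment."""
    return f"bytes={fragment.first_byte}-{fragment.last_byte}"
```

For example, with fragments starting at 0 s, 2 s, and 4 s, a request for p-time 3.1 s resolves to the second fragment, and `range_header` yields the byte range to request for it.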

FIG. 3 is a timing chart illustrating two possible representation switching processes. The switch can be backward looking (first process; FIG. 3A), in which case the switch point in the switch-to representation is found by looking within the stretch of p-time that has already been requested in the switch-from representation and selecting, going backward in p-time, the switch point of the switch-to representation that is closest to the end of that stretch. The second process (FIG. 3B) is forward looking: it finds the next switch point, going forward in p-time, in the switch-to representation, starting from the last requested p-time in the switch-from representation.
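The two searches can be sketched as follows, assuming the switch points of the switch-to representation are given as a sorted list of p-times and that the end of the already-requested stretch in the switch-from representation is known. The function names and the use of `None` for "no such switch point" are illustrative choices, not part of the specification.

```python
from bisect import bisect_left, bisect_right
from typing import Optional, Sequence


def backward_switch_point(switch_points: Sequence[float],
                          last_requested_ptime: float) -> Optional[float]:
    """Latest switch point at or before the end of the requested stretch."""
    pos = bisect_right(switch_points, last_requested_ptime) - 1
    return switch_points[pos] if pos >= 0 else None


def forward_switch_point(switch_points: Sequence[float],
                         last_requested_ptime: float) -> Optional[float]:
    """Earliest switch point at or after the end of the requested stretch."""
    pos = bisect_left(switch_points, last_requested_ptime)
    return switch_points[pos] if pos < len(switch_points) else None
```

With switch points at 0, 4, 8, and 12 seconds and data requested up to 10 seconds, the backward search returns 8 (two seconds of overlapping media are downloaded again in the new representation) and the forward search returns 12 (two more seconds are fetched from the old representation first). When the requested stretch ends exactly on a switch point, both functions return that point.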

FIG. 4 is a timing chart illustrating the switching processes when the switch points are aligned and a switch point immediately follows the last requested fragment. Both processes behave identically in that setting, so the diagram depicts the behavior of both the backward and the forward method. Thus, when the switch points are aligned, neither process needs to download overlapping data.

Presentation time is the period of time over which the media is expected to play out or play back, typically at normal speed. For example, a 30-minute video presentation plays for 30 minutes. The user may fast-forward or rewind, which changes the actual time taken, but it should be understood that the presentation is still a 30-minute video presentation. A presentation element presents the presentation to the user over the presentation time. Examples of presentation elements include a visual display and an audio display, or a video/audio stream sent to a device capable of presenting it. "Playback" is the term used to describe consumption of the media. For example, a smartphone downloads or otherwise obtains media data that represents a presentation over the presentation time (p-time) of the presentation and buffers it, and a media player is said to "consume" that media, preferably in such a way that the buffer does not empty completely at least until the end of the presentation time, so that the user does not experience a stall in the presentation while the receiver waits to obtain more data. Of course, "playback" or "playout" does not imply that the media is played more than once. In many cases, once media has been consumed, it is never used again.

A presentation buffer is a memory element accessible to the receiver, the media player, or both. For simplicity of explanation, the terms "presentation buffer", "buffer", "media buffer", and "playback buffer" are used interchangeably, with the understanding that each denotes a logical buffer that holds data, typically media data, that has been downloaded but not yet played out or consumed. The data making up the presentation buffer may be partitioned among different components within the device; that is, some portions of the downloaded data may be held by a first process, for example a receiving process in the device, while other portions may already have been passed to another process, for example a playout process in the device. At least some of the data making up the presentation buffer may also be at least partially duplicated across different buffers or different processes. In some cases, not all of the data that has been downloaded but not yet played out is considered to be in the presentation buffer; for example, in some cases media content is no longer considered to be in the presentation buffer once it has been handed to the media player. In general, the amount of media data that has been downloaded and not yet played out, yet is not considered to be in the presentation buffer, is very small, if there is any at all.

A presentation buffer accommodates unevenness in receiving and playing back media, storing received media data until it is consumed. After the media data is consumed, it may be deleted or may remain stored, depending on the configuration. In some implementations, the size of the presentation buffer (which may be measured by the number of bytes of data that can be stored in it) may vary over time. For example, the presentation buffer may be dynamically allocated from shared memory as needed.

In many of the examples described in detail herein, the presentation buffer can be assumed to be characterized by a size. For a fixed memory dedicated to the presentation buffer, the size may be measured by the number of bytes that can be stored in the available memory. Where the presentation buffer is dynamically allocated, the "size" attributed to the presentation buffer may be the number of bytes currently allocated to it, the maximum number of bytes that could be allocated to it, or some other suitable measure. The presentation buffer size is also sometimes measured as the presentation-time playout duration of the media currently available in the presentation buffer.

The presentation buffer also has another characteristic, its "level" or "fill level". The level of a presentation buffer indicates how much unconsumed media data is present in the presentation buffer, measured for example in bytes or in presentation time duration. The level is expected to rise as media data is received and to fall as it is consumed. The level may be purely logical: for example, a presentation buffer might be continually full of media data, with some of it, such as already-consumed media data, marked to be overwritten as new media data is received. Some receivers may be programmed so that an "empty buffer" is the state in which there is zero unconsumed media data and a "full buffer" is the state in which 100% of the presentation buffer is filled with unconsumed media data. Other receivers may have different bounds, so that the level varies over a range narrower than 0% to 100% of the presentation buffer size. Where shared memory is used and only the memory holding unconsumed media data is allocated to the presentation buffer, the presentation buffer is by definition always full, so it may not be meaningful to use the size of the dynamically allocated memory as the denominator when expressing the level as a ratio. Instead, the level may be measured as the ratio of the amount of unconsumed media data in the presentation buffer to the maximum size allowed for the presentation buffer.
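The ratio-based level described above can be illustrated with a minimal sketch. The class name `PresentationBuffer` and its fields are hypothetical and chosen only for illustration; the sketch assumes a fixed maximum allowed size measured in bytes.

```python
from dataclasses import dataclass


@dataclass
class PresentationBuffer:
    """Hypothetical logical buffer of downloaded, not yet consumed media data."""

    max_bytes: int             # maximum size allowed for the buffer (assumed fixed)
    unconsumed_bytes: int = 0  # downloaded but not yet played out

    def receive(self, n: int) -> None:
        # The level rises as media data is received, capped at the maximum size.
        self.unconsumed_bytes = min(self.max_bytes, self.unconsumed_bytes + n)

    def consume(self, n: int) -> None:
        # The level falls as media data is consumed, never below zero.
        self.unconsumed_bytes = max(0, self.unconsumed_bytes - n)

    def level(self) -> float:
        # Unconsumed data divided by the maximum allowed size, not by the
        # currently allocated memory, which would always read as "full".
        return self.unconsumed_bytes / self.max_bytes
```

A level measured in presentation time rather than bytes could be sketched the same way by tracking seconds of buffered media instead of byte counts.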

1. Overview of Client Components

Referring again to Figs. 1-2, various components of an exemplary client are shown.

The SC keeps track of metadata, such as information about which representations are available and what their fragments are. The SC is also responsible for buffering media data received over the network and for handing it off to the media player. The SM is responsible for deciding which representations should be downloaded at which points in time, and for making rate switch decisions. Finally, the RA is responsible for downloading media fragments, given exact URL and byte-range information as provided by the SC.

The SM is the software component responsible for rate switching decisions. One of the SM's goals is to pick the best content for the given situation. For example, if plenty of bandwidth is available, high download rates can be achieved, and so the SM should pick a high-rate representation. If the download rate drops significantly, the selected high-rate representation might no longer be sustainable, and so the SM should switch to a lower-rate representation better suited to the conditions. The SM should switch rates fast enough to avoid draining the playback buffer completely (since that would cause a playback stall), but at the same time should avoid switching too hastily or too often. Moreover, it should aim to request the highest-quality content that can be downloaded over the network and played back without stalling. The SM can be extended to take factors other than download rate into account in its decision process. It can potentially consider things such as battery life, display size, and other factors when making representation decisions. Such additional constraints can be added as filters to the SM and do not affect the basic rate decision computation described herein.

A typical, high-level operation of the client is now described. Suppose a user requests particular media content, such as a live sports broadcast, a pre-recorded movie, an audio stream, or other audio-visual or other content, possibly involving media types other than video and audio. The client supplies that request to the SM, perhaps through a user interface or a computer interface. The SM requests from the SC, and receives, indications of which representations are available, which p-time spans are covered by which fragments, and where the switch points in the representations are located. In addition, the SM might have at its disposal some information about the short-term download rate; as explained below, the RA reports this data to the SC, and the SC reports or provides it to the SM.

The SM uses that information, together with past history, to estimate a sustainable rate and to choose an appropriate switch point within a representation and the amount of media content to download from that representation starting at that switch point. As downloads proceed and media content is played back, the SM uses the information supplied to it to decide whether or not a rate switch is in order. If a rate switch is not in order, the SM tells the SC to continue fetching fragments from the current representation. If a rate switch is in order, the SM looks at the potential switch points and decides which fragments need to be fetched from which representations to make the desired switch. The SM then hands that information to the SC. This exchange between the SC and the SM is done periodically, whenever a decision must be made about the next section of video to download. To make good decisions, the SM monitors the buffer level, and the SM may determine that the buffer is sufficiently full and that no fragments need to be downloaded for some period of time.

Once the SM has decided on a fragment to download, the SC is responsible for having the RA actually download the fragment, for keeping the downloaded fragment in the media buffer, and finally for handing the media data in the media buffer to the media player when the time comes to play it out.

The SM is no longer actively involved in the fragments that it has told the SC to download. However, the SM can change its mind and cancel a fragment request it previously issued, even after the download of that fragment has already begun. This capability is useful where the download rate drops drastically and it turns out that the fragment being downloaded is unlikely to be available by the time the media buffer is completely drained. When that happens, the SM should detect it, cancel the request, and switch to a more appropriate rate instead.
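One way to picture the cancellation condition is sketched below. This is an illustrative assumption, not the decision rule of the disclosure: the function `should_cancel` and its parameters are hypothetical, and the sketch simply compares the projected time to finish the in-flight fragment with the playout time remaining in the media buffer.

```python
def should_cancel(remaining_bytes: int,
                  download_rate_bps: float,
                  buffered_seconds: float) -> bool:
    """Return True if the in-flight fragment is unlikely to arrive in time.

    Hypothetical rule: cancel when the projected completion time exceeds
    the playout time still held in the media buffer.
    """
    if download_rate_bps <= 0:
        # No measurable progress: the fragment cannot finish before the
        # buffer drains.
        return True
    seconds_to_finish = remaining_bytes * 8 / download_rate_bps
    return seconds_to_finish > buffered_seconds
```

A practical implementation would probably add a safety margin and use a smoothed rate estimate rather than a single instantaneous measurement.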

Once the SC receives from the SM a fragment handle to fetch, it looks up the URL and byte range of the corresponding fragment in its data structures and uses them to create a request that it hands to the RA. It is also responsible for retrieving the response data from the RA and for transforming the received media fragments into a playable stream. Finally, the SC is responsible for parsing and keeping track of metadata, such as data obtained from the MPD, segment index (sidx) boxes, or, in the case of Apple's HTTP Live Streaming (HLS), playlists.
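The handle-to-request lookup can be sketched as follows. The data structure, the handle format (a representation identifier plus a fragment number), and the function name are all hypothetical; the only externally defined element is the HTTP `Range` header, whose byte positions are inclusive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FragmentInfo:
    """Hypothetical per-fragment metadata kept by the SC."""

    url: str
    first_byte: int
    last_byte: int  # inclusive, as in an HTTP Range header


def build_fragment_request(index: dict, handle: tuple) -> tuple:
    """Map a fragment handle to (url, headers) for the RA to fetch.

    `index` maps hypothetical handles of the form
    (representation_id, fragment_number) to FragmentInfo entries.
    """
    info = index[handle]
    headers = {"Range": f"bytes={info.first_byte}-{info.last_byte}"}
    return info.url, headers
```

In this sketch the SC would populate `index` from parsed metadata such as sidx boxes, and the RA never needs to see anything but the URL and the byte range.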

The RA is the component that takes fragment and metadata requests received from the SC, creates corresponding HTTP requests, sends them out over the network connection, retrieves the corresponding responses, and hands them back to the SC. The network connection might be an Internet connection, a cellular-based connection, a WiFi connection, or another network connection able to handle HTTP requests and responses. The network connection may be internal to a single device; that is, it may be an internal interface to media data already cached within the device. Many combinations are possible: some media content might be downloaded over a wired Internet connection, some over a cellular-based connection, some over a WiFi connection, and some from a local cache. In some cases the connection over which media data is downloaded might be mixed, that is, partly cellular, partly WiFi, partly wired, and so on. The particular requests might in some cases be other than HTTP, but HTTP is preferred where the servers serving the media content are HTTP servers.

In its simplest form, the RA is an HTTP client. However, it may be desirable for the RA to be more efficient than a generic HTTP client. One goal of the RA is to achieve a sufficiently high download rate; it should aim to download significantly faster than the selected playback media rate. On the other hand, it should take care not to sacrifice timeliness for raw throughput: fragments that are due to be played out soon are more urgent than those further out, and the RA should attempt to receive them in time. Therefore, it may be necessary to sacrifice some throughput for timeliness. The RA should be designed to work well in all reasonable network conditions.

The basic design of the RA is to use several connections, and possibly also forward error correction (FEC), to obtain the best results. Thus, the RA will typically need to manage more than one open HTTP connection. The RA dispatches requests onto those connections. In some circumstances, the RA may split a request into a set of smaller requests. On receiving the corresponding responses, the RA reassembles the data into a coherent response. In other words, the RA is responsible for determining the granularity of the HTTP requests to send, which connections to dispatch the requests to, and which portions of source fragments or repair segments to request. The granularity of those requests may depend on a number of things, such as the buffer level, the urgency of the request, and the number of available connections.
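Splitting a fragment request into smaller requests and reassembling the responses might look like the sketch below. The function names and the fixed `chunk_size` are assumptions for illustration; the text above leaves the granularity to depend on buffer level, urgency, and the number of connections, and FEC is not modeled here.

```python
def split_byte_range(first: int, last: int, chunk_size: int) -> list:
    """Split the inclusive byte range [first, last] into chunk sub-ranges."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [(start, min(start + chunk_size - 1, last))
            for start in range(first, last + 1, chunk_size)]


def reassemble(chunks: dict) -> bytes:
    """Join chunk responses, keyed by (first, last), into one coherent response.

    Responses may arrive in any order across connections; sorting the keys
    restores the original byte order.
    """
    return b"".join(chunks[key] for key in sorted(chunks))
```

Each sub-range could then be dispatched on whichever open connection is available, with the reassembled bytes handed back to the SC as a single response.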

Each request sent out by the RA is an HTTP request either for metadata or for part or all of a fragment request that the SC passed to the RA. It may be a request for source media data or for repair data generated from the source media data. The responses to the RA requests generated from an SC fragment request should, in most cases, be sufficient for the RA to reconstruct all of the media data in the fragment request, which the RA can then pass back to the SC. Thus, the RA is responsible for assembling the responses to the RA requests associated with a media fragment request back into a response to the fragment request provided by the SC. Assembly by the RA may include FEC decoding if, for example, some of the RA requests are for FEC repair data.

In addition to handling HTTP requests, the RA measures the download rate over short periods of time, in time slices of some sampling rate. An example sampling rate is 100 ms; that is, the RA measures download rates over 100 ms periods. This data is used by the SM to compute its download rate estimate and, ultimately, to make rate decisions. Other sampling rates are of course possible.
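The short-term measurement can be sketched as binning received byte counts into fixed time slices. The function `slice_rates` and its input format (integer arrival times in milliseconds paired with byte counts) are hypothetical; only the 100 ms example slice length comes from the text.

```python
def slice_rates(arrivals, slice_ms: int = 100) -> list:
    """Compute per-slice download rates in bytes per second.

    `arrivals` is an iterable of (time_ms, num_bytes) pairs, with time_ms an
    integer offset from the start of measurement. Slices in which nothing
    arrived report a rate of zero.
    """
    arrivals = list(arrivals)
    if not arrivals:
        return []
    num_slices = max(t for t, _ in arrivals) // slice_ms + 1
    totals = [0] * num_slices
    for t, n in arrivals:
        totals[t // slice_ms] += n
    return [total * 1000 / slice_ms for total in totals]
```

These per-slice samples are the kind of raw, bumpy input that a rate estimator would then smooth.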

The RA does not need to know about metadata such as the DASH Media Presentation Description (MPD), or about segment structures. In a specific implementation, the RA uses several simultaneous instances of an HTTP stack implementation to implement HTTP retrieval over several connections, in some cases even over different types of connections to similar or different servers.

The RA is responsible for letting the SC know when a new request can be accepted. The SC calls the SM to determine the next fragment to request and provides the RA with the appropriate request. The RA also provides some status information. The RA may regularly provide the SM, via the SC, with the short-term download rate and the total time spent downloading. The SM can also poll the RA for this information, indirectly via the SC. In addition, the RA informs the SM what percentage of each individual request has already been completed. This information is similarly provided through an API that the SM calls to retrieve it.

There should be very tight data flow between the RA, the SC, and the actual media pipeline, with as little data as possible buffered within the RA or the SC (aside from the intentional media buffer). The same is true for HTTP requests in their various forms: the decision on which fragment to request should be made only a very small amount of time before the actual corresponding HTTP requests are sent over the network. One reason is that the further in advance the SM has to decide on requests, the less accurate and up to date its information is, and consequently the lower the quality of its decisions.

The SM submits requests to be issued in sequence. However, the SM can also issue new requests even if not all previous requests have completed; concurrent requests are allowed. The SC passes the requests issued by the SM on to the RA. The RA then takes care of concurrent processing and ensures that the received data is handed back to the SC.

Concurrent requests allow the RA to implement HTTP pipelining. In fact, even an RA that makes use of multiple connections fits into this scheme.

1.1 Stream Manager (SM)

The SM determines when to request fragments and which fragments to request, in response to a combination of user actions, network conditions, and other factors. When the user decides to start watching content, the SM is responsible for determining the first fragment to request for that content, starting at the p-time specified by the user or by the service offering. For example, some live streaming services might require all users to be viewing the same p-time portion of the media content at the same r-time, whereas other live streaming and on-demand services might allow the end user or application flexibility as to which p-time to play back at which r-time. When the media buffer becomes full, the SM temporarily suspends issuing further fragment requests. The SM is responsible for deciding at what quality to play back the content at each point in p-time, depending on network conditions and on other factors such as display size and remaining battery life.

When the SM deems it appropriate to provide a fragment request, it can do so only if the RA is ready to receive and process fragment requests. The SC determines when this is the case by polling the RA, and forwards this information to the SM.

When the RA is ready to receive the next request, the SM decides whether a new request should be issued and chooses the next fragment to request. The SM requests media data one fragment at a time. The SM is responsible for requesting fragments that allow timely and seamless playback of the content. A change of the representation being played back can generally occur only at switch points, and there may be multiple fragments between two consecutive switch points; the SM takes that constraint into account.

Generally, the SM attempts to request only fragments that it reasonably believes will be received in time for smooth playback. However, given that network conditions can sometimes change drastically and very quickly, this cannot be guaranteed in all circumstances. Therefore, the SM also has the ability to cancel requests. The SM cancels requests if congestion is detected and there is a significant risk of stalling if no action is taken. Stalling is a possibility if no action is taken when, for example, the download rate suddenly drops sharply because of deteriorating network conditions shortly after a fragment request has been issued.

The SM keeps track of the representation, R, and the end p-time, E, of the most recently selected fragment. The SM typically chooses to request a next fragment with start p-time E' = E. Some variations might have the start time determined from the buffer level and the current playback time.

The SM may generate a sequence of requests intended to produce a stream that can be played back smoothly once any overlap at switch points is discarded. The order in which the SM creates requests is the same order in which the RA should prioritize (though not necessarily issue) them. It is also the same order in which the RA hands the received data back to the SC and in which the SC should play it out.
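The ordering requirement, in which responses are handed back in request order even when concurrent requests complete out of order, can be sketched with a small reordering helper. The class `InOrderDelivery` and its sequence numbering are hypothetical illustrations.

```python
class InOrderDelivery:
    """Release completed responses strictly in the order they were requested.

    Requests are assumed to be numbered 0, 1, 2, ... in the order the SM
    created them (a hypothetical numbering scheme).
    """

    def __init__(self) -> None:
        self._next_seq = 0  # sequence number of the next response to release
        self._completed = {}  # finished responses waiting for earlier ones

    def complete(self, seq: int, data: bytes) -> list:
        """Record a finished response; return every response now deliverable."""
        self._completed[seq] = data
        ready = []
        while self._next_seq in self._completed:
            ready.append(self._completed.pop(self._next_seq))
            self._next_seq += 1
        return ready
```

For example, if request 1 finishes before request 0, nothing is released until request 0 completes, at which point both are handed over in order.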

When the SM decides that it needs to switch rates, there are two processes for doing so in the general case. In the first process, the SM looks for a switch point (also sometimes referred to as a "random access point" or "RAP"), P, in the new ("switch-to") representation with p-time less than or equal to E and, once such a point has been identified, the SM starts requesting fragments in the new representation. The second process is to look for a switch point, P, with p-time later than or equal to E, and to continue requesting fragments in the previous ("switch-from") representation until a fragment with an end time beyond P has been requested. In either case, it may be useful to signal the switch to the SC.
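The two ways of locating a switch point P relative to E can be sketched with a sorted list of switch-point p-times for the switch-to representation. The function names are hypothetical, and choosing the nearest qualifying point (latest at or before E, earliest at or after E) is an assumption made here to keep the overlap small; the text only requires that P be on the stated side of E.

```python
from bisect import bisect_left, bisect_right


def switch_point_at_or_before(switch_points: list, e: float):
    """First process: latest switch point P with p-time <= E, or None."""
    i = bisect_right(switch_points, e)
    return switch_points[i - 1] if i > 0 else None


def switch_point_at_or_after(switch_points: list, e: float):
    """Second process: earliest switch point P with p-time >= E, or None."""
    i = bisect_left(switch_points, e)
    return switch_points[i] if i < len(switch_points) else None
```

With switch points at 0, 4, 8, and 12 seconds and E = 9.5, the first process would start the new representation at P = 8 (re-downloading 1.5 seconds of p-time), while the second would keep fetching the old representation until a fragment ending beyond P = 12 has been requested.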

Note that both of these processes have the property that some overlapping data might have to be downloaded. There is a stretch of p-time for which data might need to be downloaded for both the switch-from representation and the switch-to representation.

Which of these switching processes is preferable depends on the situation. For example, in a particular situation, the overlap might be unreasonably large for one of the processes while being quite short for the other. In the simple case where all fragments are aligned across representations and all fragments start with a RAP, these switching processes reduce to a simpler method, in which the SM switches simply by requesting the next fragment from the switch-to representation instead of from the switch-from representation. Note also that in this case no overlapping data needs to be downloaded.

1.1.1 SM Fragment Decision Process

This section describes an SM fragment decision process for deciding which fragments to tell the SC to request. In these examples a single representation group is assumed, but the examples can be extended to describe processes that use multiple representation groups, for example choosing a video representation from a video representation group and an audio representation from an audio representation group.

The next fragment chosen by the SM typically has a start p-time equal to the end p-time of the previous fragment request. Described below is some detailed logic that might be implemented in the SM for choosing the next fragment to request.

In the examples that follow, fragments are assumed to start with RAPs and to be aligned between representations. If that is not the case, variations of this description are possible. When those conditions hold, the SM's fragment decision reduces to a rate decision; that is, the SM decides whether to stay on the current representation or switch to a different one. In the more general case, where fragments are not necessarily aligned across representations and might not start with RAPs, the decision is similar, but the cost of switching is higher, and that might be taken into account.

The SM representation process comprises two logically separate processes. The first process is the rate estimator, which computes an approximate sustained download rate from the short-term samples that the RA provides; the second process is a decision process that uses this estimate to make switch decisions.

2. Rate Estimation Process

An adaptive bitrate streaming client generally uses a download rate estimator whose output is later used by a rate decision module to choose the right bitrate media. With this approach, when the download rate is high, higher-quality media can be streamed. Changes in the download rate can trigger representation switches. The quality of the rate estimate has a large impact on the quality of the streaming client.

A good rate estimator for an adaptive video streaming device should have a number of properties. First, it should have little variance, even if the short-term download rate varies a lot. Second, it should adapt quickly to rate changes on the underlying channel. When the channel rate drops significantly, the estimate should reflect that fact quickly, so that the device can adjust the quality accordingly without stalling. Correspondingly, an increase in video quality should be observed quickly, so that higher-quality content can be fetched.

Satisfying these two requirements may require trade-offs. Typically, an estimator with small variance will have large reaction times, and vice versa. For example, consider a simple estimator that could be used in a device. That estimator takes a moving average over the last X seconds of download, for some fixed X. Picking a large X, say X = 30 seconds (s), results in a relatively smooth estimate with little variance, but one that reacts only slowly to download rate changes. If such an estimator were used for rate decisions, the resulting player might frequently stall on bandwidth drops, or fail to switch to a higher bitrate in a timely manner when it would be safe to do so. For these reasons, an implementation might pick a smaller X, say X = 3 s. Such a choice results in much quicker rate adjustments, but at the expense of stability. The rate estimate will vary a lot, and the player might therefore change the video playback rate very frequently, resulting in a bad user experience.
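The trade-off between a large and a small window X can be made concrete with a minimal moving-average sketch. The class name, the millisecond parameters, and the assumption of one rate sample per 100 ms slice are illustrative choices, not part of the disclosure.

```python
from collections import deque


class MovingWindowAverage:
    """Simple estimator: mean of the rate samples in the last X seconds."""

    def __init__(self, window_ms: int, slice_ms: int = 100) -> None:
        # Number of samples covering the window, assuming one sample per slice.
        num_samples = max(1, window_ms // slice_ms)
        self._samples = deque(maxlen=num_samples)  # oldest samples drop off

    def add(self, rate: float) -> None:
        self._samples.append(rate)

    def estimate(self) -> float:
        if not self._samples:
            return 0.0
        return sum(self._samples) / len(self._samples)
```

Suppose the rate holds at 4000 for 30 seconds and then drops to 1000 for 3 seconds. An estimator with X = 3 s has fully converged to 1000, whereas one with X = 30 s still reports 3700, which illustrates why a large window reacts slowly to bandwidth drops.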

In FIG. 5, the bumpy curve is the raw download rate, which has a lot of short-term fluctuation. The rate estimate is a smoothed version of the bumpy download rate. On a rate change, it converges to the new sustained rate and stays close to it as long as the rate does not change.

One desirable property is that when the buffer level is small, the adjustment is fast, so that the rate adapts quickly and the presentation buffer does not empty before the adjustment when the download rate is dropping. On the other hand, if there is a lot of media data in the media buffer, the rate estimate should be smoother, with slower adjustments. When there is more media data in the media buffer, the playout rate should tend to remain higher for a longer period of time after the download rate drops than when there is less media data in the media buffer.

The rate estimation process introduced below, referred to as pker, the pker process, or a pker-type process, reacts quickly to rate changes while also being stable, satisfying both the low-variance and the high-reactivity requirements.

2.1. pker Process

This section describes a rate estimation process referred to herein as pker, a pker-type process, or simply the "pker process". A basic rate estimator bases its estimates only on short-term rate measurements, using one method or another to compute a longer-term running average from them. The basic moving window average ("MWA") described above is one example of such a process.

FIGS. 6-7 illustrate the effects of using a non-adaptive (fixed-coefficient) exponential weighted moving average ("EWMA") for rate selection. For simplicity, these plots assume that a new rate estimate immediately triggers a new download selection (i.e., the fragments are relatively small) and that the new rate selection is simply the rate estimate.

FIG. 6 shows the r-time aspect. As shown there, the x-axis is the download time (real time). When a dramatic rate increase occurs at time T1, the buffer starts growing very quickly, because video data is being downloaded much faster than it is being played out. The EWMA estimate gradually converges to the true rate.

FIG. 7 shows the p-time aspect of the same event. In the figure, line 702 represents the bitrate displayed on screen. The rate adjusts much more slowly than in the r-time picture of FIG. 6. Compared to r-time, the speed of convergence in p-time is slowed down at the beginning by a factor of NR/OR, the ratio of the new rate to the old rate (since the player receives about NR/OR seconds of video per second of download at that point). The net effect is that, with this type of rate estimator, the media may play out at a rate much lower than the download rate for a significant amount of p-time.

When the rate is estimated for the purpose of media streaming, the estimator can make use of other pertinent information. In particular, the buffer of the media player is of interest or, more generally, the download history of the media player (reaching farther into the past than what is currently in the buffer), including information on how long it took to download each media segment that is buffered or has already been played out.

An implementation can, for example, use an MWA but choose the window size as a function of the media buffer.

If the buffer level of the media player is high, the player is in no immediate danger of stalling, so a long-term estimate can be taken using a large window, which results in a more stable estimate. If the buffer level is low, on the other hand, the player should react quickly, which suggests that a shorter averaging window is the better choice in that case.

Accordingly, an implementation of the rate estimation process might use a varying window width, with an r-time window proportional to the amount of p-time in the current media buffer (i.e., the current amount of p-time downloaded but not yet played out).
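A buffer-proportional window of this kind can be sketched as follows. This is a minimal illustration, not the described implementation: the proportionality constant, the lower bound on the window, the sample format, and both function names are assumptions.

```python
def averaging_window_s(buffer_ptime_s, proportionality=1.0, floor_s=0.1):
    """r-time window width proportional to the buffered p-time.

    floor_s is an assumed lower bound that keeps the window non-empty.
    """
    return max(floor_s, proportionality * buffer_ptime_s)


def windowed_rate(samples, now_s, window_s):
    """Average rate over the last window_s seconds.

    samples: list of (timestamp_s, bytes_received_during_that_sample_period).
    """
    recent = [b for (t, b) in samples if now_s - window_s < t <= now_s]
    return sum(recent) / window_s


# Hypothetical samples: 100 bytes/s for 8 s, then 400 bytes/s for 2 s.
samples = [(t, 100) for t in range(1, 9)] + [(9, 400), (10, 400)]

low_buffer = windowed_rate(samples, 10, averaging_window_s(2.0))    # reacts fast
high_buffer = windowed_rate(samples, 10, averaging_window_s(10.0))  # smooths more
print(low_buffer, high_buffer)
```

With only 2 s of buffered media the estimate already reflects the new rate, while with 10 s of buffered media it still blends old and new measurements.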

Another implementation might choose the window width in proportion to the number of bytes currently held in the media buffer.

An implementation may also inspect the content of the buffer itself, rather than just its level. For example, if it determines that a large part of the buffer was downloaded in a time much shorter than the playback duration of that same content, this suggests that the download buffer is growing quickly, and the rate estimator might therefore conclude that the estimate needs to be adjusted.

Similarly, a rate estimator might track the rate of change of the buffer level and take fast changes in the buffer level as an indication that the rate estimate needs to be adjusted quickly.

FIGS. 8-9 illustrate the behavior in the same scenario as FIGS. 6-7 when a variable window size weighted moving average ("WMA") filter is used. In the examples, the "pker" process is described in programming code as such a variable window size WMA filter. The pker process may be implemented as program instructions executed by a processor.

In FIG. 8, line 802 is the pker rate estimate for the case where the underlying channel has a sharp rate increase from rate OR (old rate) to rate NR (new rate). The amount of r-time that rate selection takes to adjust to the new rate is proportional to OR/NR: the bigger the increase, the faster the adjustment happens in real time. As shown, at time T2, Buff@T2 = 2 * Buff@T1 and T_fast = OR/NR * Buff@T1.

FIG. 9 shows the playback behavior in p-time. It takes the pker estimator about one buffer duration (the amount of p-time that was in the buffer when the rate increase occurred) to adjust to the new rate. That is, the pker estimator has adjusted to the new rate by the time media content of p-time duration B has been added to the media buffer, where B is the p-time duration of the media content in the media buffer at the time of the rate increase.

A specific process that does this will now be described. The process determines how much r-time it took to download the last γ_T-fraction of the playback buffer, where γ_T is an appropriately chosen constant. For example, this could be the complete time it took to download the entire current playback buffer (γ_T = 1), or the time it took to download the last half of the playback buffer (γ_T = 0.5). γ_T > 1 is also possible. Let T_fast be the amount of r-time it took to download the last γ_T-fraction of the playback buffer. An estimated download rate can be computed by estimating the download rate over the previous T_fast seconds of download time. Note that other values of γ_T are possible; as described herein, different values serve different goals.

A windowed average of this kind, taken over a window of width T_fast, has the remarkable property that it detects rate increases quickly. In fact, if a value γ_T < 1 is used to determine T_fast, the estimator has the property that if the rate increases by an arbitrary factor at some instant when the p-time duration of the media content in the media buffer is B, then the buffer will grow to at most a bounded multiple of B before the rate estimator converges to the increased rate.

A more elaborate rate estimation method can combine the two approaches described above. In particular, it can use the minimum of the buffer level B and T_fast as the averaging window width, i.e., the amount of r-time over which to average the download rate. More generally, the download rate can be averaged over the previous r-time of the minimum of γ_B·B and T_fast, where γ_B is an appropriately chosen constant. Such a choice has the property that it reacts quickly when there is a rate drop with a risk of stalling: in that case B is the minimum, the averaging is over an r-time proportional to the p-time duration of the media content in the media buffer, and so the rate estimate will be the new rate by the time the media buffer is half drained.

For example, suppose that at the time the rate drops the media content duration in the media buffer is B, and the download rate drops to a fraction α < 1 of the playback rate of the representation selected before the drop. Assume pessimistically that the playback rate of the selected representation does not decrease until the rate estimate has decreased to the new download rate. Then, as the download continues for x of r-time beyond the moment of the rate drop, the buffer level is B' = B - x + α·x; that is, x of p-time drains from the media buffer and α·x of p-time is downloaded into it. At the point in time where x = B', i.e., when the media buffer level in p-time equals the r-time over which the download has been at the new rate, the rate estimate will be the new rate, because the download has been at the new rate for the entire averaging window of the previous B' of r-time. Solving the equation x = B' = B - x + α·x for x yields x = B' = B/(2 - α); that is, the rate estimate reaches the new rate while the buffer level B' is still at least B/2. If instead the rate increases significantly at some point in time, T_fast will be the minimum, and the average download rate over the previous T_fast of r-time will be significantly higher than the average over the previous B of r-time.
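The rate-drop bound can be written out step by step. The block below restates the derivation from the text (it assumes the `amsmath` package for `align*`) and adds one numeric instance; the values B = 10 s and α = 0.5 are illustrative assumptions only.

```latex
\begin{align*}
B'(x) &= B - x + \alpha x, \qquad 0 \le \alpha < 1 \\
x = B'(x) \;&\Longrightarrow\; x\,(2 - \alpha) = B
            \;\Longrightarrow\; x = B' = \frac{B}{2 - \alpha} \\
1 < 2 - \alpha \le 2 \;&\Longrightarrow\; \frac{B}{2} \le \frac{B}{2 - \alpha} < B \\
B = 10\,\mathrm{s},\ \alpha = 0.5 \;&\Longrightarrow\; x = B' = \frac{10}{1.5} \approx 6.67\,\mathrm{s}
\end{align*}
```

In this instance the estimate has reached the new rate after about 6.67 s of download, with about 6.67 s of media still buffered, which is above the B/2 = 5 s bound.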

An example of a pker rate estimation process based on this construction is now described in detail. It uses short-term rate measurements, which can be obtained from a download module such as the request accelerator (RA), together with buffer information, to compute an estimate. The buffer information is used to determine the window width over which the short-term rate measurements are averaged to obtain a useful estimate.

FIG. 10 illustrates how the pker rate estimator evolves when the download rate drops sharply. As soon as the rate drops, the buffer level starts dropping, and the rate estimate starts to adjust as well. The rate estimate reaches the new rate (NR) at the latest when the buffer level has dropped by a factor of two. In the example, no intermediate rate decisions are made, so Buff drops linearly. If intermediate decisions were made, the drop of Buff would progressively slow down.

The design goal of the pker process is to use an averaging window large enough to avoid noisy estimates, yet short enough to remain reactive. The pker process achieves this goal by using a windowed average with a dynamically changing window size. The RA maintains several variables in memory for use by the pker process: B, the level of the playback buffer (in p-time); the process parameters γ_B and γ_T; T_fast, a saved value for the r-time it took to download the last γ_T-fraction of the buffer (in p-time); and R, the average download rate over the last C duration of download in r-time, where C = max(STP, min(γ_B·B, T_fast)) and STP is the minimum acceptable window size, which should exceed the sample time period (such as 100 ms, for example). In some embodiments γ_B = 1 and γ_T = 0.5, but other values are possible and result in qualitatively similar behavior as long as both are positive and γ_T < 1. A small γ_B causes the pker process to react quickly to rate reductions, while a small γ_T causes it to react quickly to rate increases.
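The window-size rule C = max(STP, min(γ_B·B, T_fast)) can be sketched directly. The function name and the default values (γ_B = 1, STP = 0.1 s) are illustrative assumptions taken from the example values in the text, not a definitive implementation.

```python
def pker_window_s(buffer_level_s, t_fast_s, gamma_b=1.0, stp_s=0.1):
    """Averaging window C in seconds of r-time.

    buffer_level_s: B, playback buffer level in p-time seconds
    t_fast_s:       T_fast, r-time taken to download the last gamma_T-fraction of the buffer
    stp_s:          STP, the minimum acceptable window size
    """
    return max(stp_s, min(gamma_b * buffer_level_s, t_fast_s))


# After a rate increase T_fast is small, so it determines the window.
print(pker_window_s(buffer_level_s=10.0, t_fast_s=2.0))
# After a rate drop the buffer drains, so B determines the window.
print(pker_window_s(buffer_level_s=1.0, t_fast_s=8.0))
# With an almost empty buffer the window is clamped to STP.
print(pker_window_s(buffer_level_s=0.0, t_fast_s=5.0))
```

Taking the minimum of the two candidates is what lets a single filter shorten its window in both directions: T_fast shrinks on rate increases and B shrinks on rate drops.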

To compute the download rate over a duration of C, as described herein, the SM uses download rate information provided periodically by the RA. For that purpose, the SM may keep a history of the download rate information provided by the RA. The duration over which the average is taken is at most γ_B buffer durations, which effectively limits how much history needs to be kept when there is an upper bound on the media buffer level.

Note that if the selected playout rate is approximately equal to the download rate, then the buffering value C is on the order of the buffer duration, since T_fast = γ_T·B when it takes the same amount of time to download the stream as to play it out. Choosing something on the order of the buffer level in r-time is a natural choice for the smoothing interval of a download rate estimate, since that is the amount of foresight a streaming client must have if it wants to avoid stalling.

In one simple implementation, the averaging window width is proportional to B, the amount of p-time contained in the video buffer. Such a choice protects well against stalling, but it has a drawback: if the download rate is k times the rate of the selected media, every second of download yields k seconds of p-time of the media being downloaded, which makes the rate estimate adjust very slowly. For example, if k = 10 and there are 10 seconds of buffer, the rate estimator would download about k·10 s = 100 s of p-time before adjusting, which is a very long time. This motivates the introduction of T_fast into the pker methods. In fact, matters can be even somewhat worse if an exponentially weighted moving average is used for smoothing, since such filters have an infinite impulse response. For this reason, a pker process uses a finite impulse response filter instead. A plain moving average works well; an implementation may also use more elaborate weighted moving averages.

FIG. 13 illustrates this last point. It shows a comparison of a simple (fixed-width) moving window average with an exponentially weighted moving average. The graph shows that when a rate change occurs, the fixed-window moving average may at first converge more slowly to the new rate, but it will have converged within one window duration. The exponentially weighted moving average tends to move quickly at the beginning, but converges only slowly in later stages. Unlike the windowed moving average, it does not converge within a fixed window, but instead takes time logarithmic in the magnitude of the rate change to converge.
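The difference in convergence behavior can be checked with a small step-response sketch. The filter coefficient (0.2), window length (10 samples), tolerance, and function names are illustrative assumptions; the point is only that the fixed window settles in exactly one window regardless of the jump size, while the exponential filter needs more steps for a larger jump.

```python
from collections import deque


def mwa_steps_to_converge(old, new, window, tol):
    """Steps until a fixed-width moving average is within tol of the new rate."""
    samples = deque([old] * window, maxlen=window)
    steps = 0
    while abs(sum(samples) / window - new) > tol:
        samples.append(new)
        steps += 1
    return steps


def ewma_steps_to_converge(old, new, coeff, tol):
    """Steps until an exponentially weighted moving average is within tol."""
    estimate, steps = old, 0
    while abs(estimate - new) > tol:
        estimate = coeff * new + (1 - coeff) * estimate
        steps += 1
    return steps


for new_rate in (2.0, 100.0):  # small and large jump from an old rate of 1.0
    print(new_rate,
          mwa_steps_to_converge(1.0, new_rate, window=10, tol=0.01),
          ewma_steps_to_converge(1.0, new_rate, coeff=0.2, tol=0.01))
```

Each EWMA step removes a fixed fraction of the remaining error, so the number of steps grows with the logarithm of the jump size relative to the tolerance.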

With γ_B = 1 and γ_T = 0.5, the pker process provides several guarantees. First, if the download rate drops by an arbitrary factor, the estimate adjusts to the new download rate within the time it takes for the buffer to shrink to half its original duration. Second, if the download rate increases by an arbitrary factor, at most one buffer's worth of additional p-time will be downloaded before the pker process converges to the new rate. Straightforward calculations show that similar constant-fraction guarantees hold for any choice of 0 < γ_B and 0 < γ_T < 1.

One approach for computing the buffer level B is as follows. Let T be the current playback p-time of the media player, and let F_{i,1}, ..., F_{i,n} be the fragments in representation group i that have been or are being downloaded and have not yet been played out, sorted in ascending order of start time. Any fragment of group i that is still being downloaded is among F_{i,1}, ..., F_{i,n}. Let α(F_{i,j}) be the fraction of fragment F_{i,j} that has been downloaded, i.e., the number of bytes of F_{i,j} already downloaded divided by the size of F_{i,j} in bytes. The values of α(F_{i,j}) for the various i and j can be computed by the RA and passed to the SM. For a given group i, the current total amount of downloaded p-time is defined as in Equation 1.

(Equation 1)

To compute an overall T_p value from the results of Equation 1, the DASH client takes into account the weighting factors w' of each group, which are determined from the MPD (Media Presentation Description metadata) and the number G of representation groups, and performs the computation of Equation 2. The buffer level B is then defined as B := T_p - T.

(Equation 2)

Equation 2 also captures the part of the buffer that belongs to the fragments currently being played out. This definition also works when several fragments are being downloaded concurrently.
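The buffer-level computation can be sketched as follows. Equations 1 and 2 are not reproduced in this text, so their forms here are assumptions: the per-group value is taken to be the start time of the first unplayed fragment plus the downloaded fraction of each fragment's duration, and the overall T_p is taken to be the w'-weighted sum of the per-group values. The `Fragment` type and all function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    start_s: float      # presentation start time of the fragment (p-time)
    duration_s: float   # presentation duration of the fragment (p-time)
    bytes_done: int     # bytes downloaded so far
    size_bytes: int     # total fragment size

    @property
    def alpha(self) -> float:
        """Downloaded fraction of the fragment, alpha(F)."""
        return self.bytes_done / self.size_bytes


def group_ptime(fragments):
    """Assumed form of Equation 1: downloaded p-time for one group (non-empty list)."""
    frags = sorted(fragments, key=lambda f: f.start_s)
    return frags[0].start_s + sum(f.alpha * f.duration_s for f in frags)


def buffer_level(groups, norm_weights, playback_time_s):
    """Assumed form of Equation 2, followed by B := T_p - T."""
    t_p = sum(w * group_ptime(frags) for w, frags in zip(norm_weights, groups))
    return t_p - playback_time_s


video = [Fragment(10.0, 2.0, 1000, 1000), Fragment(12.0, 2.0, 500, 1000)]
audio = [Fragment(10.0, 2.0, 200, 200), Fragment(12.0, 2.0, 200, 200)]
print(buffer_level([video, audio], [0.75, 0.25], playback_time_s=11.0))
```

Because the sum starts at the first unplayed fragment's start time, the portion of the fragment currently being played out is included in B, which matches the remark in the text.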

To compute T_fast, the SM keeps some history in the general case. Let T_r be the total amount of r-time that the RA has spent (trying to) download media, and let Z be the total number of bytes downloaded by the RA. The value of T_r is computed by the RA. The SM keeps a history H of tuples (T_r^i, Z^i, T_p^i), for i = 1, 2, ..., K, sampled at regular intervals (e.g., every 100 ms), where the K-th observation is the most recent. The history is assumed to be stored in observation order, so that T_r^1 ≤ T_r^2 ≤ ... ≤ T_r^K, as well as Z^1 ≤ Z^2 ≤ ... ≤ Z^K and T_p^1 ≤ T_p^2 ≤ ... ≤ T_p^K.

Now, to compute T_fast, assume that B has already been computed by the method described above. The RA then determines j such that the inequality of Equation 3 is satisfied, for example by searching the history with a binary search.

(Equation 3)

Then T_fast := T_r^K - T_r^j. Note that it is not necessary to keep an infinite history; it is enough for the recorded T_i values to span more than γ_B times the maximum buffer duration.
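The history lookup for T_fast can be sketched with a binary search. Equation 3 is not reproduced in this text, so the search condition used here is an assumption: j is taken to be the latest observation whose recorded p-time is at most γ_T·B below the most recent one, and T_fast is the r-time elapsed since that observation. The tuple layout (r-time, bytes, p-time) and the function name are likewise assumptions.

```python
from bisect import bisect_right


def t_fast_s(history, buffer_level_s, gamma_t=0.5):
    """r-time taken to download the last gamma_t-fraction of the buffer.

    history: list of (t_r, z, t_p) tuples in observation order, so that
             every component is non-decreasing.
    """
    t_p_values = [obs[2] for obs in history]
    target = t_p_values[-1] - gamma_t * buffer_level_s
    # Latest index whose p-time is <= target; fall back to the oldest entry.
    j = max(bisect_right(t_p_values, target) - 1, 0)
    return history[-1][0] - history[j][0]


# Hypothetical history sampled once per second of r-time: the download keeps
# pace with playback at first, then runs at twice the playback rate.
history = [(0, 0, 0.0), (1, 100, 1.0), (2, 200, 2.0),
           (3, 400, 4.0), (4, 600, 6.0), (5, 800, 8.0)]
print(t_fast_s(history, buffer_level_s=4.0))
```

Since the history is sorted, the lookup costs O(log K) per estimate after the p-time column has been extracted.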

Together with the magnified variant in FIG. 16, FIG. 15 illustrates how the B and T_fast values used by the pker process can be determined from the history of recorded (T_p, T_r) values. The figure shows the case where r-time and p-time progress equally fast (there is no download interruption), so that the playback time (p-time) is a 45-degree line against the download time (r-time). The history of (T_p, T_r) values can be plotted in the graph as a curve that lies strictly above the playback time line, provided no playback stall has occurred. The buffer level B is then the difference between the last recorded T_p value and the playout time. The value of T_fast can be read from this graph by measuring the horizontal distance to the (T_p, T_r) curve at a level of γ_T·B below the current (last) T_p value.

FIG. 11 uses the same kind of presentation as FIGS. 15-16 to illustrate the response of the pker process to a sudden rate increase. T_fast is relatively small when the receive rate has undergone a sudden increase to which the player has not yet reacted; this illustrates the fast response to high receive rates. Note that the averaging window lies entirely within the high-rate portion of the graph, since it is relatively narrow. Therefore, at this point, the pker estimate has already converged to the higher rate.

FIG. 12 again uses the presentation of FIG. 15 to illustrate the response of a variable window size WMA filter (e.g., pker) to a rate drop. In this case T_fast becomes relatively large, but the buffer drains and B therefore becomes small, so that the averaging window falls entirely within the low-rate region after some draining time. As shown, the width of the averaging window is B, since B is smaller than T_fast, and the estimate still converges to the lower new rate before the buffer is completely empty.

FIG. 14 is a flowchart of a pker rate estimation process.

Once the T_fast and B values have been computed, the value of C follows easily, and the final step is to compute the rate R over the past window of duration C. For that purpose, the Z^i and T_r^i values in the history are used.

To compute the rate over an interval C, the SM or RA does the following: (1) find the largest j such that T_r^j ≤ T_r^K - C, and then (2) compute the average download rate as in Equation 4. If no such j exists in the first step, the SM or RA sets j := 0, i.e., the oldest known observation. The value of j can be determined efficiently by a binary search.

(Equation 4)
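The two steps can be sketched as below. Equation 4 and the search condition are not reproduced in this text, so both are assumptions here: j is taken as the latest observation that is at least C of r-time older than the most recent one, and the rate is the bytes downloaded since then divided by the r-time elapsed since then. The tuple layout (r-time, bytes) and the function name are hypothetical.

```python
from bisect import bisect_right


def average_rate(history, window_s):
    """Average download rate over roughly the last window_s of r-time.

    history: list of (t_r, z) tuples in observation order (non-decreasing).
    Returns bytes per second of r-time.
    """
    t_r = [obs[0] for obs in history]
    # Step 1: latest j with t_r[j] <= t_r[-1] - window_s, else the oldest entry.
    j = max(bisect_right(t_r, t_r[-1] - window_s) - 1, 0)
    elapsed = t_r[-1] - t_r[j]
    if elapsed <= 0:
        return 0.0  # not enough history to form an estimate
    # Step 2: bytes received since observation j over the r-time elapsed.
    return (history[-1][1] - history[j][1]) / elapsed


history = [(0, 0), (1, 100), (2, 200), (3, 600), (4, 1000)]
print(average_rate(history, window_s=2))   # window covers only the fast part
print(average_rate(history, window_s=10))  # window longer than the history
```

When the requested window is longer than the stored history, the sketch falls back to the oldest observation, mirroring the fallback described in the text.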

Each group has an associated weight w, corresponding to the fraction of the total bandwidth that the group is expected to consume. It is a function of the information provided by the MPD, preferably after unusable representations have been removed. A suggested definition of the weight w of a group g is w(g) := maxrate(g) + minrate(g), where maxrate(g) is the maximum playback rate in group g and minrate(g) is the minimum.

From the weights w, the SM or RA can compute the normalized weights w' as follows. Assuming that the client wants to stream groups 1, ..., G, the normalized weights are the weights divided by the sum of all the weights, as in Equation 5.

(Equation 5)

The normalization is intended to be done over the weights of the groups that are actually streamed. For example, if there is a group that is not being streamed, it should not enter the computation.
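The weight definition and its normalization can be sketched in a few lines. The group names and bitrates are made-up illustrative values, and the function names are hypothetical; only groups that are actually streamed should be passed in.

```python
def group_weight(playback_rates):
    """w(g) := maxrate(g) + minrate(g) for the usable representations of a group."""
    return max(playback_rates) + min(playback_rates)


def normalized_weights(streamed_groups):
    """w'(g): each weight divided by the sum over the streamed groups.

    streamed_groups: dict mapping a group name to its list of playback rates.
    """
    weights = {g: group_weight(rates) for g, rates in streamed_groups.items()}
    total = sum(weights.values())
    return {g: w / total for g, w in weights.items()}


# Hypothetical MPD content, rates in kbit/s.
groups = {"video": [500, 1000, 2500], "audio": [64, 136]}
print(normalized_weights(groups))
```

The normalized weights sum to one, so they can be used directly as the per-group factors when combining per-group quantities.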

Some assumptions are made in the operation of the pker process. For example, the buffer levels of the individual representation groups should be kept relatively close together; the pker process works better that way. For example, suppose one group has a very large buffer and another a very small one, and both have similar weights. In that case it may be necessary to adjust the rate estimate quickly, since the small buffer requires this in order to avoid stalling when conditions change. The pker process, however, would still smooth its estimates as if acting on a much larger buffer. Conversely, for the large buffer, the measurements would have a somewhat higher variance than the buffer level warrants, and would thus result in nervous rate decisions.

In some cases, it is unavoidable that the representation groups differ widely in buffer level. For this reason, another implementation may use a variant of the pker method that adjusts rates more quickly when some buffers are very small, and thus protects somewhat better against stalls in such cases. Such an implementation computes T_fast in the same way as before, but sets the window size to C = max(STP, min(T_fast, T_p,1 - T, T_p,2 - T, ..., T_p,N - T)).
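The modified window rule can be sketched as follows. The function name, the default STP of 0.1 s, and the example values are illustrative assumptions; the formula itself is the one given in the text.

```python
def pker_window_multi_s(t_fast_s, group_ptimes_s, playback_time_s, stp_s=0.1):
    """C = max(STP, min(T_fast, T_p,1 - T, ..., T_p,N - T)).

    group_ptimes_s: per-group downloaded p-time values T_p,i
    playback_time_s: current playback p-time T
    """
    per_group_levels = [t_p - playback_time_s for t_p in group_ptimes_s]
    return max(stp_s, min([t_fast_s, *per_group_levels]))


# One group is far ahead (8 s buffered), the other is nearly drained (0.5 s).
print(pker_window_multi_s(t_fast_s=5.0, group_ptimes_s=[20.0, 12.5],
                          playback_time_s=12.0))
```

The smallest per-group buffer level bounds the window, so a nearly drained group forces a short window even when the other groups are well ahead.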

Another variant of these download rate estimators uses an independent pker estimate for each representation group and uses it to make the decisions for that group.

3. Fetching Strategy

Streaming video players generally have a limited media buffer. Thus, in normal operation, the buffer-full state is expected to be reached eventually. When the buffer is full, the streaming module should throttle the media input to avoid overfilling the buffer. An easy way to do this is, whenever the buffer is full, to wait until it has drained enough to hold the next fragment, and then resume fetching.

The effect of this method is that each fragment is fetched individually, with a time gap between successive fragment requests, namely the time it takes to drain enough of the buffer for the next fragment to fit and be requested.

The TCP protocol automatically adjusts its download rate based on current network conditions. When a download is started over a TCP connection, the initial download rate is generally very slow, and it increases as TCP probes whether a higher download rate can be achieved. How quickly TCP increases the download rate, and more generally how TCP reacts to the properties of the end-to-end TCP connection, is quite complex and depends on many factors, including the inherent end-to-end network latency, the buffer capacities of the network elements along the TCP delivery and acknowledgement paths, competing traffic along those paths, and which variant of TCP is in use. In general, TCP starts at a slow download rate and increases it over time, so the average download rate of a TCP connection over the whole download time approaches the sustainable TCP download rate only when the whole download time is substantial.

For example, if the sustainable TCP download rate is 1 megabit/second and the TCP connection starts at a download rate of essentially zero and increases linearly to 1 megabit/second over one second, then the average download rate during the first second is 500 kilobits/second, and it takes 10 seconds of download for the average download rate to reach 95% of the sustainable download rate. For this reason, a fetching strategy with many download gaps is not ideal, where download gaps are the periods of time between the completion of one download request and the start of the next. Even a gap of zero between download requests is non-ideal, since TCP typically takes some time to ramp the download rate back up for the next request after the previous request completes. After each gap, the sustainable throughput may have to be reached again, which reduces the overall average download rate achieved.
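The numbers in the example above can be checked with a small calculation. The sketch below uses the same idealized model as the text (a rate that rises linearly from zero to the sustainable rate during a ramp period and then stays there); real TCP behaviour is more complex, and the function name and parameters are illustrative only.

```python
def average_rate(t, sustainable=1_000_000.0, ramp=1.0):
    """Average download rate in bits/second over the first t seconds.

    Idealized model from the text: the instantaneous rate rises linearly
    from 0 to `sustainable` during the first `ramp` seconds, then stays
    at `sustainable`.
    """
    if t <= 0:
        raise ValueError("t must be positive")
    if t <= ramp:
        # Area of a triangle with base t and height sustainable * t / ramp.
        bits = 0.5 * sustainable * t * t / ramp
    else:
        # Full ramp triangle plus a rectangle at the sustainable rate.
        bits = 0.5 * sustainable * ramp + sustainable * (t - ramp)
    return bits / t


print(average_rate(1.0))   # 500000.0, i.e. 500 kilobits/second
print(average_rate(10.0))  # 950000.0, i.e. 95% of 1 megabit/second
```

In this model the average after t ≥ 1 seconds is (t - 0.5)/t of the sustainable rate, which is why 95% is only reached at t = 10 seconds.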

Such a reduced rate can lead to a smaller rate estimate, and hence to the selection of a smaller media rate. This in turn results in smaller (in byte size) media fragments being downloaded, which further increases the relative size of the gaps and potentially causes an even smaller playback rate to be selected. In other words, the effect is self-amplifying.

Therefore, it is advantageous for a DASH client implementation to use a process that minimizes the impact of this issue.

One implementation may download media data continuously and then periodically drain the buffer, as follows. Whenever the amount of p-time that has been requested but not yet played out exceeds a preset high watermark M_h, the SM issues no further requests until the buffer level drops below a low watermark M_l. In a specific implementation, M_h = 20 seconds and M_l = 10 seconds, but in other implementations these values may be lower or higher. After the drop below the low watermark, normal operation resumes and the SM starts emitting fragment decisions again.
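The watermark behaviour amounts to a two-state hysteresis, which the following Python sketch illustrates. The class and method names are hypothetical, and the default values simply reuse M_h = 20 seconds and M_l = 10 seconds from the text; how the buffered p-time is measured is left to the caller.

```python
class WatermarkFetcher:
    """Two-state hysteresis between fetching and draining (illustrative)."""

    def __init__(self, high=20.0, low=10.0):
        if low >= high:
            raise ValueError("low watermark must be below high watermark")
        self.high = high      # M_h, in seconds of p-time
        self.low = low        # M_l, in seconds of p-time
        self.draining = False

    def may_request(self, buffered_ptime):
        """Return True if the SM may issue a request at this buffer level.

        buffered_ptime is the p-time requested but not yet played out.
        """
        if self.draining:
            # Resume only once the level has dropped below the low watermark.
            if buffered_ptime < self.low:
                self.draining = False
        elif buffered_ptime > self.high:
            # Stop issuing requests once the high watermark is exceeded.
            self.draining = True
        return not self.draining
```

Because the state only flips at the two watermarks, a buffer level of 15 seconds allows requests while filling but not while draining, which is what produces long uninterrupted fetching periods.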

Another implementation may use watermarks specified in bytes rather than in presentation time to achieve a similar effect.

The fact that the buffer is periodically draining can be used to advantage by other parts of the system. For example, it can be used to obtain fresh estimates of the RTT, as described in Section 6.1.2.

FIG. 17 illustrates the operation of the “watermark” fetching process. The top graph shows the buffer level, with its alternating pattern of draining periods and fetching periods. The download rate is shown in the bottom graph. At the beginning of each fetching period, TCP takes some time to reach the sustainable maximum rate, so the average download rate (during the fetching periods) is less than the maximum achievable download rate. The larger the difference between the low and high watermarks, the longer the fetching periods and the higher the average rate.

4. Rate Selection Process

When it starts requesting media data, the streaming module (SM) uses some method to make the first playout rate selection. It may take the lowest available rate, or it may, for example, keep a history of network conditions and then use this history to estimate which playout rate is likely to be sustainable without stalls. Once the SM is already receiving data and therefore has a rate estimate R available (for example, one of the rate estimates computed with the methods of Section 2), it decides whether to stay at the current rate or to change representations.

A simple rate decision process is described first. The receiver determines the highest-bandwidth representation whose playback rate is lower than the estimated download rate R, and selects it as the representation from which to play out (play back) data. Although straightforward, this approach has a number of problems. First, it does not naturally cause a small media buffer to grow, and is therefore susceptible to stalls even when the download rate varies only a little. Second, a varying estimate R leads to rapidly changing rate decisions, which may not be necessary and can be visually disturbing. Third, it leads to a startup time of at least approximately the duration of a fragment, and thus typically a few seconds.

A DASH client may therefore implement a rate decision process that bases its rate decisions not only on the download rate estimate R, but also on the buffer level B (that is, the amount of p-time that is buffered but not yet played out) and on content-dependent variables such as the change rate D, which is generally an estimate of the p-time duration between two consecutive switch points.

Thus, one implementation may pick as the decision rate the largest available playback rate that does not exceed a multiple of R, where the proportionality factor is a function of the buffer level.

In general, the proportionality factor λ is an increasing function of the buffer level. For example, an implementation may make λ an affine function of the buffer level.

When λ is a function of the buffer level, an implementation can choose λ to be small when the buffer is empty or low. Such a choice is beneficial because it causes a small buffer to grow and also provides some safety against stalling when the download rate has not been predicted accurately.

For larger buffer levels, an implementation can choose values of λ close to, equal to, or even exceeding 1. This ensures that a high playout rate is chosen for download when there is no imminent risk of stalling, so that high-quality media is streamed in the steady state.

The rate decision process may implement a λ that is a piecewise affine function of B rather than a simple affine function. Piecewise affine functions can approximate arbitrary continuous functions to any desired degree of accuracy, which makes them a suitable choice. Any other parametrizable class of functions with the same property could be chosen instead.

Another implementation may make λ a function of the buffer level in bytes rather than the buffer level in p-time.

Yet another implementation may make λ a function not of the buffer level B alone, but of both the buffer level B and the frequency of switch opportunities. The reason for doing so is that a player with fewer opportunities to change the rate commits itself further into the future with each decision than one with more frequent opportunities. Thus, in the former case, each decision commits to a larger time span and hence to a higher risk. This suggests that, to keep the risk of stalling low, it may be better to choose a lower rate in the former case than in the latter when the buffer level B and the estimated download rate R are the same.

A concrete way for a rate selection process to take the frequency of rate switch opportunities into account is as follows. Let D be the typical amount of p-time between two consecutive switch points in the stream. The value of D depends on the encoded video and may be taken to be, for example, the maximum p-time distance between two consecutive switch points, the average distance between two consecutive switch points, the 90th percentile of the distance between two consecutive switch points, or any other suitable measure of the p-time distance between two consecutive switch points in the media. Given such a D, the method may choose λ to be a piecewise affine function of B/D, or of a variant such as B/max(u, D) or B/(D + u), where the value u is added to account for the overhead incurred in issuing requests. The value of u may be a small constant amount of time (such as 100 ms, for example). As a further refinement, an implementation may make u a small multiple of the estimated RTT.
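The following Python sketch illustrates the candidate measures for D and the three argument variants named above. It is an illustration only: the function names, the `measure` and `variant` keywords, and the use of the nearest-rank definition for the 90th percentile are assumptions, not something the text prescribes.

```python
import math


def switch_point_spacing(switch_times, measure="max"):
    """Estimate D from the p-times (in seconds) of consecutive switch points."""
    gaps = sorted(b - a for a, b in zip(switch_times, switch_times[1:]))
    if not gaps:
        raise ValueError("need at least two switch points")
    if measure == "max":
        return gaps[-1]
    if measure == "mean":
        return sum(gaps) / len(gaps)
    if measure == "p90":
        # Nearest-rank percentile (one of several common definitions).
        rank = math.ceil(0.9 * len(gaps))
        return gaps[rank - 1]
    raise ValueError("unknown measure: " + measure)


def normalized_buffer(b, d, u=0.1, variant="max"):
    """Argument passed to lambda: B/D, B/max(u, D) or B/(D + u).

    u models request overhead; 0.1 s matches the 100 ms example in the text.
    """
    if variant == "plain":
        return b / d
    if variant == "max":
        return b / max(u, d)
    if variant == "sum":
        return b / (d + u)
    raise ValueError("unknown variant: " + variant)
```

For switch points at 0, 2, 4, 8 and 10 seconds the gaps are 2, 2, 4 and 2 seconds, so the maximum measure gives D = 4 while the mean gives D = 2.5; the choice of measure therefore changes how conservative the resulting λ argument is.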

A process that bases its rate decision on λ·R alone, like the methods above, has the drawback that even relatively small variations in R can cause many rate switches. This may not be desirable. When there is enough buffer, it may be better not to react immediately to small changes in R, and instead to let the buffer level vary accordingly.

To obtain such behavior, a process may use values λ and μ, both functions of the same quantity (for example B, B/D, or B/max(100 ms, D), as described above), together with the current rate, to pick a new rate decision. The functions should be chosen so that λ·R is a low acceptable rate choice and μ·R is a high acceptable rate choice. The process can then be designed to use these two values as guides toward a good rate decision.

In such a setting, the functions should generally be chosen such that λ ≤ μ.

The rate decision process may decide to keep the rate unchanged if the previous selection was already in the range from λ·R to μ·R. If the previous selection is less than λ·R, the largest available playback rate equal to or less than λ·R is selected. If the previous selection is greater than μ·R, the largest available playback rate equal to or less than μ·R is selected.

An implementation may choose to hardcode the functions λ and μ. Alternatively, it may select the functions in a more elaborate way, depending on the circumstances. In particular, an implementation may select appropriate λ and μ functions as a function of the maximum amount of buffering the client will do. For on-demand content, a client may choose to prebuffer a lot of data, potentially minutes of media data. For low-latency live content, a client can buffer at most the amount of media prescribed by the end-to-end latency, which may be only a few seconds. For content with little buffering, the client may decide to pick λ and μ functions that are more conservative, that is, that take smaller values.

A concrete implementation may, for example, interpolate linearly between two extremal functions λ₁ and λ₂, where the interpolation point is chosen by the low buffer watermark M_l (see Section 3). It thus has two hardcoded functions λ₁ and λ₂ and, for some values m₁ < m₂, λ₁ is used for small values of M_l, namely those less than m₁, and λ₂ is used when M_l ≥ m₂. For values of M_l in the range from m₁ to m₂, the function λ(x) := λ₁(x)·(m₂ - M_l)/(m₂ - m₁) + λ₂(x)·(M_l - m₁)/(m₂ - m₁) is used.
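The interpolation between the two extremal functions can be sketched as a function that returns a new function. This is an illustrative Python sketch; the name `interpolate_lambda` and the representation of λ₁ and λ₂ as Python callables are assumptions.

```python
def interpolate_lambda(lam1, lam2, m1, m2, m_l):
    """Blend two extremal functions according to the low watermark m_l.

    lam1 is used for m_l < m1, lam2 for m_l >= m2, and a linear blend
    of the two in between, following the formula in the text.
    """
    if not m1 < m2:
        raise ValueError("m1 must be smaller than m2")
    if m_l < m1:
        return lam1
    if m_l >= m2:
        return lam2
    w1 = (m2 - m_l) / (m2 - m1)  # weight of lam1
    w2 = (m_l - m1) / (m2 - m1)  # weight of lam2; w1 + w2 == 1
    return lambda x: lam1(x) * w1 + lam2(x) * w2
```

For instance, with m₁ = 2, m₂ = 10 and a low watermark of 6 seconds, both weights are 0.5, so the resulting function is the pointwise average of λ₁ and λ₂.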

We now describe a concrete example of a rate decision process that follows the description above. For this, we introduce some notation.

1) Let S₁, S₂, …, S_L be the stream rates of the L available representations of a representation group (given in increasing order).

2) Let λ(x) be a piecewise linear function that takes a non-negative scalar as input and returns a non-negative real scaling coefficient. The function λ(x) should be configurable, either at compile time or through a configuration file. For large x, for example for x larger than M_l, λ(x) should be constant.

Here is one example of how such a function can be implemented. Corner points (0, λ₀), (x₁, λ₁), …, (x_N, λ_N) are given, with the x_i in increasing order. To evaluate λ(x), find the largest i such that x_i ≤ x. The receiver can then evaluate the function using Equation 6.

(Equation 6)

A suitable example of such a λ(x) function is the one defined by the example parameters N = 1, [(0, 0.5), (3, 1)]; that is, the function equals 0.5 at x = 0, increases linearly until x reaches 3, at which point it equals 1, and stays at 1 thereafter.
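Equation 6 is not reproduced in this text. The Python sketch below therefore assumes ordinary linear interpolation between the two corner points surrounding x, with the function held constant after the last corner point, which is consistent with the description and with the example parameters above. The name `piecewise_linear` is hypothetical.

```python
from bisect import bisect_right


def piecewise_linear(corners):
    """Build lambda(x) from corner points [(0, l0), (x1, l1), ..., (xN, lN)].

    Assumes linear interpolation between neighbouring corner points and a
    constant value beyond the last one (an assumption; the original
    Equation 6 is not available here).
    """
    xs = [x for x, _ in corners]
    ys = [y for _, y in corners]
    if xs[0] != 0 or any(a >= b for a, b in zip(xs, xs[1:])):
        raise ValueError("corner points must start at 0 and be increasing")

    def f(x):
        if x < 0:
            raise ValueError("x must be non-negative")
        i = bisect_right(xs, x) - 1  # largest i with xs[i] <= x
        if i == len(xs) - 1:
            return ys[-1]            # constant beyond the last corner point
        frac = (x - xs[i]) / (xs[i + 1] - xs[i])
        return ys[i] + frac * (ys[i + 1] - ys[i])

    return f


lam = piecewise_linear([(0, 0.5), (3, 1.0)])
print(lam(0), lam(1.5), lam(3), lam(10))  # 0.5 0.75 1.0 1.0
```

Keeping the corner points as plain data makes it straightforward to load them from a configuration file, as item 2) suggests.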

3) Let μ(x) be another such piecewise linear function. An example of such a function is one that evaluates to 0 at x = 0, reaches 15 at x = 3, and stays constant thereafter.

4) Let D be an estimate of the duration, in p-time, from one switch point to the next (as specified earlier).

5) Let x := min{T_d - T, M_l}/max{D, 1 second}, where T is the current playback p-time, T_d is the p-time for which the rate decision is made, D is as described above, and M_l is the buffer level low mark (see Section 3).

6) Let CURR be the currently selected representation (that is, the one used in the last fragment request). Let UP be the playout rate of the highest-bitrate representation with a rate of at most λ(x)·R; if there is no such representation, UP is the playout rate of the lowest-bitrate representation. Let DOWN be the playout rate of the highest-bitrate representation with a rate of at most μ(x)·R; if there is no such representation, DOWN is the playout rate of the lowest-bitrate representation. Since generally λ(x) ≤ μ(x), generally DOWN ≥ UP.

The rate decision process then picks the rate NEXT of the next fragment as follows: (1) if UP < CURR, then NEXT := min(DOWN, CURR); (2) otherwise NEXT := UP.
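The selection of NEXT from UP, DOWN and CURR can be written out directly. The Python sketch below follows steps 6) and the rule above; it takes the already evaluated values λ(x) and μ(x) as plain numbers, treats CURR as the playout rate of the current representation, and uses hypothetical function names.

```python
def highest_rate_at_most(rates, limit):
    """Highest available rate not exceeding limit, else the lowest rate."""
    candidates = [s for s in rates if s <= limit]
    return max(candidates) if candidates else min(rates)


def next_rate(rates, curr, r, lam_x, mu_x):
    """Pick NEXT from the available rates.

    rates  -- stream rates S_1..S_L of the representation group
    curr   -- playout rate of the currently selected representation
    r      -- download rate estimate R (same unit as rates)
    lam_x, mu_x -- lambda(x) and mu(x), already evaluated at x
    """
    up = highest_rate_at_most(rates, lam_x * r)
    down = highest_rate_at_most(rates, mu_x * r)
    if up < curr:
        # Stay at CURR unless even mu(x) * R no longer supports it.
        return min(down, curr)
    # UP >= CURR: switch up (or stay, if UP equals CURR).
    return up
```

With rates of 250, 500, 1000 and 2000 kbit/s, CURR = 1000, λ(x) = 0.5 and μ(x) = 1.0, an estimate of R = 1500 keeps the rate at 1000, R = 800 switches down to 500, and R = 5000 switches up to 2000, so moderate fluctuations of R around the current rate do not cause a switch.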

In step 5 above, the reason for using max{D, 1 second} rather than simply D is the RTT; the 1 serves as an upper bound on the RTT.

The functions λ(x) and μ(x) are preferably increasing as functions of x. For small x, the λ and μ functions are preferably less than 1, which ensures that the selected playout rate is less than R, causing buffer growth at small buffer levels. Note that the selected playback rate is at most max(λ(B/max{D, 1}), μ(B/max{D, 1}))·R, which guarantees buffer growth for all buffer levels B for which both λ(B/max{D, 1}) and μ(B/max{D, 1}) are less than 1.

A similar process could directly pick the new representation to be the best representation with a playback rate lower than λ(B)·R. This would still have the property that the buffer tends to fill when it is nearly empty. However, it would also cause many representation switches, since R can fluctuate considerably. The more elaborate rate selection process described here tries to avoid switches, and instead allows the buffer to drain to some extent before switching down to a lower playback rate. For this to work, the functions μ and λ should be chosen so that μ exceeds λ for larger buffer levels: note that if the selected playback rate is CURR and the measured rate is R, no rate change occurs as long as Equation 7 is satisfied, which allows the receive rate to fluctuate somewhat without rate switches.

(Equation 7)
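Equation 7 is not reproduced in this text. As a derivation from the selection rule for NEXT given earlier (and not a quotation of the missing equation), one sufficient condition for the rate to stay at CURR can be written as follows, assuming that CURR is one of the available rates S₁, …, S_L and that λ(x) and μ(x) are positive:

```latex
\lambda(x)\,R \;<\; \mathrm{CURR} \;\le\; \mu(x)\,R
\qquad\Longleftrightarrow\qquad
\frac{\mathrm{CURR}}{\mu(x)} \;\le\; R \;<\; \frac{\mathrm{CURR}}{\lambda(x)}
```

The left inequality means that UP does not exceed CURR, so no switch up is made; the right inequality means DOWN ≥ CURR, so min(DOWN, CURR) = CURR. The wider the gap between λ(x) and μ(x), the wider the range of R over which no switch occurs.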

In some versions, λ and μ are functions of the buffer level B alone, rather than of the ratio B/max{D, 1}. The motivation for introducing the latter is as follows.

Let α denote the ratio of the playback rate of the selected representation to the download rate. We want to determine a good α. Downloading up to the next switch point takes approximately α·D of r-time. Just before the received data is added to the buffer, the buffer will have drained to B - α·D. To avoid stalling, we want this quantity to be positive; as a safety cushion, it should moreover be proportional to the playback duration D of the fragment that is added to the buffer once it has been downloaded, so it should be at least β·D for some β > 0. In summary, we want B - α·D ≥ β·D.

Solving for α gives B/D - β ≥ α. This means that the representation selection process should choose a ratio of playback rate to download rate that does not exceed B/D - β. The functions λ(x) and μ(x) are bounds on the acceptable such ratios; they should therefore be functions of x = B/D that do not exceed x - β.
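The bound α ≤ B/D - β can be checked numerically. The Python sketch below is a minimal illustration with hypothetical function names; B and D are in seconds of p-time and β is the dimensionless safety factor from the derivation.

```python
def max_playback_ratio(b, d, beta):
    """Largest alpha with B - alpha*D >= beta*D, i.e. alpha <= B/D - beta."""
    if d <= 0 or beta <= 0:
        raise ValueError("d and beta must be positive")
    return b / d - beta


def leaves_cushion(b, d, alpha, beta):
    """True if the buffer still holds at least beta*D when the data arrives."""
    return b - alpha * d >= beta * d
```

For example, with B = 6 seconds, D = 2 seconds and β = 0.5, the bound is α ≤ 2.5: at α = 2.5 the download takes about 5 seconds and leaves exactly the 1-second cushion β·D, while α = 3 would drain the buffer completely.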

To account in practice for the additional cost of an RTT for requesting a fragment, we replace B/D by B/max{D, 1}. More generally, the 1 may be replaced by some multiple of an estimate of the RTT, or by other parameters that take into account the reaction time of the processes that initiate the download of media data from the server.

FIG. 18 illustrates examples of λ and μ functions that may be used to select a playback rate. The x-axis is the buffer level in units of D, and the y-axis is the receive fraction, that is, the playback rate of the representation divided by the current receive or download rate. As illustrated by line 1802, if the receive fraction is less than 1, the buffer will grow, and if it is greater than 1, the buffer will shrink. Three regions are identified. First, if the player is below the λ-curve 1804 at a decision point, it will switch up in rate. If it is between the λ-curve 1804 and the μ-curve 1806, it will stay at the selected rate. If it is above the μ-curve 1806, it will switch down.

FIG. 19 shows an example choice of (λ, μ)-functions using a “conservative” setting. This setting is “conservative” in that it does not use all of the available bandwidth, but in exchange will stall very rarely.

FIG. 20 shows an example choice of (λ, μ)-functions using a “moderate” setting. This setting is “moderate” in that it uses more bandwidth than the conservative one, but is a little more prone to stalls.

FIG. 21 shows an example choice of (λ, μ)-functions using an “aggressive” setting. This setting is “aggressive” in that it tries to use all of the available bandwidth. It will stall more frequently than the other two example settings presented.

FIG. 22 shows an example choice of (λ, μ)-functions using a process that emulates an MLB process, that is, a process similar to some extent to one proposed by some researchers working with Major League Baseball (MLB). Note that these (λ, μ)-functions do not vary based on media buffer fullness.

FIG. 23 illustrates an example of side-by-side values for the λ and μ settings.

FIG. 24 illustrates an example of side-by-side values for the λ and μ settings.

FIG. 36 includes tables of values that may be used for λ and μ in rate selection.

FIG. 25 illustrates a process for rate estimation, followed by rate-based rate selection, followed by buffer-management-based rate selection. In this example process, one or more of the approaches described herein are used to perform rate estimation. Based on that estimate, a new playback rate is selected and may be adjusted based on buffer management rules.

5. Request Cancellation

In some cases, even a good rate selection process cannot by itself prevent video playback stalls. For example, if the download rate drops sharply after a request has been issued but before it completes, the selected bitrate may have been too high, and the slow download rate can lead to a playback stall before the next opportunity to change the playback rate is even reached.

In another example, the media buffer may be full of relatively low-playback-rate media when the available bandwidth increases dramatically, for example because of a transition from a cellular connection to a WiFi connection. In this case, it may be advantageous to discard a large portion of the media that has been downloaded but not yet played out, and to download the discarded p-time portions again, this time from a higher-playback-rate representation. Thus, the already downloaded low-playback-rate media is cancelled, and higher-playback-rate media from another representation is downloaded to be played out in its place, leading to a higher-quality user experience.

For this reason, a streaming module implementation may implement a module that monitors the download rate and, in some circumstances, cancels earlier decisions. When a request is cancelled, the streaming module should issue a new request based on a newer, more appropriate estimate of the download rate. We refer to this monitoring module here as the request cancellation process.

The request cancellation process may cancel requests for different reasons. For example, it may cancel requests because the download rate has dropped sharply and playback is at risk of stalling because data is not being received fast enough. Another reason to cancel is a determination that higher-quality media can be selected and retrieved in time for playback. Yet another reason arises when the receiver determines that a stall will occur regardless of what it does, and estimates whether cancelling would shorten the stall period compared with letting the outstanding request complete. The receiver then chooses the action with the shorter estimated stall, potentially also taking into account the quality of the media representation to be played back. Of course, whether a stall occurs, and how long it lasts if it does, may differ from the estimates.

Once a cancellation has been decided, it can be carried out by closing the TCP connection on which the request was issued. Closing the connection has the effect of telling the server to stop sending data for the canceled fragment, so that the bandwidth used by the closed connection becomes available for fetching replacement data.

The streaming module can then issue a request to replace the canceled fragment. It may be necessary to open a new TCP connection for this purpose.

An implementation has several options for choosing the replacement request. Which one is most suitable may depend on the type of content being played out.

It may try to choose a replacement request that allows seamless playback of the stream. In the general case, this means that the replacement request must have a switch point at or before the end time of the previously downloaded fragment.

In that case, the player should cancel if a stall is foreseen when the download continues without cancellation, and if canceling and selecting a replacement segment is expected to avoid the stall or at least reduce its duration. It can then choose, as the replacement request, the highest-quality media request that has that property.

The rate cancellation process can foresee stalls as follows: it can compute the estimated completion time of the issued request by dividing the number of outstanding bytes of the fragment by the estimate of the download rate. If that time is later than the deadline by which the fragment is needed for smooth playback, a stall is foreseen.

When an imminent stall is foreseen, the request cancellation process determines whether or not a switch in rate is likely to improve the situation; the decision to cancel is made only if an improvement is likely.

One implementation can estimate the time it takes to load the replacement fragment based solely on the rate estimate and the size of the candidate replacement fragment.

Another implementation may also take into account the additional overhead caused by the cancellation: it may add a multiple of the estimated RTT to account for the time needed to cancel the existing request and issue a new one. Data from the canceled request that has been queued for delivery over the network but has not yet arrived at its destination can contribute additional delay. The client can estimate this delay by dividing the TCP receive window size by the estimated rate. Another estimate of the delay can be based on the estimated bandwidth-delay product. The client may take a combination of the two estimates, such as the maximum of the two.

In summary, the client computes the sum of the time needed to download the whole replacement fragment, a quantity typically proportional to the RTT, and an estimate of the queuing delay. If a stall is foreseen and this sum is smaller than the estimated remaining time to download the current fragment, a cancellation is issued.
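The cancellation test summarized above can be sketched as follows. This is a minimal illustration, not the claimed implementation: the function and parameter names, the default RTT multiple of 2, and the choice of bytes per second as the rate unit are assumptions made for the example.

```python
def replacement_time(replacement_bytes, rate, rtt, rwnd_bytes, bdp_bytes,
                     rtt_multiple=2.0):
    """Estimated seconds until a replacement fragment is fully received.

    rate is the estimated download rate in bytes per second.
    rtt_multiple is a hypothetical tuning constant; the text only says
    "a multiple of the estimated RTT".
    """
    download = replacement_bytes / rate
    overhead = rtt_multiple * rtt  # time to cancel and reissue the request
    # Two estimates of the delay caused by data still queued in the network;
    # the larger one is used, as suggested in the text.
    queuing = max(rwnd_bytes / rate, bdp_bytes / rate)
    return download + overhead + queuing


def cancel_is_worthwhile(stall_foreseen, outstanding_bytes, replacement_bytes,
                         rate, rtt, rwnd_bytes, bdp_bytes):
    """Cancel only if a stall is foreseen and switching finishes sooner."""
    remaining = outstanding_bytes / rate  # time to finish the current fragment
    switch = replacement_time(replacement_bytes, rate, rtt,
                              rwnd_bytes, bdp_bytes)
    return stall_foreseen and switch < remaining
```

With a rate of 100,000 bytes per second, an RTT of 0.1 s and a 64 KiB receive window, a 200,000-byte replacement is estimated to take a little under 2.9 s, so canceling pays off only if the current fragment still needs more than that.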

The request cancellation process may also cancel at startup, if the player notices that downloading the first fragment is taking longer than desired because the initial rate choice was inaccurate.

Another rate cancellation implementation may also choose a replacement request that does not allow seamless playback but instead involves skipping a number of frames. This may be necessary when playing live content that requires the end-to-end latency to be kept small.

An implementation that cancels with frame skipping may choose the replacement fragment so that the frame skip is as small as possible.

The implementation may choose, as the replacement request, the highest-quality request that can be downloaded sustainably without exceeding a specified stall duration or skipped-frame duration.

A different kind of cancellation can be implemented for fragments that have already been downloaded: if the player has already buffered some media that is going to be played out, it may decide to fetch a higher-quality representation over the network and stream that instead, while discarding the previously buffered lower-quality version.

The cancellation process may decide to perform these replacement operations if it determines that it can receive the better-quality video in time for it to be played out without stalling.

FIG. 26 shows a sharp drop in download rate occurring immediately after a new fragment request at time T1. Up to the time of the request, the receive rate was OR; it then drops to NR. The buffer level drops from then on. The newly requested fragment will be completely downloaded at about T2 = T1 + (OR/NR) × fragment duration. If OR/NR is large, this download time exceeds the p-time duration of the media content in the buffer at time T1, which means that the requested fragment cannot be played back without a stall. The pker estimator will converge to the rate NR much sooner than that, but because the request had already been issued by T1, the fragment download is under way before the estimate has had a chance to converge to the new rate NR. To avoid the stall and issue a new request with the corrected estimate, it is necessary to cancel the request and reissue a request at a more suitable bitrate.

FIG. 27 shows the case with request cancellation. After the sharp drop in download rate (line 2702), the buffer starts to drain and the estimated download rate (e.g., from the pker process) starts converging to the new download rate. At some point, the stream manager recognizes that the fragment will not be received in time for stall-free playback. That point is marked as the “cancellation point” (2704) in the plot of FIG. 27. At that point, the partially received fragment is canceled and evicted from the buffer (hence the further drop in buffer level). After that, however, a fragment at the correct rate can be requested, so the buffer level does not drop further. In fact, if a nontrivial rate-selection process is used, it will grow again.

FIG. 28 is a flowchart illustrating an example request cancellation process.

FIG. 29 shows a process for request cancellation detection.

We now describe a request cancellation implementation based on the details above.

In this section, N_i denotes the number of fragments of representation group i that have been requested but not yet completely received. They are referred to as F_{i,1}, …, F_{i,N_i}. Assume further that the F_{i,j} are sorted in increasing order of start p-time, and that α(F_{i,j}) is, for the requested fragment F_{i,j}, the number of bytes already downloaded divided by the fragment's size in bytes. The variable T denotes the current playback p-time. The request cancellation detection process may then proceed as shown by the pseudocode of FIG. 29.

When the request cancellation detection process is run, it either returns nil, if no action needs to be taken, or identifies a fragment in a group to cancel. If such a fragment is identified, it means that this fragment, and everything in the same group that follows it (in p-time order), is to be canceled and flushed from the buffer. The SM should then invoke its rate decision process again and issue new alternative requests for that section.

To explain the process, pretend for the moment that only one request is still outstanding. In that case, let R be an accurate estimate of the download rate and let d_avail be the number of bytes that can still be received before the fragment in question has to be played out. The quantity d_need is the number of bytes still missing from that fragment. Thus, if d_avail < d_need, we foresee that the player will stall before playing fragment F_{i,j}. This explains the first “if” condition in the process above.

Even if a stall is foreseen, it makes sense to cancel only if canceling can avoid the stall or at least reduce its duration. After a cancellation, a new fragment has to be selected and downloaded from scratch. If there is only one representation group and the rate decision process chooses the correct rate, this takes approximately λ · duration(F_{i,j}), where λ is the currently relevant lambda factor. If, on the other hand, the SM decides not to switch, completing the current fragment download takes time d_need · R^{-1}. Assuming λ = 1 for simplicity, this gives the second condition, possibly together with other factors.
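The two conditions explained above can be illustrated for a single outstanding request as follows. FIG. 29 is not reproduced in this text, so this is only a sketch of the single-request reasoning, not the pseudocode of the figure. The class and function names are hypothetical, and the sketch assumes playback at normal speed, so that the time left before the fragment is needed equals its start p-time minus T.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    size_bytes: int     # total size of the requested fragment
    alpha: float        # fraction of the fragment already downloaded
    start_ptime: float  # p-time at which the fragment starts playing
    duration: float     # playback duration of the fragment in seconds


def should_cancel_fragment(frag, rate, playback_ptime, lam=1.0):
    """Return True if the single outstanding fragment should be canceled.

    rate is the estimated download rate R in bytes per second and
    playback_ptime is the current playback p-time T.
    """
    time_left = max(0.0, frag.start_ptime - playback_ptime)
    d_avail = rate * time_left                       # bytes receivable in time
    d_need = frag.size_bytes * (1.0 - frag.alpha)    # bytes still missing
    stall_foreseen = d_avail < d_need                # first condition
    # Second condition: a fresh download (about lam * duration) must be
    # faster than finishing the current one (d_need / R).
    switch_helps = lam * frag.duration < d_need / rate
    return stall_foreseen and switch_helps
```

For example, a 1,000,000-byte fragment that is 20% complete, is due in 4 s and is arriving at 100,000 bytes per second would need 8 s to finish, so both conditions hold and the fragment is a cancellation candidate.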

6. Request Accelerator

A straightforward approach for a streaming media client is to fetch the media over a single HTTP connection. Such a client processes the fragment requests sequentially. This approach has some drawbacks for video streaming. First, general networking software is often tuned only for maximum throughput over long downloads. While this is good for receiving large files, it conflicts with other important streaming goals, such as a steady reception rate. Second, because of the nature of TCP, the full capacity of the link cannot necessarily be used with such an HTTP connection. If the channel experiences some delay and packet loss, TCP limits the actual throughput that can be achieved, which potentially prevents the streaming client from streaming good-quality media.

To avoid these problems, a special HTTP client can be implemented, which we call here the request accelerator (RA). The request accelerator has special processes that avoid or reduce the problems mentioned above. An implementation of a request accelerator can use several key components to achieve its goals. It can use several TCP connections to receive the data. Those connections can be active in parallel. It can split data requests into smaller chunk requests, which can be downloaded individually over different connections and reassembled into one larger piece in the request accelerator. It can tune TCP connection parameters (in particular the TCP receive window size) so that the connections are fair to one another and have relatively steady data reception. It can dynamically adjust the number of TCP connections to use based on measured network conditions and target playback rates.

The ideal number of TCP connections to use depends on the network conditions, in particular the round trip time (RTT) and the packet loss behavior. The RA can therefore use methods to estimate these quantities.

The RA can estimate the RTT by sampling the time it takes from issuing an HTTP request until the response starts coming in. One implementation may use an RTT estimate obtained by taking the minimum of all such samples collected over a fixed period of time, say the last few seconds. Another implementation may use the minimum of the last N samples obtained as the estimate, where N is some integer.
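The second variant, taking the minimum of the last N samples, can be sketched as follows. The class name, method names and the default window of 10 samples are assumptions for illustration; timestamps are in seconds.

```python
from collections import deque


class MinRttEstimator:
    """Keeps the last N request-to-first-byte samples and reports their minimum."""

    def __init__(self, window=10):
        # deque with maxlen drops the oldest sample automatically
        self._samples = deque(maxlen=window)

    def add_sample(self, request_sent_at, first_byte_at):
        # One sample is the delay between issuing the HTTP request and
        # the arrival of the first response bytes.
        self._samples.append(first_byte_at - request_sent_at)

    def estimate(self):
        # None until at least one sample has been recorded
        return min(self._samples) if self._samples else None
```

Using the minimum rather than the mean makes the estimate less sensitive to samples inflated by server processing time or transient queuing.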

Obtaining measurements of packet loss above the TCP layer is usually difficult, because the TCP protocol handles packet loss itself and delivers only contiguous prefixes of the data to the application. It is therefore sometimes useful to fix a reasonable value for the packet loss as an input to the RA process instead. One implementation assumes that the loss is constant. In the absence of any packet loss measurement, the RA may assume a loss of 1%, or it may assume a loss of 0.1%. The assumption may be determined by the type of connection; for example, it may be set to 0.1% for a WiFi connection and to 1% for a cellular connection. Other methods, such as the variance in RTTs, can be used by the RA to infer packet loss indirectly. Alternatively, an implementation may obtain a packet loss estimate by querying the operating system for information about it.

Another implementation may estimate the loss in the application itself. To do this, it can use the following procedure, which is based on the observation that data from a network socket is generally received in chunks of the maximum segment size (MSS), but that a packet loss causes the reception of a much larger chunk, a burst of roughly the size of the whole TCP receive window. Let M be the MSS in bytes (a good guess is M = 1500); then if n bytes are received, the number of packets sent is about n/M. Let z be the number of socket reads that returned more than k·M bytes, where k is some small integer. Assume k is chosen large enough that it is unlikely that k or more packets arrived between two network reads of the application. For an application that waits on the socket continuously, k = 3 should be fine. Then p = z·M/n is an estimate of the packet loss probability. By counting z and n from a desired starting point, this procedure can estimate the packet loss rate over any desired span of time.
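The estimate p = z·M/n can be computed from a log of socket read sizes as in the following sketch. The function name and the representation of the reads as a list of byte counts are assumptions made for the example; M = 1500 and k = 3 are the values suggested in the text.

```python
def estimate_packet_loss(read_sizes, mss=1500, k=3):
    """Estimate the packet loss probability from the sizes of socket reads.

    read_sizes holds the number of bytes returned by each socket read in
    the observation window. Reads larger than k * mss are taken as bursts
    released after a lost packet was recovered.
    """
    n = sum(read_sizes)  # total bytes received
    if n == 0:
        return None      # nothing observed yet
    z = sum(1 for size in read_sizes if size > k * mss)
    return z * mss / n   # about z lost packets out of n / mss sent
```

For instance, 96 reads of 1500 bytes plus one burst read of 6000 bytes give n = 150,000 and z = 1, hence an estimated loss of about 1%.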

Given estimates of the RTT and the packet loss probability, the application can compute a good number of connections to use. The process may in particular choose a number of connections large enough that the target download rate can be achieved with that number of connections. The achievable rate of a single connection is generally limited by the TCP equation for achievable throughput, which says roughly that a single TCP connection can achieve an average download rate of T = MSS/(RTT·√p). The process may therefore choose a number of connections proportional to the target download rate divided by T.

For practical reasons, the RA may also place lower and upper bounds on the number of TCP connections to use. For example, the RA may limit the maximum number of connections it opens to 8 and the minimum number of connections to 2.
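A connection count derived from the TCP throughput relation T = MSS/(RTT·√p) and clamped to the example bounds of 2 and 8 might be computed as below. The function names, the proportionality factor of 1 and the rounding up are assumptions for illustration; rates are in bytes per second.

```python
import math


def tcp_throughput(mss_bytes, rtt_s, loss):
    """Approximate average rate of one TCP connection, in bytes per second."""
    return mss_bytes / (rtt_s * math.sqrt(loss))


def connection_count(target_rate, rtt_s, loss, mss_bytes=1500,
                     min_conn=2, max_conn=8, factor=1.0):
    """Number of connections proportional to target_rate / T, within bounds.

    factor is a hypothetical proportionality constant; the text does not
    give a value for it.
    """
    per_connection = tcp_throughput(mss_bytes, rtt_s, loss)
    wanted = math.ceil(factor * target_rate / per_connection)
    return max(min_conn, min(max_conn, wanted))
```

With an RTT of 150 ms and 0.1% loss, one connection is limited to roughly 316,000 bytes per second, so a target of 1,000,000 bytes per second yields four connections.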

Bandwidth, loss probability, and RTT are subject to change. The request accelerator monitors those quantities and changes the number of connections dynamically.

The request accelerator can split HTTP requests into smaller subrequests and reassemble the data responses returned for all the subrequests into a coherent response corresponding to the original request. Splitting requests into subrequests has a number of advantages. First, in order to make use of the available TCP connections, requests need to be issued on all of them. The media streaming player may not provide enough requests to use all the connections. Request splitting mitigates this problem, since it yields a larger number of subrequests, which can then be issued on different connections. Second, request splitting yields shorter requests, which reduces the risk of untimely data delivery: if some TCP connections are temporarily slower than others, they can still be used with short requests. They will deliver their response more slowly than a fast connection, but the added relative delay in completing the whole request will typically not be that large, since the requests are small.

In general, if more connections are in use, it is desirable to create more subrequests per request. To achieve this, the request accelerator may split each request into n subrequests when there are n connections.

Another implementation chooses the number of subrequests per request depending on the request size. If the subrequest size is chosen to be the size that is predicted to take a fixed amount of time to download (say, 2 seconds), then requests will be split into more subrequests when there are more connections, which achieves the desired effect.

The splitting rule should ensure that there are no unnecessarily small subrequests. For example, an RA implementation may enforce a minimum subrequest size in its splitting process, and split into fewer subrequests if the minimum would not be met.
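A size-based splitting rule with a minimum subrequest size could look like the following sketch. The function name, the 16 KiB minimum and the even division of the byte range are assumptions for illustration; the 2-second target comes from the example in the text. Each subrequest is returned as an (offset, size) pair.

```python
def split_request(offset, length, rate, target_seconds=2.0,
                  min_size=16 * 1024):
    """Split a byte range into subrequests of roughly rate * target_seconds bytes.

    rate is the predicted download rate of a subrequest in bytes per
    second. No subrequest is made smaller than min_size unless the whole
    request is smaller than that.
    """
    target_size = max(min_size, int(rate * target_seconds))
    count = max(1, round(length / target_size))
    # Use fewer subrequests if the minimum size would otherwise be violated.
    count = min(count, max(1, length // min_size))
    base, extra = divmod(length, count)
    chunks = []
    pos = offset
    for i in range(count):
        size = base + (1 if i < extra else 0)  # spread the remainder
        chunks.append((pos, size))
        pos += size
    return chunks
```

Calling `split_request(0, 65536, 16384)` yields two 32 KiB subrequests, while a 10,000-byte request is left as a single subrequest.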

When multiple TCP connections are used, they may compete for bandwidth. On a large time scale, each connection will receive the same amount as the others, but on a smaller scale, such as over a few seconds, some TCP connections can be significantly slower than others. This poses a problem for streaming, since it implies that some subrequests can take much longer than others, which can lead to playback stalls.

To avoid this, the RA can use TCP flow control to “tame” the connections. It can limit the maximum receive window of each TCP connection enough that no connection can use significantly more than its fair share of the throughput. The amount of data in flight (sent but not yet acknowledged) over a TCP connection is roughly the download rate multiplied by the RTT. Thus, if the TCP receive window is set to roughly, or slightly more than, the target download rate for the connection multiplied by the estimated RTT, the download rate will be limited to roughly, or slightly more than, the target download rate. Setting the TCP receive window size can therefore act as a governor, ensuring that a given TCP connection does not download at such a high rate that it forces other TCP connections to download at much lower rates. With such a mechanism in place, the connections tend to fetch at roughly the same speed, because slow connections then have bandwidth available to speed up to their fair share, while at the same time the connections can achieve an aggregate download rate that is at least, or slightly above, the aggregate target receive rate.

The RA can adjust the receive window in the receiving client by adjusting the receive buffer. It always readjusts these settings between consecutive requests.

One implementation may set the TCP receive window of each connection to slightly more than the product of the estimated RTT and the target download rate divided by the number of connections.
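As a worked illustration of this rule, the per-connection window can be computed as follows. The function name, the headroom factor standing in for "slightly more" and the 4096-byte floor are assumptions; the target rate is in bytes per second.

```python
import math


def receive_window_bytes(target_rate, rtt_s, connections,
                         headroom=1.25, floor=4096):
    """Per-connection TCP receive window in bytes.

    The window is the estimated RTT times the per-connection share of the
    target download rate, scaled by a hypothetical headroom factor.
    """
    share = target_rate / connections  # target rate of one connection
    window = math.ceil(headroom * rtt_s * share)
    return max(floor, window)          # avoid uselessly small windows
```

For a 2 mbps target (250,000 bytes per second), an RTT of 150 ms, four connections and a headroom of 2, the result is about 18,750 bytes per connection, that is, twice the per-connection bandwidth-delay product.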

The target download rate can be determined, for example, from the media rate that is to be played back. Another implementation may set the target rate based on the current playback rate (e.g., twice the current download rate).

6.1 RA Embodiment

We now describe an embodiment of a request accelerator that incorporates the elements described above.

FIG. 30 is a plot of the behavior when fetching over multiple TCP connections. FIGS. 30-31 show the behavior under different conditions. In this example, the connection to the web server was bandwidth-limited to 2 megabits per second (“mbps”), the round trip time was 150 ms, and there was 0.1% packet loss. There were four active connections fetching fragments. The plots in FIGS. 30-31 show the instantaneous rates of the four connections, as well as the aggregate rate and the RTT estimate obtained in the client.

In FIG. 30, the receive buffers of the connections are not limited. In FIG. 31, they are limited to about twice the bandwidth-delay product.

In the examples of FIGS. 30 and 31, both methods steadily achieve the full 2 mbps throughput. When the connections have limited receive windows (FIG. 31), the delivery across connections is much more even: most of the time they receive at about the same rate. That is not at all true for the connections with unlimited windows (FIG. 30), where some connections are slower than others over long stretches of time.

Uneven connection speeds are a problem for a streaming application, since they may mean that some urgent data is being received only very slowly (on a slow connection) while bandwidth is diverted to faster connections that may be fetching data that is not needed as urgently.

Another difference between unrestricted and restricted receive windows is the RTT at which the client operates. With the limits in place, the RTT stays low, close to the propagation delay. Without receive window limits, the amount of data in flight exceeds the product of the underlying propagation delay and the capacity of the connection, so the queuing delay becomes very significant and causes a high RTT. A high RTT is undesirable for a media streaming client, because the client's reaction time to many events is generally a multiple of the RTT. For example, the client's reaction time to a user seek event that causes new media content to be downloaded, or to a reduction in download speed that triggers a request cancellation or a switch of representations, is generally many multiples of the current RTT, so the client's general responsiveness to such events degrades when the RTT is large.

FIG. 32 is a flowchart of a request accelerator process.

FIG. 33 shows a process for finding the number of subrequests to make for a given fragment request.

FIG. 34 shows a process for choosing the individual requests, which are selected to be disjoint intervals of the source request having the computed sizes. In this process, the subrequest sizes are intentionally randomized so that the times at which connections become idle differ from connection to connection. This prevents all connections from being idle at the same time and thus gives better channel utilization. The request sizes are also sorted so that larger requests go out first, which helps limit the differences in idle time.
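The randomization and ordering described for FIG. 34 can be sketched as follows. The figure's actual procedure is not reproduced in this text, so the function name, the ±25% jitter and the way the sizes are drawn are assumptions; the sketch also assumes the request is much larger than the number of subrequests. Each subrequest is an (offset, size) pair.

```python
import random


def randomized_subrequests(offset, length, count, jitter=0.25, rng=None):
    """Split a byte range into `count` disjoint intervals of randomized size.

    Sizes vary by up to +/- jitter around the even share, and the
    intervals are returned largest first so that the bigger subrequests
    are issued first.
    """
    rng = rng or random.Random()
    weights = [1.0 + rng.uniform(-jitter, jitter) for _ in range(count)]
    total = sum(weights)
    sizes = [int(length * w / total) for w in weights]
    sizes[0] += length - sum(sizes)  # hand the rounding remainder to one chunk
    sizes.sort(reverse=True)         # larger requests go out first
    chunks = []
    pos = offset
    for size in sizes:
        chunks.append((pos, size))
        pos += size
    return chunks
```

Passing a seeded `random.Random` instance as `rng` makes the split reproducible, which is convenient for testing.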

FIG. 35 shows time offsets and the fragment structure for a repair segment determined by the time offsets.

In operation, the request accelerator receives HTTP requests from the SC (each request being a URL and a byte range).

The request accelerator downloads the requested byte ranges over HTTP and hands the data back to the SC once it has been completely received. The RA aims to achieve a sufficiently large download speed while at the same time ensuring that each fragment is received before its deadline. A high download speed makes it possible to choose a high-quality video representation, while meeting the deadlines ensures that playback proceeds without stalls.

To achieve a high download speed, the RA manages a varying number of open TCP connections, all of which are used to receive data over HTTP. The RA handles the details of how many connections to use, of opening or reopening them when necessary, and of how to dispatch requests to connections.

In some cases the RA will decide to split a source request into smaller, so-called RA requests, which are then dispatched on different connections and whose response data is transparently reassembled by the RA on arrival. For example, for a source request covering the first 64 kilobytes of some file, the RA may create two RA requests, one for the first 32-kilobyte chunk and another for the second 32-kilobyte chunk of that file. The RA can then request the two chunks in parallel on two different connections and, once both 32-kilobyte chunks have been received, create a coherent 64-kilobyte response for the original request.
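The 64-kilobyte example can be sketched in a few lines. This is a minimal illustration only: the helper names `split_range` and `reassemble` are hypothetical, and the even split is an assumption made for the example (the description elsewhere randomizes and sorts subrequest sizes).

```python
def split_range(start, length, parts):
    """Split the byte range [start, start + length) into `parts` contiguous
    (offset, size) chunks. Even splitting is an assumption for illustration."""
    base, extra = divmod(length, parts)
    chunks, offset = [], start
    for k in range(parts):
        size = base + (1 if k < extra else 0)  # spread any remainder over the first chunks
        chunks.append((offset, size))
        offset += size
    return chunks


def reassemble(responses):
    """Join chunk payloads, keyed by byte offset, into one coherent response,
    regardless of the order in which the chunks arrived."""
    return b"".join(responses[offset] for offset in sorted(responses))


# A 64-kilobyte source request becomes two 32-kilobyte RA requests.
chunks = split_range(0, 64 * 1024, 2)
```

Because the payloads are keyed by offset, the chunk that arrives first does not have to be the first chunk of the range.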

The RA may issue RA requests that are more than just plain subranges of source requests. For example, it may issue a request for the FEC data of a fragment in addition to the plain video data. In that case, the RA transparently decodes the FEC information once it has been received and presents only the final, decoded fragment to the source.

The RA autotunes itself to the network conditions. For example, if the RTT is large, the RA may decide to issue larger chunk requests so as to avoid a lot of idle time between requests. Another example of autotuning is that the RA tries to keep the speeds of the individual connections similar, so that it can ensure the timeliness of its requests. To be able to do these things, the RA preferably has direct access to the sockets of its connections. For example, in a Unix-like environment, it can set socket options using the setsockopt() function.

The RA measures and keeps track of the network state; in particular, this includes measuring the download rate and the estimated round-trip time (RTT). It collects this information for two reasons: first, the autotuning of connections depends on its availability, and second, the bandwidth information needs to be passed on to the SM, which uses it to compute its rate estimates.

Another piece of information that the RA forwards to the SM (by way of the SC) is progress information about outstanding requests, i.e., how much data of a given request has already been received. The SM uses that information both for its rate estimates and for its request cancellation decisions.

The RA keeps track of the information the SM needs to make bandwidth estimates. This information is the total amount of r-time spent downloading, T_r, and the total number of bytes downloaded, Z. Both numbers are monotonically increasing and are frequently polled by the SM. The T_r timer is running if and only if at least one connection is active. A connection is considered active if it is sending out an HTTP request or waiting for response data to come in. The Z counter counts the incoming bytes and is aggregated over all connections.

RA Download Rate History

The request accelerator keeps track of some history of the rate by keeping a growing array of (T_r, Z) pairs, stored in historical order. We call this array mapTrZ. Updates to mapTrZ happen frequently: at least at fixed time intervals (e.g., every 100 ms), and possibly also whenever new data is received.

The RA uses mapTrZ to compute a windowed bandwidth estimate as follows. Consider a window of interest of width t, and let mapTrZ[last] be the last entry in mapTrZ. Then find the largest index i such that mapTrZ[i].T_r ≤ mapTrZ[last].T_r − t. Note that i can be found efficiently with binary search. The rate average is then as shown in Equation 8.

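$$R = \frac{\mathrm{mapTrZ}[\mathrm{last}].Z - \mathrm{mapTrZ}[i].Z}{\mathrm{mapTrZ}[\mathrm{last}].T_r - \mathrm{mapTrZ}[i].T_r}$$ (Equation 8)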

Equation 8 assumes that the differences between successive T_r values are small compared to t. This is ensured by sampling frequently enough and by never choosing a tiny window width t.
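The windowed estimate can be sketched as follows. This is an illustration under stated assumptions rather than the patented implementation: the rate is taken to be the byte delta divided by the T_r delta between the last entry and entry i, the function name `windowed_rate` is hypothetical, and falling back to the oldest entry when the history is shorter than the window is a choice made here.

```python
from bisect import bisect_right


def windowed_rate(map_trz, t):
    """Estimate the download rate over the last `t` units of r-time.

    map_trz is a list of (T_r, Z) pairs in historical order, so T_r is
    non-decreasing. The result is in bytes per unit of T_r, or None when
    no estimate can be formed.
    """
    if len(map_trz) < 2:
        return None
    t_last, z_last = map_trz[-1]
    times = [t_r for t_r, _ in map_trz]
    # Binary search for the largest i with T_r[i] <= T_r[last] - t.
    i = bisect_right(times, t_last - t) - 1
    if i < 0:
        i = 0  # assumption: history shorter than the window, use the oldest entry
    t_i, z_i = map_trz[i]
    if t_last == t_i:
        return None
    return (z_last - z_i) / (t_last - t_i)


# T_r in milliseconds, Z in bytes.
history = [(0, 0), (100, 1000), (200, 2000), (300, 4000), (400, 6000)]
rate = windowed_rate(history, 200)  # bytes per millisecond over the last 200 ms
```

Building the `times` list on every call is linear in the history length; a real implementation would search the stored array in place.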

In practice, an arbitrarily growing array is a nuisance. The maximum duration over which the past is examined can be upper bounded, so mapTrZ can instead be implemented as a ring buffer of fixed size. This can be done as follows. Whenever the mapTrZ array is to be updated and already contains at least two pairs, replace the last entry if T_r − mapTrZ[last−1].T_r < 100 ms; otherwise, add a new entry.
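A minimal sketch of that update rule, assuming T_r is tracked in milliseconds. The class name `RateHistory` and the default capacity are hypothetical; `collections.deque` with `maxlen` stands in for the fixed-size ring buffer.

```python
from collections import deque


class RateHistory:
    """Fixed-size history of (T_r, Z) pairs with the replace-or-append rule."""

    def __init__(self, capacity=64, min_gap_ms=100):
        # deque(maxlen=...) silently drops the oldest entry when full.
        self.entries = deque(maxlen=capacity)
        self.min_gap_ms = min_gap_ms

    def update(self, t_r_ms, z):
        e = self.entries
        # Replace the last entry while it is less than min_gap_ms after the
        # second-to-last one; otherwise start a new entry.
        if len(e) >= 2 and t_r_ms - e[-2][0] < self.min_gap_ms:
            e[-1] = (t_r_ms, z)
        else:
            e.append((t_r_ms, z))


h = RateHistory(capacity=4)
for t_ms in (0, 40, 80, 120, 160):
    h.update(t_ms, t_ms * 10)
```

The rule keeps consecutive retained entries at least roughly 100 ms apart, so a buffer of fixed capacity always covers a bounded minimum span of the past.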

6.1.2 Round-Trip Time ("RTT") Estimation

The RA collects bandwidth estimates. A priori, a simple way to obtain an RTT sample is to measure the time between sending an HTTP GET request on an idle connection and the moment the response starts to arrive.

However, such measurements include queuing delay. If the client has other open, active connections, the last hop that sends data to the client may buffer a number of packets when the link to the client has a lower rate than the rate at which that hop receives data. In that case, packets may be delivered with a longer delay than they inherently would have.

In our case, it is desirable to know the RTT discounting the queuing delay induced by the client's own activity. To obtain an estimate of that quantity, we proceed as follows.

During each period of activity, we collect RTT samples using the timing method described above; each GET results in a sample. The current estimate is then the minimum of all of these samples. The list of samples is flushed whenever the RA becomes inactive. (The client becomes inactive, for example, when the high watermark of Section 3 has been exceeded and the downloads already started have completed.) In inactive periods, or in active periods before any RTT sample has been received, the RTT estimate is the last known estimate.

The RTT estimator may also return an indication that no RTT estimate is known, which may be used, for example, at client start-up.
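The estimator described in this section can be sketched as a small class. The class and method names are hypothetical; `None` is used here as the "no RTT estimate known" indication, which is one possible choice.

```python
class RttEstimator:
    """Minimum-of-samples RTT estimate per activity period."""

    def __init__(self):
        self._samples = []       # samples from the current activity period
        self._last_known = None  # estimate carried across inactive periods

    def add_sample(self, rtt_s):
        """Record one sample (one per GET), in seconds."""
        self._samples.append(rtt_s)

    def on_inactive(self):
        """Flush the samples when the RA becomes inactive, keeping the estimate."""
        if self._samples:
            self._last_known = min(self._samples)
        self._samples.clear()

    def estimate(self):
        """Current estimate, the last known one, or None if none is known."""
        if self._samples:
            return min(self._samples)
        return self._last_known


est = RttEstimator()
```

Taking the minimum is what discounts self-induced queuing delay: queuing can only lengthen a sample, so the smallest sample of a period is the one least affected by it.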

6.1.3 Adjusting the Number of TCP Connections

Tuning the TCP flow control allows the RA to keep the bandwidth on the different connections roughly the same. Several configurable tuning constants are used: k_R (rate measurement window, measured in RTTs; suggested value: 30), k_N (proportionality factor; suggested value: 8192 bytes), N_min (lower cap on N_target; suggested value: 1), and N_max (upper cap on N_target; suggested value: 8).

The estimated bandwidth-delay product (BDP) is defined as BDP := RTT · R, where RTT is the estimated RTT (as above) and R is the average receive rate over the last k_R · RTT of time (estimated with the windowing method).

The target number of connections is then defined as in Equation 9, where k_N is a configurable constant.

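$$N_{\mathrm{target}} := \max\left(N_{\min},\ \min\left(N_{\max},\ \left\lceil \frac{\mathrm{BDP}}{k_N} \right\rceil\right)\right)$$ (Equation 9)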

The value of N_target is recomputed periodically. If the number of currently open connections is less than N_target, new connections are opened immediately to match N_target. If, on the other hand, N_target is less than the number of currently open connections, no immediate action is taken. Instead, whenever an RA request finishes, the RA checks whether too many connections are open and, if so, closes the connection that has just become idle.
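The adjustment policy can be sketched as below. The clamped form ceil(BDP / k_N), bounded by N_min and N_max, is an assumption consistent with the constants described in this section; the function names are hypothetical.

```python
import math


def target_connections(rtt_s, rate_bytes_per_s, k_n=8192, n_min=1, n_max=8):
    """N_target from the estimated BDP, clamped to [n_min, n_max].
    Assumed form: ceil(BDP / k_N) with BDP = RTT * R."""
    bdp = rtt_s * rate_bytes_per_s
    return max(n_min, min(n_max, math.ceil(bdp / k_n)))


def connections_to_open(n_open, n_target):
    """Open new connections immediately when below target."""
    return max(0, n_target - n_open)


def should_close_idle(n_open, n_target):
    """Checked only when an RA request finishes: close the connection that
    just became idle if too many connections are open."""
    return n_open > n_target


n_target = target_connections(rtt_s=0.1, rate_bytes_per_s=250_000)
```

Growth is eager while shrinkage is lazy, so a request that is in progress is never interrupted merely because the target dropped.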

6.1.4 Adjusting the TCP Receive Window on Connections

The RA sets the TCP receive window size of each connection to $\lceil c_w \cdot \mathrm{BDP} / N_{\mathrm{target}} \rceil$. Here, c_w is a configurable, hard-coded constant, for example c_w = 3. The RA sets the TCP receive window size of a connection each time it is about to issue the next HTTP request on that connection.
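On a Unix-like system, one way to influence the advertised receive window is the `SO_RCVBUF` socket option. The sketch below assumes the window is sized as ceil(c_w · BDP / N_target); the helper names are hypothetical, and the operating system may round, double, or clamp the requested buffer size, so the value read back can differ from the value requested.

```python
import math
import socket


def receive_window_bytes(bdp_bytes, n_target, c_w=3):
    """Per-connection window: c_w times the connection's share of the BDP
    (assumed form)."""
    return math.ceil(c_w * bdp_bytes / n_target)


def apply_receive_window(sock, window_bytes):
    """Request a receive buffer of window_bytes before issuing the next HTTP
    request on this connection; returns what the OS actually configured."""
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_RCVBUF, window_bytes)
    return sock.getsockopt(socket.SOL_SOCKET, socket.SO_RCVBUF)


window = receive_window_bytes(bdp_bytes=25_000, n_target=4)
```

A caller would invoke `apply_receive_window(sock, window)` on the TCP socket of the connection that is about to carry the next request.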

6.1.5 The Request Splitting Process

Each source request handed to the RA is potentially split into more than one RA request, each of which corresponds to a different part of the requested range. Once all the RA requests corresponding to a given source request have completed, the received data is reassembled by the RA into the complete fragment, which is then returned to the SC.

For a given HTTP request, the RA determines the number n of RA requests using a process that depends on a few tunable values. The value of n depends on the following tunable constants: T_wn (rate estimation window width; suggested value: 4 s), D_min (minimum fetch duration; suggested value: 2 s), and c_s (minimum fetch duration in RTTs; suggested value: 6).

The process for finding the number n of subrequests to make for a given fragment request is shown in the pseudocode of FIG. 33.

The individual requests are then chosen to be disjoint intervals of the source request having the computed sizes, using, for example, the process shown in FIG. 34.

6.1.6 The Request Dispatching Process

The request accelerator maintains a set of RA requests in a queue, the RA queue. Whenever a connection becomes ready to issue the next request, a request is dequeued from the RA queue, if the queue is non-empty, and issued on the idle connection. If the queue is empty, a new fragment request is obtained from the SC. That request is then split into RA requests, which are queued on the RA queue. The queuing is preferably done in the order of the slices as returned by the process for finding the number of subrequests to make for a given fragment request.

HTTP connections may get shut down for various reasons, for example because a web server timeout has occurred or because the number of requests that can be issued on a single connection has been exceeded. The RA should handle this situation gracefully and transparently. Whenever a connection is shut down, the RA reopens the connection automatically. If a request was in progress on the closed connection, it is dequeued from the connection, and a new RA request for the portion not yet received is placed at the front of the RA queue.

This procedure ensures that closed connections have minimal impact on performance.
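The dispatching and recovery behavior can be sketched with a double-ended queue. The class `RaQueue`, its method names, and the `(url, start, length)` request tuple are hypothetical names introduced for this illustration.

```python
from collections import deque


class RaQueue:
    """Queue of RA requests, each a (url, start, length) byte-range tuple."""

    def __init__(self):
        self._q = deque()

    def enqueue_slices(self, slices):
        """Queue the slices of a freshly split fragment request, in order."""
        self._q.extend(slices)

    def next_request(self):
        """Dequeue the next request for an idle connection. None means the
        queue is empty and a new fragment request should be fetched."""
        return self._q.popleft() if self._q else None

    def on_connection_closed(self, request, bytes_received):
        """Re-queue the unreceived tail of an interrupted request at the front."""
        url, start, length = request
        remaining = length - bytes_received
        if remaining > 0:
            self._q.appendleft((url, start + bytes_received, remaining))


q = RaQueue()
q.enqueue_slices([("seg1.mp4", 0, 1000), ("seg1.mp4", 1000, 1000)])
in_flight = q.next_request()
q.on_connection_closed(in_flight, bytes_received=400)
```

Placing the remainder at the front keeps the most urgent bytes first, and only the missing tail of the range is requested again.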

6.1.7 RA Parameter Choice in a Specific Embodiment

A TCP connection is constrained by its flow control: the advertised receive window places an upper bound on the amount of data that is allowed to be unacknowledged at any point in time. Thus, if W denotes the size of the receive window and bdp denotes the bandwidth-delay product of that connection, we have bdp ≤ W (condition 1). The method of Section 6.1.4 chooses the receive window size such that condition (1) is satisfied provided c_w > 1. This ensures that individual connections cannot take substantially more than their fair share of the available bandwidth. To allow for rate increases and to avoid a downward rate spiral, it is preferable to choose c_w somewhat larger than 1, for example c_w = 2 or c_w = 4. The larger the value, the faster the rate can grow, but the less fair the connections are to one another.

Another limitation is imposed by the TCP congestion control process. If p denotes the packet loss probability and M denotes the TCP maximum segment size, then the rate r of a single connection is bounded as indicated by Equation 10.

$$r \le \frac{M}{\mathrm{RTT} \cdot \sqrt{p}}$$ (Equation 10)

Now, rewriting this in terms of the BDP and the number N of connections (using bdp = r · RTT and BDP = N · bdp), we obtain the inequality shown in Equation 11.

$$\frac{\mathrm{BDP}}{N} \le \frac{M}{\sqrt{p}}$$ (Equation 11)

This suggests that k_N in Equation 9 should be chosen slightly smaller than $M/\sqrt{p}$ to ensure that the inequality of Equation 11 holds. A typical value of M is 1 kilobyte, and if we set p = 0.01, then $M/\sqrt{p}$ = 10 kilobytes. Thus, in this example, setting k_N = 8,192 bytes, as suggested in Section 6.1.3 for setting N in Equation 9, ensures that the inequality of Equation 11 is satisfied. A receiver can be configured and programmed accordingly.
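The step from the choice of k_N to the per-connection bound can be spelled out as follows. This is a sketch under two assumptions that are reconstructed from the surrounding definitions rather than quoted: that the target number of connections has the form N = ⌈BDP / k_N⌉ when it is not capped by N_max, and that the per-connection bound reads BDP / N ≤ M / √p.

```latex
% Assumption: N = \lceil \mathrm{BDP}/k_N \rceil and N is not capped by N_{\max}.
\begin{align*}
N \ge \frac{\mathrm{BDP}}{k_N}
  \quad &\Longrightarrow \quad \frac{\mathrm{BDP}}{N} \le k_N , \\
k_N < \frac{M}{\sqrt{p}}
  \quad &\Longrightarrow \quad \frac{\mathrm{BDP}}{N} \le k_N < \frac{M}{\sqrt{p}} . \\
\intertext{With $M = 1$ kilobyte and $p = 0.01$:}
\frac{M}{\sqrt{p}} &= \frac{1\ \text{kilobyte}}{0.1} = 10\ \text{kilobytes} \;>\; 8192\ \text{bytes} = k_N .
\end{align*}
```

When N is capped at N_max, the first implication no longer holds, so the bound is only guaranteed while the uncapped value of N applies.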

We now turn to the process of Section 6.1.5 above for computing the number n of RA requests for a given source request. A priori, we would like to make the slices as small as possible, because small slices offer a number of advantages. If one connection is slow compared to the others, this is less likely to cause problems with small requests, since small requests complete quickly even on a slow connection. Thus, in a small-slice setting, a slow connection essentially just ends up servicing fewer requests. Another advantage of small slices is that they cause the RA to work on a relatively short section of the buffer in time, so it tends to concentrate its efforts on the most urgent area of work.

However, making the slices small comes at a cost. First, each request incurs some overhead, both on the uplink and on the downlink. Second, after finishing one request, a connection will remain idle for about one RTT. Hence, the request splitting process should ideally choose chunks that are as small as possible, subject to the conditions that it neither causes excessive uplink traffic nor substantially underutilizes the capacity of each available link. The desirable properties are therefore:

1. Aim for at most one request per connection per D_min of real time. This causes the uplink traffic to be bounded by a value proportional to N_target in the worst case.

2. Aim for at most one request per connection per c_s · RTT. This causes the activity time of a connection to be at least about c_s / (c_s + 1), i.e., close to 1 for a moderate c_s.

A good choice of D_min depends on the use case. Choosing it on the order of, but smaller than, the desired end-to-end delay typically yields a value around the duration of a fragment. If the end-to-end delay is large, larger buffers can be used, and the ill effect of larger slices is smaller. On the other hand, for short end-to-end delays, the buffers are small, and hence the slices should be small to keep slow connections from causing stalls. In that scenario, the higher cost of smaller requests is worth the stability gained in the buffer level.

The parameters used can be tuned according to a profile indicator in the MPD (Media Presentation Description), which is a summary of the properties of the media streamed to the client. Instead of downloading every media segment and showing it to the end user, the client may choose to "skip" segments based on the different use cases indicated by the profile in the MPD.

A lower bound on the choice of c_s can be devised as follows. If there are N connections open and the RA is active, there will on average be about N · c_s / (c_s + 1) active connections. To ensure that the receive windows of all N connections are, in aggregate, large enough to sustain the aggregate target rate, it is desirable that c_w · c_s / (c_s + 1) be at least 1.

This bound is conservative. The estimated number of active connections, N · c_s / (c_s + 1), is merely an average and does not take deviations into account, although some variance is likely. In practice, it is advisable to make c_s about two to three times the value suggested by the bound above; for example, when c_w = 3 and c_s = 6, c_w · c_s / (c_s + 1) is at least 2.5.

6.2 RA with Forward Error Correction

When data is received over several TCP connections, the connections will at times temporarily have different download rates. When a request for a fragment is split into several subrequests, the whole fragment is received only when the last subrequest response (chunk) has been received. This is a problem when a fragment needs to be received urgently, because one of the subrequests may be handled on a slow connection, preventing the fragment from being received quickly.

A content provider may provide, in addition to the video data, extra forward error correction ("FEC") repair data for each fragment, which the client can fetch to help reconstruct the original fragment. For example, suppose a client has 4 connections and urgently needs to receive a fragment of size 4000 bytes. Its request accelerator may split the fragment into 4 ranges of 1000 bytes each and issue one request on each of the 4 connections. It may be that connection 1 is fast and connection 4 is moderately fast, but the second and third connections are very slow. Thus, even though the total download rate is in principle high enough to download the whole fragment in time, the fragment may arrive only very slowly because connections 2 and 3 are stuck.

To avoid this problem, the client could try to use connection 1 to fetch the same data that connection 2 or 3 is fetching, as soon as connection 1 has finished handling its own subrequest. This can help, but the RA has to decide which connection needs the help more: connection 2 or connection 3. If it predicts wrongly, it downloads duplicate data needlessly, and the fragment may still not arrive in time.

A better request accelerator might instead use connection 1 to fetch some repair data. Repair (i.e., FEC-coded) data, if downloaded successfully, can be used to reconstruct the missing data regardless of whether it is the data from subrequest 2 or from subrequest 3 that is missing. The only constraint is that the amount of data received is sufficient to reconstruct the fragment. In other words, in our example, the number of repair bytes plus the number of fragment bytes received must be greater than or equal to 4000.
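The sufficiency condition from the 4000-byte example can be written down directly. This sketch assumes an idealized code in which any combination of source and repair bytes totaling the fragment size suffices; practical codes operate on whole symbols and may need slightly more. The function names and the per-connection byte counts are hypothetical.

```python
def repair_bytes_needed(fragment_size, received_per_connection):
    """Repair bytes still required so that source plus repair covers the fragment."""
    return max(0, fragment_size - sum(received_per_connection))


def can_reconstruct(fragment_size, received_per_connection, repair_bytes):
    """Idealized check: received source bytes plus repair bytes >= fragment size."""
    return sum(received_per_connection) + repair_bytes >= fragment_size


# Connections 1 and 4 finished their 1000-byte ranges; 2 and 3 are stuck.
received = [1000, 300, 200, 1000]
needed = repair_bytes_needed(4000, received)
```

The count depends only on the total number of bytes missing, so the RA does not have to guess which of the slow connections is worse off.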

In one implementation, the content provider provides access to FEC repair data for the coded video segments. It makes the repair data available in a way similar to the original video data. For example, it may provide, for each media segment file, an additional FEC file containing the repair information. The content provider may supply the information and parameters necessary to use FEC in the media presentation description. In another implementation, the media presentation description contains no information about FEC, but the client can access it using a common convention, such as a rule for deriving the name of the FEC repair URL from a segment URL.

A client implementation implements processes for deciding how and when to request repair data. The amount of repair data requested may depend on how much data is outstanding. It may also depend on how soon the fragment needs to be available. For example, if there is ample time left, one would expect to receive all the source data in time, so requesting any repair data is probably superfluous. On the other hand, if the fragment is becoming urgent, a stall is imminent if the client fails to obtain enough data for that fragment in time, so one would want to request a lot of repair data. Therefore, an implementation may set the amount of repair data requested to β(B)·S, where S is the amount of outstanding source data and β(B) is a decreasing function of the buffer level.
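One possible shape for such a policy is sketched below. The description only requires β to be a decreasing function of the buffer level; the piecewise-linear form, the thresholds `low_s` and `high_s`, and the function names are hypothetical choices made for illustration.

```python
def beta(buffer_level_s, low_s=2.0, high_s=10.0, beta_max=1.0):
    """Hypothetical decreasing function of the buffer level (in seconds):
    beta_max at or below low_s, 0 at or above high_s, linear in between."""
    if buffer_level_s <= low_s:
        return beta_max
    if buffer_level_s >= high_s:
        return 0.0
    return beta_max * (high_s - buffer_level_s) / (high_s - low_s)


def repair_bytes_to_request(buffer_level_s, outstanding_source_bytes):
    """Amount of repair data to request: beta(B) * S, rounded down."""
    return int(beta(buffer_level_s) * outstanding_source_bytes)


urgent = repair_bytes_to_request(1.0, 3000)    # nearly empty buffer
relaxed = repair_bytes_to_request(12.0, 3000)  # comfortable buffer
```

With a nearly empty buffer the sketch requests as much repair data as there is outstanding source data; with a comfortable buffer it requests none.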

Another implementation may make the amount of repair data requested proportional to the amount of data outstanding in the most incomplete request, rather than to the total amount outstanding.

6.2.1 Embodiment of a Repair Segment Generator

All calculations below that relate to how the DASH standard might use FEC, and in particular RaptorQ for FEC, are preferably performed using fixed-point/integer arithmetic. This includes calculating the number and positions of source symbols within a fragment of a representation; calculating the number and positions of repair symbols for a fragment within a repair segment should likewise be done using fixed-point arithmetic. The reason is that exactly the same result needs to be obtained by the ingestion process, which produces FEC repair fragments from source segments, and by the RA process, which decodes source fragments using combinations of received FEC repair fragments and source fragments; these calculations must therefore yield exactly the same result.

Using floating-point calculations instead of fixed-point arithmetic can result in subtle buggy behavior that is sometimes hard to track down, owing to the different corner-case behavior of different floating-point implementations on different platforms. This would not be acceptable in a standard in which both end points must produce exactly the same calculation result.

Repair segments can be generated in a separate process based on already-processed source segments that include sidx tables. In addition to the source segments themselves, the two inputs to the process are the repair fraction R and the symbol size S. To facilitate the use of fixed-point arithmetic for calculating the number and positions of repair symbols of repair fragments within a segment, the value of R can be expressed in per mille, i.e., R = 500 means that the fraction is 1/2.
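The benefit of the per-mille representation is that scaling by R needs only integer multiplication and division. The following sketch shows integer-only helpers of the kind such a generator could use; the function names, the choice of rounding down when scaling, and the example sizes are assumptions for illustration and are not the formulas of this description.

```python
def ceil_div(a, b):
    """Integer ceiling division for non-negative a and positive b,
    without any floating-point arithmetic."""
    return -(-a // b)


def scale_per_mille(num_bytes, r_per_mille):
    """Scale a byte count by R/1000 using integers only (rounds down)."""
    return (num_bytes * r_per_mille) // 1000


def symbols_covering(num_bytes, symbol_size):
    """Number of symbols of symbol_size bytes needed to cover num_bytes."""
    return ceil_div(num_bytes, symbol_size)


# R = 500 per mille (a fraction of 1/2) applied to a 1,000,000-byte segment.
repair_bytes = scale_per_mille(1_000_000, 500)
repair_symbols = symbols_covering(repair_bytes, 64)
```

Because every step is an integer operation, an ingestion process and a client that run the same formulas obtain bit-identical counts on any platform.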

Within each segment, at the beginning of the source segment, there is segment indexing information, which comprises a time/byte-offset segment map. The time/byte-offset segment map is a list of time/byte-offset pairs (T(0), B(0)), (T(1), B(1)), ..., (T(i), B(i)), ..., (T(n), B(n)). Here, T(i−1) represents the start time within the segment for playback of the i-th fragment of media, relative to the initial start time of the media among all media segments, and T(i) represents the end time of the i-th fragment (and thus the start time of the next fragment). The byte offset B(i−1) is the corresponding byte index of the beginning of the data within this source segment at which the i-th fragment of media starts, relative to the beginning of the source segment, and B(i) is the corresponding number of bytes in the segment up to and including the i-th fragment (thus B(i) is the index of the first byte of fragment i+1).

If the segment contains multiple media components, T(i) and B(i) may be provided for each component of the segment in an absolute way, or they may be expressed relative to another media component that serves as a reference media component. In any case, B(0) is the start byte index of the first fragment of the segment, which may be greater than zero because of the sidx information that precedes the first fragment of the segment. If B(0) is not zero, there are some repair symbols at the beginning of the repair segment that correspond to the sidx. Depending on the implementation, these first repair symbols may protect the data of the segment up to the beginning of the first fragment, or they may be padded-zero data bytes that are not used.

The repair ratio R may be signaled in the MPD along with the repair segment metadata, or obtained by other means (TBD). As an example of a value for R, if R = 500 then the repair segment size is (very closely) approximated as 0.5 times the size of the source segment from which it is generated, and the size of the repair fragment of the repair segment that corresponds to a source fragment within the source segment is also (more loosely) approximated as 0.5 times the size of that source fragment. For example, if a source segment contains 1,000 kilobytes of data, the corresponding repair segment contains approximately 500 kilobytes of repair data.

The value of S may also be signaled in the MPD along with the repair segment metadata, or obtained by other means. For example, S = 64 indicates that the source data and the repair data comprise symbols of 64 bytes each for the purposes of FEC encoding and decoding. The value of S may be chosen to be proportional to the streaming rate of the representation with which the source segment is associated. For example, if the streaming rate is 100 kbps, then S = 12 bytes may be appropriate, whereas S = 120 bytes may be appropriate at 1 Mbps and S = 1,200 bytes at 10 Mbps. One goal may be a good trade-off between how granularly fragments are partitioned into symbols and the processing requirements for FEC decoding relative to the streaming rate. For example, at a streaming rate of 1 Mbps with fragments of around 500 ms, each fragment is around 64 KB of data, and if S = 120 then the fragment consists of approximately 500 source symbols. Each symbol is then around 0.2% of the data needed to recover the source block, which means that the extra reception needed because of symbol granularity is upper bounded by 0.2% times the number of HTTP connections over which the fragment is being received. For example, if the number of HTTP connections is 6, the symbol granularity reception overhead is bounded by 1.2%.
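The arithmetic in the paragraph above can be checked with a short sketch. The function names and the scaling constant (12 bytes per 100 kbps) are illustrative assumptions read off the three example values in the text, not a rule stated by the source.

```python
def divceil(i: int, j: int) -> int:
    """Smallest integer that is at least i / j (integer arithmetic only)."""
    return (i + j - 1) // j


def symbol_size_for_rate(rate_bps: int) -> int:
    """Assumed proportional rule: 12 bytes of symbol size per 100 kbps."""
    return rate_bps * 12 // 100_000


def granularity_overhead(rate_bps: int, fragment_ms: int,
                         symbol_size: int, connections: int) -> float:
    """Upper bound on extra reception: one symbol per HTTP connection."""
    fragment_bytes = rate_bps * fragment_ms // 8000  # bits/s * ms -> bytes
    source_symbols = divceil(fragment_bytes, symbol_size)
    return connections / source_symbols


s = symbol_size_for_rate(1_000_000)
bound = granularity_overhead(1_000_000, 500, s, connections=6)
print(f"S = {s} bytes, overhead bound = {bound:.2%}")
```

With a 1 Mbps stream and 500 ms fragments this stays just under the 1.2% figure quoted in the text, because the fragment holds slightly more than 500 symbols.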

The repair segment may be generated for a source segment as follows. Each fragment of the source segment is considered a source block for FEC encoding purposes, and thus each fragment is treated as a sequence of source symbols of a source block from which repair symbols are generated. The total number of repair symbols generated for the first i fragments is calculated as TNRS(i) = divceil(R*B(i), S*1000), where divceil(I, J) is the function that outputs the smallest integer whose value is at least I divided by J, i.e., divceil(I, J) = (I+J-1) div J, where div is fixed-point division in which the result is rounded down to the nearest integer. Thus, the number of repair symbols generated for fragment i is NRS(i) = TNRS(i) - TNRS(i-1).
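A minimal sketch of the TNRS and NRS formulas above, using only integer arithmetic. The function names are illustrative; the sample values (R = 500, S = 64, B(1) = 6,410, B(2) = 6,770) are those of the FIG. 35 example in this document.

```python
def divceil(i: int, j: int) -> int:
    """divceil(I, J) = (I + J - 1) div J, with div rounding down."""
    return (i + j - 1) // j


def total_repair_symbols(r_per_mille: int, b_i: int, symbol_size: int) -> int:
    """TNRS(i) = divceil(R * B(i), S * 1000)."""
    return divceil(r_per_mille * b_i, symbol_size * 1000)


def repair_symbols_for_fragment(r_per_mille: int, b_prev: int, b_i: int,
                                symbol_size: int) -> int:
    """NRS(i) = TNRS(i) - TNRS(i-1), where b_prev = B(i-1) and b_i = B(i)."""
    return (total_repair_symbols(r_per_mille, b_i, symbol_size)
            - total_repair_symbols(r_per_mille, b_prev, symbol_size))


nrs = repair_symbols_for_fragment(500, 6410, 6770, 64)
print("repair symbols for fragment 2:", nrs)
```

Because TNRS depends only on the cumulative byte offset B(i), the per-fragment count falls out as a difference of two values and never requires walking the earlier fragments.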

The repair segment comprises the concatenation of the repair symbols for the fragments, where the order of the repair symbols within the repair segment follows the order of the fragments from which they are generated, and within a fragment the repair symbols are in order of their encoding symbol identifier ("ESI").

By defining the number of repair symbols for a fragment as described above, the total number of repair symbols for all previous fragments, and thus the byte index and byte range of the symbols of repair fragment i, depend only on R, S, B(i-1) and B(i), and do not depend on any of the previous or subsequent structure of the fragments within the source segment. This is advantageous because it allows a client to quickly compute the starting position of a repair block within the repair segment, and also the number of repair symbols within that repair block, using only local information about the structure of the corresponding fragment of the source segment from which the repair block is generated. Thus, if a client decides to start downloading and playing back a fragment from the middle of a source segment, it can also quickly locate and access the corresponding repair block within the corresponding repair segment.

The number of source symbols in the source block corresponding to fragment i is calculated as NSS(i) = divceil(B(i)-B(i-1), S). If B(i)-B(i-1) is not a multiple of S, the last source symbol is padded out with zero bytes for the purposes of FEC encoding and decoding, i.e., the last symbol is padded with zero bytes so that it is S bytes in size for FEC encoding and decoding, but these zero padding bytes are not stored as part of the source segment. In this embodiment, the ESIs for the source symbols are 0, 1, …, NSS(i)-1 and the ESIs for the repair symbols are NSS(i), …, NSS(i)+NRS(i)-1.
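The partitioning and ESI numbering described above might be sketched as follows. The function names are hypothetical, and the fragment is modeled as an in-memory `bytes` object purely for illustration; the zero padding exists only in the returned symbols, not in the stored fragment.

```python
def divceil(i: int, j: int) -> int:
    """Smallest integer that is at least i / j."""
    return (i + j - 1) // j


def split_into_source_symbols(fragment: bytes, symbol_size: int) -> list:
    """Split a fragment into NSS symbols of S bytes, zero-padding the last."""
    nss = divceil(len(fragment), symbol_size)
    padded = fragment.ljust(nss * symbol_size, b"\x00")
    return [padded[k * symbol_size:(k + 1) * symbol_size] for k in range(nss)]


def esi_ranges(nss: int, nrs: int):
    """ESIs 0..NSS-1 for source symbols, NSS..NSS+NRS-1 for repair symbols."""
    return range(0, nss), range(nss, nss + nrs)


fragment = b"\x01" * 360          # a 360-byte fragment, as in the FIG. 35 example
symbols = split_into_source_symbols(fragment, 64)
source_esis, repair_esis = esi_ranges(len(symbols), 2)
print(len(symbols), list(source_esis), list(repair_esis))
```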

In this embodiment, the URL for a repair segment may be generated from the URL for the corresponding source segment, for example by simply appending the suffix ".repair" to the URL of the source segment.

The repair segment may also be part of the corresponding source segment, e.g., appended at the end. The structure of a combined segment may also be such that the source fragments and the repair fragments are consecutive within the combined segment, i.e., the combined segment comprises the first source fragment, followed by the first repair fragment, followed by the second source fragment, followed by the second repair fragment, and so on. Those skilled in the art will recognize that the methods and processes described above can easily be adapted to apply to such combined segments.

6.2.2 Embodiment of a Request Accelerator Using Repair Segments

The repair indexing information and the FEC information for a repair segment are implicitly defined by the indexing information for the corresponding source segment and by the values of R and S, where R is expressed as an integer between 0 and 1000 indicating per mille, and S is expressed in bytes. The time offsets and the fragment structure of the repair segment are determined by the time offsets and structure of the corresponding source segment. The byte offsets of the beginning and end of the repair symbols in the repair segment corresponding to fragment i are calculated as RB(i-1) = S*divceil(R*B(i-1), S*1000) and RB(i) = S*divceil(R*B(i), S*1000), respectively. The number of bytes in the repair segment corresponding to fragment i is RB(i) - RB(i-1), and thus the number of repair symbols corresponding to fragment i is calculated as NRS(i) = (RB(i) - RB(i-1))/S. (Note that no divceil operation is needed here because the numerator is guaranteed to be a multiple of S, although divceil may be used and the result will still be correct.) The number of source symbols corresponding to fragment i may be calculated as NSS(i) = divceil(B(i)-B(i-1), S), where the last source symbol is padded with zeros for decoding purposes if necessary, in the same way as described for encoding. Thus, the repair indexing information for a repair block within a repair segment and the corresponding FEC information can be implicitly derived from R, S and the indexing information for the corresponding fragment of the corresponding source segment.

As an example, consider the example shown in FIG. 35, which shows a fragment 2 that starts at byte offset B(1) = 6,410 and ends at byte offset B(2) = 6,770, i.e., fragment 2 is 6,770 - 6,410 = 360 bytes in size and 6,770 is the start byte index of fragment 3. In this example, the symbol size is S = 64 bytes, and the dotted vertical lines show the byte offsets within the source segment that correspond to multiples of S. The overall repair segment size as a fraction of the source segment size is set to R = 500 per mille in this example (repair is approximately 1/2 of source). The number of source symbols in the source block for fragment 2 is calculated as NSS(2) = divceil(6,770 - 6,410, 64) = (6,770 - 6,410 + 64 - 1) div 64 = 6, and these six source symbols have ESIs 0, …, 5, respectively, where the first source symbol is the first 64 bytes of fragment 2, starting at byte index 6,410 within the source segment, the second source symbol is the next 64 bytes of fragment 2, starting at byte index 6,474 within the source segment, and so on. The end byte offset of the repair block corresponding to fragment 2 is calculated as RB(2) = 64 * divceil(500 * 6,770, 64 * 1,000) = 64 * ((3,385,000 + 64,000 - 1) div 64,000) = 64 * 53 = 3,392, and the start byte offset of the repair block corresponding to fragment 2 is calculated as RB(1) = 64 * divceil(500 * 6,410, 64 * 1,000) = 64 * ((3,205,000 + 64,000 - 1) div 64,000) = 64 * 51 = 3,264. Thus, in this example there are two repair symbols in the repair block corresponding to fragment 2, with ESIs 6 and 7, respectively, starting at byte offset 3,264 and ending at byte offset 3,392 within the repair segment.
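The worked example can be reproduced with a short sketch of the RB(i) formulas. The function names are illustrative, and the `range_header` helper is an assumption about how a client might turn the computed offsets into an HTTP byte-range request; the source does not prescribe that form.

```python
def divceil(i: int, j: int) -> int:
    """divceil(I, J) = (I + J - 1) div J."""
    return (i + j - 1) // j


def repair_block_byte_range(r_per_mille: int, symbol_size: int,
                            b_prev: int, b_i: int):
    """Return (RB(i-1), RB(i)): start and end byte offsets in the repair segment."""
    start = symbol_size * divceil(r_per_mille * b_prev, symbol_size * 1000)
    end = symbol_size * divceil(r_per_mille * b_i, symbol_size * 1000)
    return start, end


def range_header(start: int, end: int) -> str:
    """HTTP Range values are inclusive, so the last byte is end - 1."""
    return f"bytes={start}-{end - 1}"


start, end = repair_block_byte_range(500, 64, 6410, 6770)
nrs = (end - start) // 64          # exact: the difference is a multiple of S
nss = divceil(6770 - 6410, 64)
print(start, end, nrs, nss, range_header(start, end))
```

Only R, S, B(i-1) and B(i) are needed, which is what lets a client jump to the repair block of a fragment in the middle of a segment.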

This is shown in FIG. 35. Note that in the example shown in FIG. 35, even though R = 500 (repair is approximately 1/2 of source) and there are 6 source symbols corresponding to fragment 2, the number of repair symbols is not 3, as one might expect if the number of source symbols were simply used to calculate the number of repair symbols, but instead works out to be 2. As opposed to simply using the number of source symbols of a fragment to determine the number of repair symbols, the approach used here makes it possible to calculate the position of the repair block within the repair segment solely from the index information associated with the corresponding source block of the corresponding source segment. It is important that the number and positions of the repair symbols for a repair fragment within a repair segment be calculated using fixed-point arithmetic, so that the calculation is consistent between the ingestion process and the RA process. Furthermore, as the number K of source symbols in a source block grows, the number KR of repair symbols in the corresponding repair block is closely approximated by K*R/1,000, since in general KR is at most divceil(K*R, 1,000) and at least divfloor((K-1)*R, 1,000), where divfloor(I, J) = I div J.
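The stated bounds on KR can be spot-checked numerically. The sketch below is illustrative only: the function names are made up here, and the sweep over offsets and sizes is an arbitrary sample, not a proof.

```python
def divceil(i: int, j: int) -> int:
    return (i + j - 1) // j


def divfloor(i: int, j: int) -> int:
    return i // j


def repair_symbol_count(r: int, s: int, b_prev: int, b_i: int) -> int:
    """KR = TNRS(i) - TNRS(i-1), computed in fixed-point arithmetic."""
    return divceil(r * b_i, s * 1000) - divceil(r * b_prev, s * 1000)


def within_bounds(r: int, s: int, b_prev: int, b_i: int) -> bool:
    """Check divfloor((K-1)*R, 1000) <= KR <= divceil(K*R, 1000)."""
    k = divceil(b_i - b_prev, s)
    kr = repair_symbol_count(r, s, b_prev, b_i)
    return divfloor((k - 1) * r, 1000) <= kr <= divceil(k * r, 1000)


# FIG. 35 values: K = 6, so the bounds are 2 <= KR <= 3, and KR is 2.
print(repair_symbol_count(500, 64, 6410, 6770), within_bounds(500, 64, 6410, 6770))
```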

Illustrated Examples

FIG. 25 illustrates a rate selection process. The larger the settings for λ and μ, the more aggressive the setting. FIG. 23 illustrates different values for the parameter λ. FIG. 24 illustrates different values for the parameter μ. A hybrid setting tries to reduce rate fluctuation by two main mechanisms. The first is to be more cautious about increasing the rate when B is larger, and the second is to try harder to stay at the current rate when B is smaller.

Example settings for pker x.y: C = x*min(y*Tdl, B) include x.y set to 8.1, 4.2, 2.4, 4.4 or other x.y values. Note that the actual averaging window of pker is longer than C because download suspension periods are skipped. For EWMA, there is no skipping; assume that the rate in a download suspension period is the same as that of the last download period.

For MWA (Moving Window Average), H(z) = (1/D)*((1-z^-D)/(1-z^-1)), where D is the window size. X_i = min{R_k : k ≥ i}, where R_k is the EWMA of the rate with weight W_k, and W_1 < W_2 < W_3 < …. For EWMA, H(z) = (1-β)/(1-β*z^-1), where β is the weight of the previous average. MWA and EWMA are approximately equivalent in some cases.
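In the time domain, the two transfer functions correspond to a D-sample moving average and to the recursion y[n] = β*y[n-1] + (1-β)*x[n]. The sketch below illustrates both; the class names, the warm-up behavior of the moving window (averaging over fewer than D samples at the start) and the initialization of the EWMA with its first sample are assumptions made for this example.

```python
from collections import deque


class MovingWindowAverage:
    """Average of the last D samples: H(z) = (1/D) * (1 - z^-D) / (1 - z^-1)."""

    def __init__(self, window: int):
        self.samples = deque(maxlen=window)

    def update(self, x: float) -> float:
        self.samples.append(x)
        # Assumption: during warm-up, average over the samples seen so far.
        return sum(self.samples) / len(self.samples)


class Ewma:
    """Exponentially weighted average: H(z) = (1 - beta) / (1 - beta * z^-1)."""

    def __init__(self, beta: float):
        self.beta = beta      # weight of the previous average
        self.value = None

    def update(self, x: float) -> float:
        if self.value is None:
            self.value = x    # Assumption: seed with the first sample.
        else:
            self.value = self.beta * self.value + (1 - self.beta) * x
        return self.value


mwa, ewma = MovingWindowAverage(3), Ewma(0.5)
for rate in (3.0, 6.0, 9.0, 12.0):
    print(mwa.update(rate), ewma.update(rate))
```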

If the adaptive estimator has a long averaging window, it reduces the rate switch frequency while maintaining approximately the same average rate for live streaming. Different settings work well for different scenarios. Aggressive settings work well for more stable scenarios, while less aggressive settings are better suited to more volatile scenarios. If the bandwidth is higher than the highest representation rate by some margin for a significant portion of the time (measured, e.g., as the percentage of time during which the 20-second average is higher than the rate cap), it is beneficial to move to a more aggressive setting. Ideally, the device should be able to detect scenario types and apply the appropriate setting. Scenario detection may be based on factors such as the radio technology type, the number of rate changes within some unit of time, the speed of movement, and so on. A simpler strategy can be based on the above observation: use a more aggressive setting when the "overall" bandwidth is higher than the rate cap.

8. Setting of Rate Selection Parameters

In this section, examples of setting the rate selection parameters are provided.

For MLB, EFF = 1 - Rv/Rdl, where Rv is the current rate of the selected representation and Rdl is the current download rate. The proposed rule is as follows:

If EFF < 0, possibly go down more than one rate.

If 0 <= EFF < 0.1, go down one rate.

If 0.1 <= EFF < 0.6, stay at the current rate.

If 0.6 <= EFF < 0.8, go up one rate.

If 0.8 <= EFF <= 1, possibly go up more than one rate.

Let alpha = Rv/Rdl. Then this translates roughly into:

If alpha <= 0.4, go up at least one rate.

If 0.4 < alpha <= 0.9, stay at the same rate.

If 0.9 < alpha, go down at least one rate.
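The two rule lists above can be expressed as a small decision function. This is a sketch for illustration: the function name and the returned labels are made up here, and the thresholds are the ones listed in the text.

```python
def mlb_action(rv: float, rdl: float) -> str:
    """Map EFF = 1 - Rv/Rdl to the action named by the proposed rule."""
    eff = 1 - rv / rdl
    if eff < 0:
        return "down, possibly more than one rate"
    if eff < 0.1:
        return "down one rate"
    if eff < 0.6:
        return "stay"
    if eff < 0.8:
        return "up one rate"
    return "up, possibly more than one rate"


def alpha_action(rv: float, rdl: float) -> str:
    """Coarser equivalent in terms of alpha = Rv/Rdl."""
    alpha = rv / rdl
    if alpha <= 0.4:
        return "up at least one rate"
    if alpha <= 0.9:
        return "stay"
    return "down at least one rate"


for rv in (100, 300, 500, 950, 1200):
    print(rv, mlb_action(rv, 1000), "|", alpha_action(rv, 1000))
```

Since alpha = 1 - EFF, the alpha form merges the two "down" cases and the two "up" cases of the EFF form.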

Putting this into the context of the DASH client rate selection process:

Let RUP be the rate of the representation corresponding to UP, let RDOWN be the rate of the representation corresponding to DOWN, and, as above, let Rv be the rate of the currently selected representation. RUP is chosen as large as possible such that RUP <= lambda(t)*Rdl, and RDOWN is chosen as large as possible such that RDOWN <= mu(t)*Rdl. The parameter t = B/(D+delta), where B is the current amount of presentation time in the media buffer, D is a bound on the time until the next possible switch point beyond the point at which the current decision is being made, and delta is a small parameter that accounts for network latency and round-trip time; for example, delta may be set to 1 or 2 seconds as an approximation, or delta may be set according to a measured upper bound on the current RTT.

The overall choice of the next rate RNEXT is as follows:

If RUP < Rv then RNEXT = min{Rv, RDOWN}; otherwise RNEXT = RUP.
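A minimal sketch of this selection rule follows. The function names and the sample rate ladder are hypothetical, and the fallback to the lowest representation when no rate satisfies the bound is an assumption, since the text does not say what happens in that case.

```python
def buffer_ratio(b: float, d: float, delta: float) -> float:
    """t = B / (D + delta)."""
    return b / (d + delta)


def largest_rate_at_most(rates, limit: float) -> float:
    """Largest available representation rate not exceeding limit."""
    candidates = [r for r in rates if r <= limit]
    # Assumption: fall back to the lowest representation if none qualifies.
    return max(candidates) if candidates else min(rates)


def next_rate(rates, rv: float, rdl: float, lam: float, mu: float) -> float:
    """RNEXT = min(Rv, RDOWN) if RUP < Rv, otherwise RUP."""
    rup = largest_rate_at_most(rates, lam * rdl)
    rdown = largest_rate_at_most(rates, mu * rdl)
    if rup < rv:
        return min(rv, rdown)
    return rup


ladder = [250, 500, 750, 1000]   # hypothetical representation rates in kbps
for rdl in (1300, 700, 400):
    print(rdl, next_rate(ladder, rv=500, rdl=rdl, lam=0.6, mu=0.9))
```

In a full client, lam and mu would be lambda(t) and mu(t) evaluated at the current t; they are passed in as plain numbers here to keep the rule itself visible.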

The above MLB parameters can be approximated by setting lambda(t) = 0.4*R and mu(t) = 0.9 for all t, where R is the ratio of the rate of the next higher representation to the rate of the current representation. For example, if the current rate is 500 Kbps and the next higher rate is 750 Kbps, then R = 1.5 and thus lambda(t) = 0.6. This approximates the MLB process as follows.

At a decision point, if EFF >= 0.6, i.e., alpha <= 0.4, then Rv <= 0.4*Rdl, in which case RUP will be at least Rv*R (since lambda(t) = 0.4*R for all t), and thus RNEXT = RUP, i.e., the rate can go up to the next higher representation, of rate Rv*R. If Rv is much smaller than 0.4*Rdl, then RUP will be larger than Rv*R (depending on the granularity of the representation rates); if EFF is greater than, for example, 0.8, RUP will be one or more rates above Rv*R. If EFF < 0.1 then Rv > 0.9*Rdl, in which case RDOWN will be less than Rv (since RDOWN <= 0.9*Rdl) and the rate will go down, i.e., RNEXT < Rv. If EFF is between 0.1 and 0.6, then RUP <= Rv*R and RDOWN >= Rv, in which case RNEXT will be chosen equal to Rv.

Rate Selection Parameter Sets

The tables below specify some possible rate selection parameter sets. The values of lambda and mu for intermediate values of t that are not shown in the tables below should be calculated by linearly interpolating between the surrounding values. The values of lambda and mu for values of t beyond those shown in the tables below should be set to the lambda and mu values for the maximum value of t shown.
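The interpolation and clamping rule might be implemented as in the sketch below. The table values are placeholders invented for this example (the actual parameter sets are those of FIG. 36, which is not reproduced here), and clamping below the smallest tabulated t is an assumption; the text only specifies the behavior beyond the largest t.

```python
from bisect import bisect_right


def interpolate(table, t: float) -> float:
    """Piecewise-linear lookup in a sorted list of (t, value) pairs."""
    ts = [point[0] for point in table]
    values = [point[1] for point in table]
    if t <= ts[0]:
        return values[0]      # Assumption: clamp below the first entry.
    if t >= ts[-1]:
        return values[-1]     # Per the text: use the value at the maximum t.
    hi = bisect_right(ts, t)
    lo = hi - 1
    frac = (t - ts[lo]) / (ts[hi] - ts[lo])
    return values[lo] + frac * (values[hi] - values[lo])


# Hypothetical lambda(t) table, for illustration only.
lambda_table = [(0.0, 0.25), (1.0, 0.5), (2.0, 1.0)]
for t in (0.5, 1.0, 1.5, 5.0):
    print(t, interpolate(lambda_table, t))
```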

If the constraints mu(t) <= t and lambda(t) <= t are satisfied for all t, then theoretically there should be no stalls in playback. From a practical point of view, however, it may be preferable to accept a small stall in playback rather than to avoid any stall by continuing playout at a much lower rate; for example, a jump from 1 Mbps to 20 kbps may be a worse experience than a jump from 1 Mbps to 250 kbps with a one-second pause in between. Note that the minimum values of lambda and mu are set in the tables of FIG. 36, and that a stall may occur for values where mu(t) > t and/or lambda(t) > t (although a stall can occur in any case where the buffer is this empty, regardless of the settings of lambda(t) and mu(t)).

As has now been explained, a client device can provide rate adaptation and download processes for adaptive video streaming over HTTP. Clients that stream video over the Internet (and other networks) face the problem of fluctuating bandwidth. If a high-quality video is streamed, the link may at times not be fast enough, causing the player to interrupt playback and rebuffer. On the other hand, low-quality video uses much less bandwidth but provides a worse user experience. One solution is to adjust the video quality adaptively: choose a higher quality when the throughput is high, and switch down automatically when it is not.

However, adaptive video streaming poses a number of challenges: (1) The process or algorithm for choosing the video rate (quality) should act quickly enough to adapt to rate decreases as well as rate increases. At the same time, it should avoid premature or erratic decisions and avoid unnecessary rate switches. The client should aim to fetch data at a rate high enough that a high video quality can be achieved. At the same time, the download process should ensure that the data is received in a timely manner: each frame should be received in its entirety before it is played out. These goals should be achievable without requiring an unnecessarily large playback buffer. One problem with large buffers is that, for live events, the amount of video in the buffer is limited by the target end-to-end latency, which severely restricts the playback buffer that is possible in these cases. In addition, depending on a large buffer causes undesirable delays at playback start and on seeks, because the buffer needs to be prefilled. Also, a large playback buffer uses a lot of memory, which may not be plentiful on mobile phones and other client devices.

To address these problems, a process for rate estimation that reacts quickly to rate changes can be used. The rate estimate is an adaptive windowed average that is particularly well suited for use in streaming video. The rate estimator takes into account the video buffer level and the change in the video buffer level in such a way that the estimate is guaranteed to adapt quickly enough when needed, while keeping the windowing width large (and thus the measurement variance small). The guarantees provided by the process are: (a) if B is the amount of video data in the buffer (in seconds of playback time) when a rate drop occurs, the estimator will have adjusted its rate estimate within the time it takes for the buffer to drain to B/2; and (b) if B is the amount of data in the buffer when a rate increase occurs, the rate estimator adjusts to the new rate quickly enough that the increase could in principle be seen within a time of at most 3*B (given a smart rate change process).

A rate decision process can make rate decisions such that (a) the buffer is filled when it is at a low level, (b) the buffer is used to avoid erratic rate changes, even when low download rate estimates are observed, and (c) in a stable-rate scenario, the correct steady rate is chosen quickly. Multimedia download strategies for HTTP are used that (a) allow for accurate rate estimates, (b) can achieve the link capacity even when network delays and packet loss rates are high, and (c) achieve timely delivery of the stream. To achieve this, we can use multiple HTTP connections, decompose media requests into smaller chunk requests depending on network conditions, synchronize the connections using TCP flow control mechanisms, and request data in bursts. We can also use an HTTP pipelining process to keep the connections busy.

A number of features, aspects and details have now been described. As explained, in various embodiments method steps may be performed by corresponding programmed elements, instructions provided to a processor, hardware or other apparatus, as will be apparent to those skilled in the art. Likewise, elements may be enabled by processes or program elements. The structure of elements of an embodiment may simply comprise a set of instructions executed by a processor, but described herein as a corresponding method step.

In various embodiments, download rate acceleration may or may not be used. One example of download rate acceleration is a method or apparatus that accelerates downloads by using HTTP requests over TCP connections. A TCP connection has a particular window size, and the nodes at the ends of the TCP connection can vary the setting for the window size. One novelty is setting the window size for successive HTTP requests such that the size is a function of the target download rate. Thus, as the target download rate changes, the TCP window size can change.

In one embodiment, a method and/or apparatus or computer-readable medium is used for controlling data download over a network path between a source and a receiver coupled by the network path. The method comprises: for each of a plurality of TCP connections between the source and the receiver, determining a TCP receiver window size for that TCP connection, where a TCP connection between the source and the receiver can be a direct connection or an indirect connection; determining a target download rate for media content, where the target download rate varies between at least two values for at least two consecutive HTTP requests; and using each TCP connection of the plurality of TCP connections to download media data elements of the media content to be downloaded, where the media content is a portion of, or all of, a response to a plurality of HTTP requests. The determined TCP receiver window size for a given TCP connection is determined based, at least in part, on the target download rate, and the determined TCP receiver window size varies between at least two values for at least two consecutive HTTP requests.

The determined TCP receiver window size for a current TCP connection is determined based, at least in part, on the product of a current estimated round-trip time ("ERTT") for the current TCP connection and a multiplier rate, where the multiplier rate lies within a range bounded by the target download rate for the current TCP connection and a rate that is higher than the target download rate by a predetermined amount. The current ERTT can be determined by measuring the minimum observed RTT over an immediately preceding measurement period, such as one second, ten seconds, or fifty seconds. The current ERTT can also be determined by a measurement at the end of a quiescent period, where the quiescent period follows a download period and is a period in which no active HTTP requests have been presented over the TCP connections for a predetermined duration. The target download rate can be proportional to the current aggregate download rate over all TCP connections used, divided by the number of TCP connections used, for example two or three times the current aggregate download rate. The target download rate can be proportional to the playback rate of the media content, where the playback rate is the aggregate rate across all TCP connections used divided by the number of TCP connections used. Each media data element can be divided into a number of chunks having sizes within a predetermined range of variance, where the number of such chunks is based on the number of TCP connections used. The number of such chunks can be further based on at least one of the current ERTT for the current TCP connections, the current download rate, and/or the size of the media fragment being requested. The predetermined range of variance can be zero, so that each chunk has the same size per fragment request, and the number of chunks is then the number of TCP connections used times a predetermined factor. A later HTTP request for a subsequent media data element can be assigned to the first available TCP connection.
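The window-size and chunking rules above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function names, the `headroom` fraction used to place the multiplier rate above the target rate, and the `factor` parameter are all hypothetical.

```python
import math


def receiver_window_bytes(ertt_s: float, target_rate_bps: float,
                          headroom: float = 0.25) -> int:
    """Window size = ERTT x multiplier rate, converted from bits to bytes."""
    if ertt_s <= 0 or target_rate_bps <= 0 or headroom < 0:
        raise ValueError("ertt_s and target_rate_bps must be > 0, headroom >= 0")
    # The multiplier rate lies between the target rate and a somewhat higher
    # rate; expressing that margin as a fraction is an assumption.
    multiplier_rate_bps = target_rate_bps * (1.0 + headroom)
    return math.ceil(ertt_s * multiplier_rate_bps / 8.0)


def chunk_ranges(fragment_bytes: int, num_connections: int, factor: int = 1):
    """Split a fragment into num_connections * factor near-equal byte ranges."""
    if fragment_bytes <= 0 or num_connections <= 0 or factor <= 0:
        raise ValueError("all arguments must be positive")
    n = min(num_connections * factor, fragment_bytes)
    base, extra = divmod(fragment_bytes, n)
    ranges, start = [], 0
    for i in range(n):
        size = base + (1 if i < extra else 0)  # sizes differ by at most 1 byte
        ranges.append((start, start + size - 1))  # inclusive byte offsets
        start += size
    return ranges
```

Because the window is recomputed from the current ERTT and target rate, calling `receiver_window_bytes` before each HTTP request yields a window that changes as the target download rate changes.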

The controlling can also include determining a number of TCP connections to use between the source and the receiver, where the number is greater than one and is determined based, at least in part, on at least one determined network condition, and using each of that number of TCP connections to download a plurality of media data elements of the media content to be downloaded, where the media content is a portion of, or all of, a response to a plurality of HTTP requests. The number of TCP connections used can be based on the estimated round-trip time ("ERTT") for the TCP connections, the target download rate, and an estimate of the loss rate. The loss rate can be estimated to be 1% or 0.1%. The number of TCP connections to use can be between two and sixteen, inclusive, and/or proportional to the product of (a) the target download rate, (b) the ERTT, and (c) the square root of the estimated loss rate. For each of the TCP connections, a TCP receiver window size can be determined for that TCP connection based on the target download rate, where the determined TCP receiver window size varies between at least two values for at least two consecutive HTTP requests.
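A small sketch of the connection-count rule may help. The text gives only the proportionality and the 2 to 16 range; the proportionality constant below is an assumption borrowed from the commonly cited TCP throughput approximation (per-connection rate roughly `c * MSS / (RTT * sqrt(p))`), and the names `mss_bytes` and `c` are hypothetical.

```python
import math


def num_tcp_connections(target_rate_bps: float, ertt_s: float,
                        loss_rate: float = 0.01, mss_bytes: int = 1460,
                        c: float = 1.22, lo: int = 2, hi: int = 16) -> int:
    """Connection count proportional to target rate x ERTT x sqrt(loss rate)."""
    if target_rate_bps <= 0 or ertt_s <= 0 or not 0 < loss_rate < 1:
        raise ValueError("invalid rate, ERTT, or loss rate")
    # Assumed proportionality constant: the inverse of the bits one connection
    # moves per (RTT * sqrt(p)) under the approximation named above.
    k = 1.0 / (c * mss_bytes * 8)
    n = math.ceil(k * target_rate_bps * ertt_s * math.sqrt(loss_rate))
    return max(lo, min(hi, n))  # clamp to the inclusive range [2, 16]
```

With the default constants, a 20 Mbit/s target over a 100 ms path at 1% estimated loss yields 15 connections, while very low or very high products are clamped to 2 and 16.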

In one embodiment, a method and/or apparatus or computer-readable medium is used for download rate estimation that considers the presentation buffer and estimates the download rate based on how big, full, or empty the buffer is, that is, on its level. For example, consider estimating a download rate at a receiver coupled to data sources by a network path having finite bandwidth, where the download rate is the rate at which data can be received at the receiver over the network path. The estimating can comprise: monitoring a presentation buffer of the receiver, where the presentation buffer stores media data at least between the time the media data is received and the time the media data is consumed by a presentation element associated with the receiver; determining a nonzero estimation period on which the estimate of the download rate is to be based; storing indications of buffer levels over the estimation period, where the buffer level at a given time corresponds, at least approximately, to how much of the presentation buffer is occupied at that time by media data that has been received but not yet consumed by the presentation element; and using the stored indications as part of a measure of the estimated download rate.

The presentation element can comprise a display and an audio output. The estimation period can have a duration proportional to the measured buffer level, with a predetermined proportionality factor. The duration of the estimation period can be taken to be proportional to the number of bytes of unconsumed media data in the presentation buffer at the measuring time, and/or a function of the addition rate at which media is added to the presentation buffer, and/or proportional to the time used to download a predetermined portion of the presentation buffer. The predetermined duration can correspond to the duration over which a predetermined portion of the contents of the presentation buffer was downloaded. The estimation period can be the lesser of the time over which a predetermined portion of the contents of the presentation buffer was downloaded and the presentation time of the media data present in the presentation buffer.
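One of the variants above, an estimation period proportional to the buffer level, can be sketched as follows. The class name, the `factor` value, and the `min_period_s` floor (used here only to keep the period nonzero) are hypothetical choices, not values taken from the text.

```python
class BufferAwareRateEstimator:
    """Estimates download rate over a window that grows with the buffer level."""

    def __init__(self, factor: float = 0.5, min_period_s: float = 0.5):
        self.factor = factor              # assumed proportionality factor
        self.min_period_s = min_period_s  # assumed floor, keeps period nonzero
        self.samples = []                 # (timestamp_s, cumulative_bytes)

    def record(self, timestamp_s: float, cumulative_bytes: int) -> None:
        """Store one observation of the total bytes downloaded so far."""
        self.samples.append((timestamp_s, cumulative_bytes))

    def estimate_bps(self, now_s: float, buffer_level_s: float):
        """Return bits/second over the estimation period, or None if unknown."""
        period = max(self.min_period_s, self.factor * buffer_level_s)
        cutoff = now_s - period
        window = [(t, b) for (t, b) in self.samples if t >= cutoff]
        if len(window) < 2:
            return None
        (t0, b0), (t1, b1) = window[0], window[-1]
        if t1 <= t0:
            return None
        return 8.0 * (b1 - b0) / (t1 - t0)
```

A nearly empty buffer gives a short window, so the estimate reacts quickly to rate drops; a full buffer gives a long window, which smooths out short-term fluctuations.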

In one embodiment, a method and/or apparatus or computer-readable medium is used for playback rate selection, where the playback rate is the rate at which media is consumed from the presentation buffer, measured in memory units per time, such as megabits per second. When a receiver makes a request for some media, there is a playback rate for that media. Often, but perhaps not always, higher quality media has a higher playback rate, which presents a trade-off. Which playback rate to use or request is, at least at times, a function of how much media is in the presentation buffer. A receiver can receive media for playing out using a presentation element of the receiver, where the playing out results in media being consumed from the presentation buffer at a playback rate and the receiver is configured to select from among a plurality of playback rates. The selection comprises: monitoring the presentation buffer, where the presentation buffer stores media data at least between the time the media data is received and the time the media data is consumed by the presentation element associated with the receiver; storing an indication of a buffer level, where the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element; determining an estimated download rate; using the stored indication and the estimated download rate to compute a target playback rate; and selecting from among the plurality of playback rates according to the target playback rate.

The selected playback rate can be less than or equal to a predetermined multiplier times the estimated download rate, where the predetermined multiplier is an increasing function of the buffer level. The predetermined multiplier can be an affine linear function of the playback duration of the media data in the presentation buffer, and it can be less than one when the buffer level of the presentation buffer is less than a threshold amount. The predetermined multiplier can be greater than or equal to one when the presentation duration of the media data in the presentation buffer is greater than or equal to a preset maximum amount of presentation time. The predetermined multiplier can be a piecewise linear function of the playback duration of the media in the presentation buffer. The selected playback rate can be less than or equal to a predetermined multiplier times the estimated download rate, where the predetermined multiplier is an increasing function of the number of bytes of media data in the presentation buffer. The playback rate can be selected to be the largest available playback rate, among the plurality of playback rates, that is less than or equal to a proportional factor times the download rate estimate, where the proportional factor is an increasing function of the playback duration of the media data in the presentation buffer divided by an estimate of the reaction time to rate changes. The reaction time can be an upper bound on the presentation time between switch points in the media data, and/or the estimate of the reaction time can be an average of the presentation time between switch points in the media data. The estimate of the reaction time can be greater than or equal to a predetermined constant times the estimated round-trip time ("ERTT").
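The multiplier-based selection can be illustrated with a short sketch. The piecewise linear shape follows the text, but the specific constants (`base`, `slope`, `cap`) and the fallback to the lowest rate when no rate fits are assumptions made for illustration.

```python
def rate_multiplier(buffer_s: float, reaction_s: float, base: float = 0.5,
                    slope: float = 0.1, cap: float = 1.5) -> float:
    """Piecewise linear, non-decreasing in buffer duration / reaction time."""
    if buffer_s < 0 or reaction_s <= 0:
        raise ValueError("buffer_s must be >= 0 and reaction_s > 0")
    x = buffer_s / reaction_s
    # Below 1 for a nearly empty buffer, rising linearly, then flat at cap.
    return min(cap, base + slope * x)


def select_playback_rate(available_bps, est_download_bps: float,
                         buffer_s: float, reaction_s: float) -> float:
    """Largest available rate not exceeding multiplier x estimated download rate."""
    limit = rate_multiplier(buffer_s, reaction_s) * est_download_bps
    candidates = [r for r in available_bps if r <= limit]
    # Assumption: fall back to the lowest rate when nothing fits the limit.
    return max(candidates) if candidates else min(available_bps)
```

With an empty buffer the multiplier is 0.5, so the receiver requests media at half the estimated download rate or less and the buffer fills; with a deep buffer the multiplier exceeds one and the receiver can sustain a rate above the estimate for a while.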

A receiver that receives media for playing out using a presentation element of the receiver, where the playing out results in media being consumed from a presentation buffer at a playback rate and the receiver is configured to select from among a plurality of playback rates, can comprise a method or apparatus for: monitoring the presentation buffer, where the presentation buffer stores media data at least between the time the media data is received and the time the media data is consumed by the presentation element associated with the receiver; storing an indication of a buffer level, where the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element; using the stored indication of the buffer level and an allowed variance of the buffer level to compute a target playback rate; and selecting from among the plurality of playback rates according to the target playback rate.

The playback rate can be selected based on a high proportional factor, a low proportional factor, a download rate estimate, the current playback rate, the buffer level, and an estimate of the reaction time to rate changes. The high proportional factor and the low proportional factor can both be increasing functions and/or piecewise linear functions of the playback duration of the media data in the presentation buffer divided by the estimate of the reaction time to rate changes, and the high proportional factor can be greater than or equal to the low proportional factor. The playback rate can be the same as the previous playback rate if the previous playback rate is between the low proportional factor times the estimated download rate and the high proportional factor times the download rate estimate. If the previous playback rate is above the high proportional factor times the download rate estimate, the playback rate can be selected to be the largest available playback rate that is no greater than the high proportional factor times the estimated download rate. If the previous playback rate is below the low proportional factor times the download rate estimate, the playback rate can be selected to be the largest available playback rate that is no greater than the low proportional factor times the download rate estimate.
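The three cases above form a hysteresis band around the previous rate, which can be sketched as follows. The function name and the fallback to the lowest available rate when no rate fits under the bound are assumptions; the two factors are passed in directly rather than derived from the buffer level.

```python
def select_with_hysteresis(available_bps, prev_rate_bps: float,
                           est_download_bps: float,
                           low_factor: float, high_factor: float) -> float:
    """Keep the previous rate inside the band; otherwise re-select."""
    if low_factor > high_factor:
        raise ValueError("high_factor must be >= low_factor")
    low = low_factor * est_download_bps
    high = high_factor * est_download_bps
    if low <= prev_rate_bps <= high:
        return prev_rate_bps  # inside the band: no switch
    # Above the band: bound by the high factor. Below: bound by the low factor.
    bound = high if prev_rate_bps > high else low
    candidates = [r for r in available_bps if r <= bound]
    # Assumption: fall back to the lowest rate when nothing fits the bound.
    return max(candidates) if candidates else min(available_bps)
```

The gap between the two factors is what suppresses oscillation: a small change in the download rate estimate moves both bounds slightly but usually leaves the previous rate inside the band.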

In one embodiment, a method and/or apparatus or computer-readable medium is used for making requests and also for determining whether to cancel requests that are in process. As a receiver makes requests for segments, portions, or fragments of media, receives a response to a request, stores the media from the response, and possibly makes another request, it might determine that cancelling a request and issuing a different request is preferable. The playback rate of the media might be determined by the receiver being as aggressive as possible and choosing the highest playback rate that it expects to obtain without the presentation buffer running out of media as the media is consumed. If the download rate drops suddenly, the receiver decides whether to cancel its current request and make a new request for lower playback rate media, or to let the current request play out. Cancelling a high playback rate request and replacing it with a lower playback rate request can make the contents of the presentation buffer last longer, but cancelling a request midstream can cause the loss of any partially received media for that request.

In one such embodiment, a receiver receives media for playing out using a presentation element of the receiver, where the playing out results in media being consumed from a presentation buffer at a playback rate and the receiver is configured to select from among a plurality of playback rates. Determining a request action comprises: monitoring the presentation buffer, where the presentation buffer stores media data at least between the time the media data is received and the time the media data is consumed by the presentation element associated with the receiver; storing an indication of a buffer level, where the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element; maintaining a state of an issued request for downloading a selected first chunk of media data; and, when an issued request is outstanding, determining, based on network conditions and the state of the issued request, whether to continue the request or cancel the request.

Determining whether to continue the request or cancel the request can comprise determining whether there is enough time to complete the download for the request before the first media data should be played out and, if there is not enough time, cancelling the request. Determining whether to continue the request or cancel the request can further comprise: detecting, based on download rates and media consumption rates, that a stall will occur; estimating a stall period between the time when the presentation element is unable to consume media data at the rate dictated by the media being consumed and the time when the presentation element is able to resume consuming media data at that rate; determining the effect that continuing or cancelling would have on the stall period; and, if cancelling the request would shorten the stall period, cancelling the request.
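The stall-based decision can be sketched under simplifying assumptions: a constant download rate, a single outstanding request, and a hypothetical replacement request of `replacement_bytes` (for example, the same presentation interval at a lower playback rate) that would be issued after cancelling. None of these names or simplifications come from the text.

```python
def stall_period_s(bytes_to_download: int, download_bps: float,
                   time_until_playout_s: float) -> float:
    """Seconds by which the download would miss its playout deadline."""
    if download_bps <= 0:
        raise ValueError("download_bps must be positive")
    finish_s = bytes_to_download * 8.0 / download_bps
    return max(0.0, finish_s - time_until_playout_s)


def should_cancel(remaining_bytes: int, replacement_bytes: int,
                  download_bps: float, time_until_playout_s: float) -> bool:
    """Cancel only if doing so would shorten the estimated stall period."""
    stall_if_continue = stall_period_s(remaining_bytes, download_bps,
                                       time_until_playout_s)
    # Cancelling discards partially received data, so the whole replacement
    # chunk has to be downloaded before the same playout deadline.
    stall_if_cancel = stall_period_s(replacement_bytes, download_bps,
                                     time_until_playout_s)
    return stall_if_cancel < stall_if_continue
```

When the current request would finish in time, both stall periods are zero and the function returns `False`, so the request is left to complete.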

Other features can include selecting a second chunk of media data, where the second chunk of media data has a start presentation time that is the same as the start presentation time of the first chunk of media data, and requesting a download of the second chunk of media data; or selecting a second chunk of media data, where the second chunk of media data has a start presentation time that is later than that of the first chunk of media data, and requesting a download of the second chunk of media data. The second chunk of media data can be chosen by the receiver so that the difference between its start presentation time and the start presentation time of the first chunk is the smallest available to the receiver, and/or so that its playback rate is the maximum playback rate subject to a predetermined maximum gap between its start presentation time and the start presentation time of the first chunk of media data.

Some embodiments can include determining whether a download of the remaining portion of the first chunk of media data cannot be completed in time for playback, determining whether a download of the second chunk of media data can be completed in time for playback, and basing the decision on whether to continue the request, or to cancel the request for the first chunk of media data and instead request the second chunk of media data, on those two determinations. The playback rate of the media data in the second chunk can be chosen to be the highest playback rate supported at the receiver. The receiver can request media data covering the presentation time of at least some media data already in the presentation buffer, download the requested media data, play out the requested media data, and discard at least some of the corresponding media data already in the presentation buffer. The playback rate of the requested media data can be the maximum playback rate, subject to a constraint on the maximum presentation duration of the corresponding media data discarded from the presentation buffer. The requested media data can be chosen so that its start presentation time is the earliest start presentation time available to the receiver.

In some receivers, the downloading depends on the buffer level, and the receivers use the concept of a high watermark and a low watermark. In such a receiver, media data is downloaded from a source and stored in a presentation buffer of the receiver. A fill level (or just "level") of the presentation buffer is determined, where the fill level represents what portion of the presentation buffer contains media data not yet consumed by the presentation element. If the fill level is above a high fill threshold ("high watermark"), the downloading stops, and if the fill level is below a low fill threshold ("low watermark"), the downloading restarts. The fill level can be updated as media data is consumed by the presentation element. The fill level can be measured in units of memory storage capacity and/or units of presentation time. The downloading can be based on an estimated round-trip time ("ERTT"), where the ERTT is reset when the media data download is restarted. If the downloading occurs over a plurality of TCP connections, the TCP connections used can be reset when the media data download is restarted. The high fill and low fill thresholds can vary over time.
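The watermark behavior is a two-state controller, sketched below with the fill level measured in seconds of presentation time. The class and attribute names are hypothetical, and the `restarted` flag merely marks the point where a receiver could reset its ERTT estimate or its TCP connections.

```python
class WatermarkDownloader:
    """Stops downloading above the high watermark, restarts below the low one."""

    def __init__(self, low_watermark_s: float, high_watermark_s: float):
        if low_watermark_s >= high_watermark_s:
            raise ValueError("low watermark must be below high watermark")
        self.low = low_watermark_s
        self.high = high_watermark_s
        self.downloading = True   # assumption: start in the downloading state
        self.restarted = False    # True only on the update that restarts

    def update(self, fill_level_s: float) -> bool:
        """Apply a new fill level and return whether downloading is active."""
        self.restarted = False
        if self.downloading and fill_level_s > self.high:
            self.downloading = False
        elif not self.downloading and fill_level_s < self.low:
            self.downloading = True
            self.restarted = True  # hook point for resetting ERTT/connections
        return self.downloading
```

Between the two watermarks the controller keeps its previous state, so the download does not toggle on and off with every small change in the fill level.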

Further embodiments can be envisioned by one of ordinary skill in the art after reading this disclosure. In other embodiments, combinations or sub-combinations of the invention disclosed above can be advantageously made. The example arrangements of components are shown for purposes of illustration, and combinations, additions, rearrangements, and the like are contemplated in alternative embodiments of the present invention. Thus, while the invention has been described with respect to exemplary embodiments, one skilled in the art will recognize that numerous modifications are possible.

For example, the processes described herein can be implemented using hardware components, software components, and/or any combination thereof. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. It will, however, be evident that various modifications and changes can be made thereunto without departing from the broader spirit and scope of the invention as set forth in the claims, and that the invention is intended to cover all modifications and equivalents within the scope of the following claims.

Claims

A method for selecting a playback rate in a receiver that receives media for playing out using a presentation element of the receiver,
wherein the playing out results in media being consumed from a presentation buffer at the playback rate and the receiver is configured to select from among a plurality of playback rates, the method comprising:
monitoring the presentation buffer, wherein the presentation buffer stores media data at least between the time the media data is received and the time the media data is consumed by the presentation element associated with the receiver;
storing an indication of a buffer level, wherein the buffer level corresponds to how much of the presentation buffer is occupied by media data that has been received but not yet consumed by the presentation element;
determining an estimated download rate;
using the stored indication and the estimated download rate to compute a target playback rate; and
selecting from among the plurality of playback rates according to the target playback rate.

The method according to claim 1,
wherein the selected playback rate is less than or equal to a predetermined multiplier times the estimated download rate, and wherein the predetermined multiplier is an increasing function of the buffer level.

The method of claim 2,
wherein the predetermined multiplier is an affine linear function of the playback duration of the media data in the presentation buffer.

The method of claim 2,
wherein the predetermined multiplier is less than one when the buffer level of the presentation buffer is less than a threshold amount.

The method of claim 2,
wherein the predetermined multiplier is greater than or equal to one when the presentation duration of the media data in the presentation buffer is greater than or equal to a preset maximum amount of presentation time.

The method of claim 2,
wherein the predetermined multiplier is a piecewise linear function of the playback duration of the media data in the presentation buffer.

The method according to claim 1,
wherein the selected playback rate is less than or equal to a predetermined multiplier times the estimated download rate, and wherein the predetermined multiplier is an increasing function of the number of bytes of media data in the presentation buffer.

The method according to claim 1,
wherein the playback rate is selected to be the largest available playback rate, among the plurality of playback rates, that is less than or equal to a proportional factor times the estimated download rate, and wherein the proportional factor is an increasing function of the playback duration of the media data in the presentation buffer divided by an estimate of the reaction time to rate changes.

The method of claim 8,
wherein the estimate of the reaction time is an upper bound on the presentation time between switch points in the media data.

The method of claim 8,
wherein the estimate of the reaction time is an average of the presentation time between switch points in the media data.

wherein the estimate of the reaction time is greater than or equal to a predetermined constant times an estimated round-trip time ("ERTT").

A method for selecting a playback rate in a receiver that receives media for playout using a presentation element of the receiver,
Wherein the playout results in the media being consumed from a presentation buffer at the playback rate and the receiver is configured to select from a plurality of playback rates, the method comprising:
Monitoring the presentation buffer, wherein the presentation buffer stores media data between at least the time at which the media data is received and the time at which the media data is consumed by the presentation element associated with the receiver;
Storing an indication of a buffer level, the buffer level corresponding to how much of the presentation buffer is occupied by media data that has been received but has not yet been consumed by the presentation element;
Determining an allowed deviation of the buffer level;
Using the stored indication of the buffer level and the allowed deviation of the buffer level to calculate a target playback rate; and
Selecting from among the plurality of playback rates in accordance with the target playback rate.

13. The method of claim 12,
Wherein the playback rate is selected based on a high proportional factor, a low proportional factor, a download rate estimate, a current playback rate, the buffer level, and an estimate of the reaction time to rate changes.

14. The method of claim 13,
Wherein the high proportional factor and the low proportional factor are both increasing functions of the playback duration of the media data in the presentation buffer divided by the estimate of the reaction time to rate changes.

14. The method of claim 13,
Wherein the high proportional factor is greater than or equal to the low proportional factor.

14. The method of claim 13,
Wherein the playback rate is selected to be equal to the previous playback rate if the previous playback rate is between the low proportional factor times the download rate estimate and the high proportional factor times the download rate estimate.

14. The method of claim 13,
Wherein the playback rate is selected to be the largest available playback rate that is not greater than the high proportional factor times the download rate estimate if the previous playback rate is greater than the high proportional factor times the download rate estimate.

14. The method of claim 13,
Wherein the playback rate is selected to be the largest available playback rate that is not greater than the low proportional factor times the download rate estimate if the previous playback rate is less than the low proportional factor times the download rate estimate.
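Taken together, the three preceding dependent claims describe a hysteresis band: the rate is held while the previous rate lies between the low and high proportional factors times the download rate estimate, and is re-selected against the nearer edge of the band otherwise. The sketch below illustrates that rule. The function name, the numeric factors, and the fallback to the lowest available rate when no rate fits are assumptions.

```python
def select_with_hysteresis(available_rates, previous_rate, download_estimate,
                           low_factor, high_factor):
    """Select a playback rate using a low/high proportional-factor band."""
    if low_factor > high_factor:
        raise ValueError("high factor must be >= low factor")
    low = low_factor * download_estimate
    high = high_factor * download_estimate

    # Inside the band: keep the previous rate to avoid needless switches.
    if low <= previous_rate <= high:
        return previous_rate

    # Above the band: cap at the high edge. Below the band: cap at the low edge.
    cap = high if previous_rate > high else low
    candidates = [r for r in available_rates if r <= cap]
    # Assumption: fall back to the lowest rate if none fits under the cap.
    return max(candidates) if candidates else min(available_rates)


RATES_KBPS = [500, 1000, 2000, 4000]
# Band for a 2000 kbps estimate: 0.6 * 2000 = 1200 up to 1.2 * 2000 = 2400.
held = select_with_hysteresis(RATES_KBPS, 2000, 2000, 0.6, 1.2)
stepped_down = select_with_hysteresis(RATES_KBPS, 4000, 2000, 0.6, 1.2)
stepped_up = select_with_hysteresis(RATES_KBPS, 500, 2000, 0.6, 1.2)
```

Because an upward move is capped by the low factor rather than the high one, the rule steps up conservatively, which is consistent with the stated aim of avoiding erratic rate changes.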

A receiver that receives media for playout using a presentation element of the receiver, the playout consuming media data at a playback rate, the receiver comprising:
A presentation interface for providing playback at one of a plurality of playback rates;
A presentation buffer, coupled to the presentation interface, that stores the media data between at least the time at which the media data is received and the time at which the media data is consumed by the presentation element associated with the receiver;
Storage for variables relating to presentation buffer capacity, including an indication of a buffer level, the buffer level corresponding to how much of the presentation buffer is occupied by media data that has been received but has not yet been consumed by the presentation element;
An estimated download rate determiner for determining an estimated download rate; and
Logic for calculating a target playback rate using the stored indication and the estimated download rate, and for making requests according to a playback rate selected in accordance with the target playback rate.
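The receiver claim above names two pieces of state: a buffer level (media received but not yet consumed) and an estimated download rate. A minimal sketch of how such state might be kept follows. The class names, the choice of seconds of media as the buffer-level unit, and the plain bytes-over-time average are assumptions for illustration; the claim does not specify how the estimate is computed.

```python
from dataclasses import dataclass


@dataclass
class PresentationBuffer:
    """Tracks media that has been received but not yet consumed."""
    received_seconds: float = 0.0
    consumed_seconds: float = 0.0

    def on_received(self, seconds: float) -> None:
        self.received_seconds += seconds

    def on_consumed(self, seconds: float) -> None:
        # Consumption cannot run past what has been received.
        self.consumed_seconds = min(self.received_seconds,
                                    self.consumed_seconds + seconds)

    @property
    def level(self) -> float:
        """Buffer level, in seconds of playable media."""
        return self.received_seconds - self.consumed_seconds


class DownloadRateEstimator:
    """Hypothetical estimator: total bits received over total download time."""

    def __init__(self) -> None:
        self.total_bytes = 0
        self.total_seconds = 0.0

    def observe(self, num_bytes: int, seconds: float) -> None:
        self.total_bytes += num_bytes
        self.total_seconds += seconds

    def estimate_bps(self) -> float:
        if self.total_seconds <= 0:
            return 0.0
        return self.total_bytes * 8 / self.total_seconds


buf = PresentationBuffer()
buf.on_received(10.0)
buf.on_consumed(4.0)

est = DownloadRateEstimator()
est.observe(num_bytes=1000, seconds=2.0)
```

The selection logic would read `buf.level` and `est.estimate_bps()` when computing a target playback rate.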

20. The receiver of claim 19,
Wherein the selected playback rate is a playback rate that is less than or equal to a predetermined multiplication of the estimated download rate, wherein the predetermined multiplication is an increasing function of the buffer level, and wherein the predetermined multiplication is an affine linear function of the playback duration of the media data in the presentation buffer.

21. The receiver of claim 20,
Wherein the predetermined multiplication is less than one when the buffer level of the presentation buffer is less than a threshold amount.

21. The receiver of claim 20,
Wherein the selected playback rate is a playback rate that is less than or equal to a predetermined multiplication of the estimated download rate and wherein the predetermined multiplication is an increasing function of the number of bytes of media data in the presentation buffer.

20. The receiver of claim 19,
Wherein the playback rate is selected as the largest available playback rate among a plurality of playback rates that is less than or equal to a proportional factor multiplied by the download rate estimate, and wherein the proportional factor is an increasing function of the playback duration of the media data in the presentation buffer divided by an estimate of the reaction time to rate changes.

24. The receiver of claim 23,
Wherein the estimate of the reaction time is an upper bound on the presentation time between switch points of the media data or an average of the presentation time between switch points of the media data.

24. The receiver of claim 23,
Wherein the estimate of the reaction time is greater than or equal to a predetermined constant times the estimated round-trip time ("ERTT").

A non-transitory computer readable medium for execution by a processor of a receiver that plays out media using a presentation element of the receiver,
Wherein the playout results in the media being consumed from a presentation buffer at the playback rate and the receiver is configured to select from a plurality of playback rates,
The medium comprising:
Program code for monitoring the presentation buffer, the presentation buffer storing media data between at least the time at which the media data is received and the time at which the media data is consumed by the presentation element associated with the receiver;
Program code for storing an indication of a buffer level, the buffer level corresponding to how much of the presentation buffer is occupied by media data that has been received but has not yet been consumed by the presentation element;
Program code for determining an allowed deviation of the buffer level;
Program code for using the stored indication of the buffer level and the allowed deviation of the buffer level to calculate a target playback rate; and
Program code for selecting from among the plurality of playback rates in accordance with the target playback rate.

27. The non-transitory computer readable medium of claim 26,
Further comprising program code for selecting the playback rate based on a high proportional factor, a low proportional factor, a download rate estimate, a current playback rate, the buffer level, and an estimate of the reaction time to rate changes.

28. The non-transitory computer readable medium of claim 27,
Wherein the high proportional factor and the low proportional factor are both increasing functions of the playback duration of the media data in the presentation buffer divided by the estimate of the reaction time to rate changes.

28. The non-transitory computer readable medium of claim 27,
Wherein the high proportional factor is greater than or equal to the low proportional factor.

28. The non-transitory computer readable medium of claim 27,
Further comprising program code for comparing and selecting the playback rate to be equal to the previous playback rate if the previous playback rate is between the low proportional factor times the download rate estimate and the high proportional factor times the download rate estimate.

28. The non-transitory computer readable medium of claim 27,
Further comprising program code for comparing and selecting the playback rate to be the largest available playback rate that is not greater than the high proportional factor times the download rate estimate if the previous playback rate is greater than the high proportional factor times the download rate estimate.

28. The non-transitory computer readable medium of claim 27,
Further comprising program code for comparing and selecting the playback rate to be the largest available playback rate that is not greater than the low proportional factor times the download rate estimate if the previous playback rate is less than the low proportional factor times the download rate estimate.