KR102249993B1

KR102249993B1 - Apparatus and method for controlling resource reselection in vehicle communication system

Info

Publication number: KR102249993B1
Application number: KR1020190174856A
Authority: KR
Inventors: 김성륜; 오승은; 한규원
Original assignee: 연세대학교 산학협력단
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2021-05-07

Abstract

The present invention provides a device and method for controlling resource reselection in a vehicle communication system, wherein location information and the packet reception rate are received from each of at least one vehicle, a resource maintenance possibility of each vehicle is initialized as to an initial resource maintenance probability in accordance with the number of surrounding vehicles of each vehicle determined from the received location information of each of the at least one vehicle, and thereafter, the resource maintenance probability is updated in response to a change in the packet reception rate of each of the received at least one vehicle to be transmitted to the corresponding vehicle, thereby improving the packet reception rate and the instantaneous data rate.

Description

Resource reselection control device and method of vehicle communication system {APPARATUS AND METHOD FOR CONTROLLING RESOURCE RESELECTION IN VEHICLE COMMUNICATION SYSTEM}

본 발명은 차량 통신 시스템의 자원 재선택 제어 장치 및 방법에 관한 것으로, 강화 학습을 이용하는 차량 통신 시스템의 자원 재선택 제어 장치 및 방법에 관한 것이다.The present invention relates to an apparatus and method for controlling resource reselection in a vehicle communication system, and to an apparatus and method for controlling resource reselection in a vehicle communication system using reinforcement learning.

6GHz 이하 대역을 활용하는 차량 네트워크의 대표적인 표준으로는 셀룰러-차량사물 통신(Cellular Vehicle to Everything Communication: 이하 C-V2X)과 IEEE 802.11p 등이 있다. 그리고 기존의 C-V2X 표준에는 자동 스케쥴링 (Autonomous scheduling) 기반 C-V2X 모드4(mode 4)와 네트워크 제어 스케쥴링(Network controlled scheduling) 기반 C-V2X 모드3(mode 3)를 포함하고 있다.Typical standards for vehicle networks utilizing a band below 6GHz include Cellular Vehicle to Everything Communication (C-V2X) and IEEE 802.11p. In addition, the existing C-V2X standard includes C-V2X mode 4 (mode 4) based on automatic scheduling and C-V2X mode 3 (mode 3) based on network controlled scheduling.

이중 C-V2X mode 4는 각 차량들이 일정 수의 패킷을 전송하게 되면 센싱 정보를 바탕으로 하여 기존에 사용하던 자원 블록(resource block)을 그대로 사용할지 아니면 다른 자원 블록을 선택 통신할지를 결정한다. 따라서 기존 인프라(노변 장치(Road Side Unit: RSU), 기지국 등)를 활용하지 않아, 인프라를 거치면서 발생하는 추가 레이턴시(additional latency)가 없는 반면, 인프라를 활용하는 기법 대비 자원 선택의 효율이 떨어진다.Of these, in C-V2X mode 4, when each vehicle transmits a certain number of packets, it determines whether to use the existing resource block as it is or to select another resource block based on the sensing information. Therefore, the existing infrastructure (Road Side Unit (RSU), base station, etc.) is not used, so there is no additional latency that occurs while going through the infrastructure, whereas the efficiency of resource selection is lower than the technique using the infrastructure. .

그에 반해 C-V2X 모드3에서는 기지국이 센싱한 정보를 바탕으로 자원 블록을 차량들에 직접 할당해주는 방식을 이용한다. 따라서 인프라를 통한 통신으로 인한 추가 레이턴시가 발생하지만 효율적인 자원 선택을 통한 데이터 율이 향상되는 이득이 있다.In contrast, in C-V2X mode 3, resource blocks are directly allocated to vehicles based on information sensed by the base station. Therefore, additional latency occurs due to communication through the infrastructure, but there is a benefit of improving the data rate through efficient resource selection.

한편 최근에는 자동 드라이빙 기술 수준의 향상과 차량 내 어플리케이션의 다양화로 인해 더 높은 데이터 율에 대한 요구를 지원할 수 있는 MAC(Media Access Control)의 연구의 필요성이 높아지고 있다. 이에 따라 6GHz 이상, 즉 밀리미터파를 활용한 방향성 전송에 대한 연구가 집중적으로 이루어지고 있다.Meanwhile, the need for a study of MAC (Media Access Control) that can support the demand for a higher data rate is increasing due to the improvement of the level of automatic driving technology and the diversification of in-vehicle applications. Accordingly, research on directional transmission using more than 6 GHz, that is, millimeter wave, is being intensively conducted.

기지국이 센싱한 정보를 바탕으로 자원 할당하는 C-V2X 모드3에서 모든 차량들이 6GHz 이상 주파수 대역으로 방향성 안테나를 활용하면 데이터 율 이득이 저하될 수 있다. 그러나 C-V2X 모드4의 경우, 6GHz 이상 주파수 대역의 밀리미터파를 활용하더라도 성능의 향상이 크게 나타나지 않는다는 한계가 있다.In C-V2X mode 3, in which resources are allocated based on information sensed by the base station, if all vehicles use a directional antenna in a frequency band of 6 GHz or higher, the data rate gain may decrease. However, in the case of C-V2X mode 4, there is a limitation that performance improvement is not significantly improved even when millimeter waves in a frequency band of 6 GHz or higher are used.

한국 공개 특허 제10-2019-0114757호 (2019.10.10 공개)Korean Patent Publication No. 10-2019-0114757 (published on Oct. 10, 2019)

본 발명의 목적은 주파수 재선택 효율을 향상시킬 수 있는 차량 통신 시스템의 자원 재선택 제어 장치 및 방법을 제공하는데 있다.An object of the present invention is to provide an apparatus and method for controlling resource reselection of a vehicle communication system capable of improving frequency reselection efficiency.

본 발명의 다른 목적은 주파수 재선택의 효율을 향상시켜 패킷 수신률과 순간 데이터율을 향상시킬 수 있는 차량 통신 시스템의 자원 재선택 제어 장치 및 방법을 제공하는데 있다.Another object of the present invention is to provide an apparatus and method for controlling resource reselection of a vehicle communication system capable of improving a packet reception rate and an instantaneous data rate by improving the efficiency of frequency reselection.

상기 목적을 달성하기 위한 본 발명의 일 실시예에 따른 차량 통신 시스템의 자원 재선택 제어 장치는 적어도 하나의 차량 각각으로부터 위치 정보와 패킷 수신율(packet reception ratio: 이하 PRR)을 수신하여, 수신된 적어도 하나의 차량 각각의 위치 정보로부터 판별되는 각 차량의 주변 차량 수에 따라 각 차량의 자원 유지 확률을 초기 자원 유지 확률로 초기화하고, 이후, 수신된 적어도 하나의 차량 각각의 PRR의 변화에 대응하여 자원 유지 확률을 업데이트하여 대응하는 차량으로 전송한다.The apparatus for controlling resource reselection of a vehicle communication system according to an embodiment of the present invention for achieving the above object receives location information and a packet reception ratio (PRR) from each of at least one vehicle, and receives the received at least The resource maintenance probability of each vehicle is initialized as the initial resource maintenance probability according to the number of surrounding vehicles of each vehicle determined from the location information of each vehicle, and thereafter, resources in response to a change in PRR of each received at least one vehicle The maintenance probability is updated and transmitted to the corresponding vehicle.

상기 자원 재선택 제어 장치는 적어도 하나의 차량 각각으로부터 수신된 PRR의 변화에 따라 기지정된 방식으로 가중치를 업데이트하고 업데이트된 가중치를 적용하여 자원 유지 확률을 획득할 수 있다.The apparatus for controlling resource reselection may obtain a resource maintenance probability by updating a weight in a predetermined manner according to a change in PRR received from each of at least one vehicle and applying the updated weight.

상기 자원 재선택 제어 장치는 적어도 하나의 차량(i) 각각에 대한 자원 유지 확률(p_i)을 가중치(w_i)로부터 수학식 The resource reselection control device calculates the resource retention probability (p _i ) for each of at least one vehicle (i) from the weight (w _i )

(여기서 t는 시간, u(t)는 유틸리티 함수로서 PRR을 나타내고, α는 유틸리티 가중 파라미터이다.)에 따라 획득할 수 있다.(Where t is time, u(t) represents PRR as a utility function, and α is a utility weighting parameter).

상기 자원 재선택 제어 장치는 수신된 적어도 하나의 차량 각각의 위치 정보로부터 판별되는 각 차량의 주변 차량 수에서 최대값과 최소값을 분석하고, 적어도 하나의 차량 각각을 중심으로 기준 거리 이내에 위치한 주변 차량의 수가 상기 최대값에 근접할수록 상기 초기 자원 유지 확률을 작게 설정하고, 상기 주변 차량의 수가 상기 최소값에 근접할수록 기지정된 범위 내의 최대값에 가깝게 되도록 설정할 수 있다.The resource reselection control device analyzes the maximum value and the minimum value from the number of surrounding vehicles of each vehicle determined from the received location information of each of the at least one vehicle, and As the number approaches the maximum value, the initial resource retention probability may be set to be smaller, and as the number of nearby vehicles approaches the minimum value, the number may be set to be closer to a maximum value within a predetermined range.

상기 자원 재선택 제어 장치는 적어도 하나의 차량(i) 각각에 대한 주변 차량 수(n_i)와 판별된 상기 최대값(N)과 상기 최소값(n)으로부터 초기 자원 유지 확률(p₀)을 수학식 _{The resource reselection control device calculates an initial resource retention probability (p 0} ) _{from the number of surrounding vehicles (n i} ) for each of at least one vehicle (i) and the determined maximum value (N) and the minimum value (n). expression

(여기서 d_th은 차량(i)을 중심으로 주변 차량을 판별하기 위한 기준 거리이다.)에 따라 획득할 수 있다.(Here, d _th is a reference distance for discriminating surrounding vehicles around the vehicle i).

상기 적어도 하나의 차량 각각은 상기 자원 재선택 제어 장치에서 업데이트되어 자원 유지 확률에 따라 자원 재선택 확률를 변경하고, 변경된 자원 재선택 확률에 따라 자원 재선택 여부를 결정할 수 있다.Each of the at least one vehicle may be updated in the resource reselection control apparatus to change a resource reselection probability according to a resource maintenance probability, and determine whether to reselect a resource according to the changed resource reselection probability.

상기 자원 재선택 제어 장치는 차량 통신 시스템의 인프라스트럭쳐로 구현될 수 있다.The resource reselection control device may be implemented as an infrastructure of a vehicle communication system.

상기 목적을 달성하기 위한 본 발명의 다른 실시예에 따른 차량 통신 시스템의 자원 재선택 제어 방법은 적어도 하나의 차량 각각으로부터 위치 정보와 패킷 수신율(PRR)을 수신하여, 수신된 적어도 하나의 차량 각각의 위치 정보로부터 판별되는 각 차량의 주변 차량 수에 따라 각 차량의 자원 유지 확률을 초기 자원 유지 확률로 초기화하는 단계; 및 이후, 수신된 적어도 하나의 차량 각각의 PRR의 변화에 대응하여 자원 유지 확률을 업데이트하는 단계를 포함한다.A method for controlling resource reselection of a vehicle communication system according to another embodiment of the present invention for achieving the above object receives location information and packet reception rate (PRR) from each of at least one vehicle, Initializing a resource maintenance probability of each vehicle as an initial resource maintenance probability according to the number of surrounding vehicles of each vehicle determined from the location information; And thereafter, updating the resource maintenance probability in response to a change in the PRR of each of the received at least one vehicle.

따라서, 본 발명의 실시예에 따른 차량 통신 시스템의 자원 재선택 제어 장치 및 방법은 인프라와 지향성 안테나를 사용하는 다수의 차량들로 이루어진 네트워크 상에서 각 차량이 업로드한 정보들을 바탕으로 인프라가 강화 학습을 통해 네트워크 파라미터를 업데이트하고, 업데이트된 네트워크 파라미터를 기반으로 각 차량들이 자원 재선택 확률을 가변하여 주파수 재선택의 효율을 향상시킬 수 있으며, C-V2X 모드4를 활용했을 때보다 패킷 수신률과 순간 데이터율을 향상시킬 수 있다.Accordingly, in the apparatus and method for controlling resource reselection of a vehicle communication system according to an embodiment of the present invention, the infrastructure performs reinforcement learning based on information uploaded by each vehicle on a network consisting of a plurality of vehicles using the infrastructure and the directional antenna. Network parameters are updated through the network parameters, and the efficiency of frequency reselection can be improved by varying the resource reselection probability of each vehicle based on the updated network parameters. The rate can be improved.

도 1은 C-V2X 모드4에서 각 차량이 자원을 재선택하는 개념을 설명하기 위한 도면이다.
도 2는 본 발명의 일 실시예에 따른 차량 통신 시스템에서 자원 재선택 제어 장치가 각 차량이 자원을 재선택할 수 있도록 자원 재선택 확률을 업데이트하는 개념을 설명하기 위한 도면이다.
도 3은 도 2의 자원 재선택 제어 장치가 각 차량의 위치에 기반하여 자원 재선택 확률을 초기화 하는 과정을 설명하기 위한 도면이다.
도 4는 본 발명의 일 실시예에 따른 차량 통신 시스템의 자원 재선택 제어 방법을 나타낸다.
도 5는 본 발명의 차량 통신 시스템의 자원 재선택 제어 방법에 따른 패킷 수신율 성능을 나타낸다.1 is a diagram for explaining a concept in which each vehicle reselects a resource in C-V2X mode 4;
FIG. 2 is a diagram illustrating a concept of updating a resource reselection probability so that each vehicle can reselect a resource by a resource reselection control apparatus in a vehicle communication system according to an embodiment of the present invention.
FIG. 3 is a diagram illustrating a process of initializing a resource reselection probability based on the location of each vehicle by the resource reselection control apparatus of FIG. 2.
4 illustrates a method for controlling resource reselection in a vehicle communication system according to an embodiment of the present invention.
5 shows packet reception rate performance according to the resource reselection control method of the vehicle communication system of the present invention.

본 발명과 본 발명의 동작상의 이점 및 본 발명의 실시에 의하여 달성되는 목적을 충분히 이해하기 위해서는 본 발명의 바람직한 실시예를 예시하는 첨부 도면 및 첨부 도면에 기재된 내용을 참조하여야만 한다. In order to fully understand the present invention, operational advantages of the present invention, and objects achieved by the implementation of the present invention, reference should be made to the accompanying drawings illustrating preferred embodiments of the present invention and the contents described in the accompanying drawings.

이하, 첨부한 도면을 참조하여 본 발명의 바람직한 실시예를 설명함으로써, 본 발명을 상세히 설명한다. 그러나, 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며, 설명하는 실시예에 한정되는 것이 아니다. 그리고, 본 발명을 명확하게 설명하기 위하여 설명과 관계없는 부분은 생략되며, 도면의 동일한 참조부호는 동일한 부재임을 나타낸다. Hereinafter, the present invention will be described in detail by describing a preferred embodiment of the present invention with reference to the accompanying drawings. However, the present invention may be implemented in various different forms, and is not limited to the described embodiments. In addition, in order to clearly describe the present invention, parts irrelevant to the description are omitted, and the same reference numerals in the drawings indicate the same members.

명세서 전체에서, 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라, 다른 구성요소를 더 포함할 수 있는 것을 의미한다. 또한, 명세서에 기재된 "...부", "...기", "모듈", "블록" 등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어나 소프트웨어 또는 하드웨어 및 소프트웨어의 결합으로 구현될 수 있다. Throughout the specification, when a certain part "includes" a certain component, it means that other components may be further included, rather than excluding other components unless specifically stated to the contrary. In addition, terms such as "... unit", "... group", "module", and "block" described in the specification mean a unit that processes at least one function or operation, which is hardware, software, or hardware. And software.

도 1은 C-V2X 모드4에서 각 차량이 자원을 재선택하는 개념을 설명하기 위한 도면이다.1 is a diagram for explaining a concept in which each vehicle reselects a resource in C-V2X mode 4;

일반적으로 C-V2X 모드4에서 자원 예약은 도 1에 도시된 바와 같이 시간축과 주파수축의 2차원 공간에서 이루어지며, 시간 축에서 자원은 다수의 서브 프레임(Sub-frame)으로 구분되고, 주파수 축에서 자원은 다수의 서브 채널(Sub-channel)로 구분된다. 그리고 다수의 서브 프레임과 다수의 서브 채널에 의해 각각 구분되는 각 블록은 자원(R)으로서 각각 다수의 자원 블록으로 구성될 수 있다.In general, in C-V2X mode 4, resource reservation is made in a two-dimensional space of the time axis and the frequency axis as shown in FIG. 1, and in the time axis, the resource is divided into a plurality of sub-frames, and in the frequency axis. Resources are divided into a number of sub-channels. In addition, each block divided by a plurality of sub-frames and a plurality of sub-channels is a resource R, and may be composed of a plurality of resource blocks, respectively.

C-V2X 모드4에서는 자원 스케쥴링을 수행하기 위한 별도의 조정자가 없기 때문에 각 차량이 데이터를 전송할 때 충돌이 발생할 가능성이 있다. 이에 각 차량은 데이터 충돌 발생이 최소화되도록 센싱(Senging), 선택(Selection) 및 재선택(Reselection)의 과정을 수행하여 반복적으로 자원을 선택한다.In C-V2X mode 4, since there is no separate coordinator for performing resource scheduling, there is a possibility of a collision when each vehicle transmits data. Accordingly, each vehicle repeatedly selects a resource by performing a process of sensing, selection, and reselection to minimize the occurrence of data collision.

C-V2X 모드4에서 각 차량은 우선 데이터를 전송할 자원을 선택하기 이전에 다른 차량의 자원 사용 상태를 센싱한다. 이는 다른 차량에 의해 사용되는 자원(R)을 분석하여 전송 데이터의 충돌을 방지하기 위함으로, 각 차량은 각각의 자원(R)을 센싱하여 다른 차량에 의해 사용되고 있는 것으로 판별되는 자원(R)은 사용 중으로 설정하여, 해당 자원이 선택되지 않도록 한다.In C-V2X mode 4, each vehicle first senses the resource use state of another vehicle before selecting a resource to transmit data. This is to prevent collision of transmitted data by analyzing the resource (R) used by other vehicles. Each vehicle senses each resource (R) and determines that the resource (R) is determined as being used by another vehicle is Set to in use, so that the resource is not selected.

센싱 과정에서 각 차량은 센싱 윈도우(Sensing windows)에 의해 지정된 시간 구간(일반적으로 1000ms)에 동안 각 자원(R)의 사용 상태를 수신 신호 강도 표시(Received Signal Strength Indicator: 이하 RSSI)를 기반으로 모니터링한다. 도 1에서는 서브 프레임이 1ms의 시간 간격으로 구분되므로, 센싱 윈도우에는 1000개의 서브 프레임이 포함될 수 있다. 따라서 차량은 이전 1000개의 서브 프레임 각각에서 서브 채널별 자원 사용 상태를 RSSI를 기반으로 모니터링한다.During the sensing process, each vehicle monitors the usage status of each resource (R) during the time period (generally 1000ms) specified by the sensing window based on the received signal strength indicator (RSSI). do. In FIG. 1, since sub-frames are divided into 1 ms time intervals, 1000 sub-frames may be included in the sensing window. Therefore, the vehicle monitors the resource usage status of each sub-channel in each of the previous 1000 sub-frames based on RSSI.

그리고 모니터링 결과 RSSI가 기지정된 문턱값 이상인 자원(R)을 다른 차량에 의해 사용 중인 자원으로 판별한다. 즉 각 자원(R)에 대해 측정된 RSSI를 기반으로 사용 가능 자원과 사용 불가 자원을 판별한다.As a result of monitoring, the resource R whose RSSI is equal to or greater than a predetermined threshold value is determined as a resource being used by another vehicle. That is, available and unusable resources are determined based on the RSSI measured for each resource (R).

센싱 과정 이후, 선택 과정에서 차량은 데이터를 전송하기 한 자원을 선택한다. 선택 과정에서 차량은 선택 윈도우(selection windows)에 의해 지정된 시간 구간(여기서는 일예로 100ms) 내에서 RSSI를 기반으로 사용 가능 자원으로 판별된 자원(R) 중 하나의 자원을 선택한다. 여기서 선택 윈도우는 차량이 자원을 예약할 수 있는 서브 프레임을 지정하기 위해 설정된다. 만일 다수의 차량이 빈번하게 자원을 예약하고자 한다면, 잦은 충돌이 발생하게 되며, 이를 방지하기 위해 일반적으로 자원 예약 간격(resource reservation interval: 이하 RRI)이 지정된다. RRI는 20ms, 50ms 및 100ms 등으로 설정될 수 있으며, 선택 윈도우는 RRR에 대응하는 크기를 갖는다. 도 1에서는 RRI가 100ms인 경우를 가정하였으며, 이에 선택 윈도우가 100개의 서브 프레임을 포함하는 크기로 설정되었다. 그리고 차량은 선택 윈도우에 의해 시정된 시간 구간에서 하나의 자원만을 선택할 수 있다.After the sensing process, in the selection process, the vehicle selects a resource for transmitting data. In the selection process, the vehicle selects one of the resources R determined as available resources based on the RSSI within a time period designated by the selection window (here, for example, 100 ms). Here, the selection window is set to designate a subframe in which the vehicle can reserve a resource. If multiple vehicles frequently attempt to reserve resources, frequent collisions occur, and in general, a resource reservation interval (RRI) is designated to prevent this. The RRI may be set to 20 ms, 50 ms, and 100 ms, and the selection window has a size corresponding to the RRR. In FIG. 1, it is assumed that the RRI is 100 ms, and the selection window is set to a size including 100 subframes. In addition, the vehicle may select only one resource in the time interval corrected by the selection window.

이때, 차량은 센싱 과정에서 센싱 윈도우의 RRI 단위로 각 자원(R)을 모니터링한 결과를 바탕으로, RRI에 대응하는 간격의 자원(R)들의 평균 RSSI(Average RSSI)를 계산하고, 계산된 평균 RSSI가 하위 20%에 해당하는 자원을 선택 가능한 후보 자원으로 하여, 선택 가능한 후보 자원 중 하나를 선택한다. 일예로 차량은 평균 RSSI가 최소인 자원을 선택할 수도 있다.At this time, the vehicle calculates the average RSSI (Average RSSI) of the resources (R) at intervals corresponding to the RRI based on the result of monitoring each resource (R) in the RRI unit of the sensing window during the sensing process, and the calculated average The resource corresponding to the lower 20% of the RSSI is selected as a selectable candidate resource, and one of the selectable candidate resources is selected. For example, the vehicle may select a resource with the minimum average RSSI.

이때 차량은 선택된 자원을 이용할 횟수를 나타내는 자원 재선택 카운터(resource reselection counter: 이하 RC)을 함께 설정할 수 있다. 각 차량은 RC를 기지정된 방식으로 설정할 수 있다. 일예로 차량은 RRI가 100ms 인 경우, [5, 15] 중 하나의 값을 임의로 선택할 수 있으며, RRI가 50이면, [10, 30] 중 하나의 값을 임의로 선택할 수 있다.At this time, the vehicle may also set a resource reselection counter (hereinafter referred to as RC) indicating the number of times to use the selected resource. Each vehicle can set the RC in a predetermined manner. For example, when the RRI is 100 ms, the vehicle can randomly select one of [5, 15], and when the RRI is 50, it can randomly select one of [10, 30].

그리고 RC로 지정된 횟수만큼 패킷을 전송하게 되면, 차량은 재선택 과정을 수행한다. 재선택 과정에서 차량은 미리 설정된 자원 유지 확률(resource keeping probability)(p)에 기초한 자원 재선택 확률(resource reselection probability)(1 - p)에 따라 자원을 재선택할지, 현재 사용 중인 자원을 그대로 재이용할지를 판별한다. 자원 유지 확률(p)은 통상적으로 0에서 0.8 사이의 값(p=[0, 0.8])으로 설정되고, 이에 따라 자원 재선택 확률(1 - p)은 0 ~ 20% 사이의 값으로 결정될 수 있다.And when the packet is transmitted the number of times specified by RC, the vehicle performs a reselection process. During the reselection process, the vehicle reselects resources according to a resource reselection probability (1-p) based on a preset resource keeping probability (p), or reuses the currently used resource as it is. Determine whether to do it. The resource retention probability (p) is usually set to a value between 0 and 0.8 (p=[0, 0.8]), and accordingly, the resource reselection probability (1-p) can be determined to a value between 0 and 20%. have.

만일 자원 재선택 확률(1 - p)에 기초하여 차량이 자원을 재선택하는 것으로 판별하게 되면, 차량은 선택 과정과 동일하게 센싱 과정에서 센싱 윈도우 기간 동안 모니터링된 자원 사용 상태를 기반으로 자원을 재선택한다.If it is determined that the vehicle reselects a resource based on the resource reselection probability (1-p), the vehicle re-selects the resource based on the resource usage status monitored during the sensing window period in the sensing process in the same way as the selection process. Choose.

다만 기존의 C-V2X 모드4에서는 각 차량의 자원 재선택 확률(1 - p)이 미리 설정되어 고정됨에 따라 주변 환경에 적응적으로 자원을 유지하거나 재선택하기 어려웠다. 그러나 이하에서 설명하는 본 실시예에 따른 차량 통신 시스템에서는 각 차량이 인프라스트럭쳐(Infrastructure)와 통신을 수행하고, 인프라스트럭쳐가 각 차량의 자원 재선택 확률(1 - p)을 조절함으로써, 차량이 주변 환경에 적응적으로 자원을 유지하거나 재선택할 수 있도록 한다.However, in the existing C-V2X mode 4, it was difficult to maintain or reselect resources adaptively to the surrounding environment as the resource reselection probability (1-p) of each vehicle is preset and fixed. However, in the vehicle communication system according to the present embodiment described below, each vehicle communicates with the infrastructure, and the infrastructure adjusts the resource reselection probability (1-p) of each vehicle. It allows resources to be maintained or reselected adaptively to the environment

상기에서 C-V2X 모드4에서 각 차량이 자원을 재선택하는 개념을 설명하는 것은 본 실시예에 따른 차량 통신 시스템에서 각 차량이 기본적으로 C-V2X 모드4를 기초로 자원을 재선택하기 때문이다.The concept in which each vehicle reselects a resource in the C-V2X mode 4 is described above because each vehicle basically reselects a resource based on the C-V2X mode 4 in the vehicle communication system according to the present embodiment. .

도 2는 본 발명의 일 실시예에 따른 차량 통신 시스템에서 자원 재선택 제어 장치가 각 차량이 자원을 재선택할 수 있도록 자원 재선택 확률을 업데이트하는 개념을 설명하기 위한 도면이고, 도 3은 도 2의 자원 재선택 제어 장치가 각 차량의 위치에 기반하여 자원 재선택 확률을 초기화하는 과정을 설명하기 위한 도면이다.FIG. 2 is a diagram for explaining the concept of updating a resource reselection probability so that each vehicle can reselect a resource by a resource reselection control apparatus in a vehicle communication system according to an embodiment of the present invention, and FIG. A diagram for explaining a process of initializing a resource reselection probability based on the location of each vehicle by the resource reselection control apparatus of.

여기서 각 차량은 이미 자원에 대한 초기 센싱 과정 및 선택 과정을 수행하여 자원을 선택하고, 선택된 자원을 이용하고 있는 것으로 가정한다.Here, it is assumed that each vehicle has already selected a resource by performing an initial sensing process and a selection process for the resource, and is using the selected resource.

도 2를 참조하면, 본 실시예에 따른 차량 통신 시스템에서는 각 차량(A)이 도착 시간(Time-of-Arrival: ToA), 도착 시간차(Time-Difference-of-Arrival: TDoA)와 같은 로컬라이제이션 기법을 이용하여 자신의 위치 정보를 획득한다. 각 차량은 각종 센서를 이용하여 수시로 주변을 감지함으로써, 자신의 위치 정보를 판별할 수 있다. 기존에도 C-V2X 모드4에서는 차량이 센서를 이용하여 자신의 위치 정보 및 주변 환경을 감지한다. 따라서 여기서는 각 차량은 항시 자신의 위치 정보를 감지하고 있는 것으로 가정한다.Referring to FIG. 2, in the vehicle communication system according to the present embodiment, each vehicle A is a localization scheme such as time-of-arrival (ToA) and time-of-arrival (TDoA). Use to acquire your own location information. Each vehicle can determine its own location information by detecting its surroundings from time to time using various sensors. Even before, in C-V2X mode 4, the vehicle uses sensors to detect its own location information and surrounding environment. Therefore, it is assumed here that each vehicle is always detecting its own location information.

그리고 차량은 RC로 지정된 횟수만큼 패킷을 전송하여 재선택 과정이 되면, 이전 선택 윈도우로부터 현재까지의 패킷 수신율(packet reception ratio: 이하 PRR)을 측정한다.In addition, when the vehicle transmits the packet the number of times specified by the RC and performs the reselection process, the packet reception ratio (hereinafter referred to as PRR) from the previous selection window to the present is measured.

PRR이 측정되면, 차량은 측정된 PRR을 자신의 위치 정보와 함께 자원 재선택 제어 장치(IS)로 업링크 전송한다. 여기서 자원 재선택 제어 장치(IS)는 차량 통신 시스템에 제공된 인프라스트럭쳐(Infrastructure)로서 노변 장치(Roadside Unit: RSU) 또는 기지국 등일 수 있다.When the PRR is measured, the vehicle uplink transmits the measured PRR to the resource reselection control device (IS) along with its location information. Here, the resource reselection control device IS is an infrastructure provided to the vehicle communication system, and may be a roadside unit (RSU) or a base station.

상기한 바와 같이, 차량 통신 시스템에서 C-V2X 모드3의 경우, 인프라스트럭쳐를 이용하는 반면, C-V2X 모드4는 인프라스트럭쳐를 이용하지 않는다. 그러나 본 실시예에서는 각 차량이 C-V2X 모드4를 기반으로 통신을 수행하되, 인프라스트럭쳐가 자원 재선택 제어 장치(IS)로 기능하여, 각 차량의 자원 재선택 확률(1 - p)을 조절할 수 있도록 한다. 이는 자원 재선택 제어 장치(IS)로 기능하는 인프라스트럭쳐가 차량의 주변 환경, 즉 도로 환경에 적응적으로 각 차량의 자원 재선택 확률(1 - p)을 가변함으로써, 각 차량의 PRR을 향상시키기 위함이다.As described above, in the case of the C-V2X mode 3 in the vehicle communication system, the infrastructure is used, while the C-V2X mode 4 does not use the infrastructure. However, in this embodiment, each vehicle performs communication based on the C-V2X mode 4, but the infrastructure functions as a resource reselection control device (IS) to adjust the resource reselection probability (1-p) of each vehicle. To be able to. This is because the infrastructure functioning as the resource reselection control device (IS) changes the resource reselection probability (1-p) of each vehicle adaptively to the vehicle's surrounding environment, that is, the road environment, thereby improving the PRR of each vehicle. It is for sake.

자원 재선택 제어 장치(IS)는 차량으로부터 위치 정보와 PRR이 인가되면, 차량의 위치 정보에 기반하여, 각 차량의 자원 재선택 확률(1 - p)을 기지정된 방식으로 초기화한다.When the location information and the PRR are applied from the vehicle, the resource reselection control device IS initializes the resource reselection probability (1-p) of each vehicle in a predetermined manner based on the location information of the vehicle.

이후, 강화 학습 방식으로 각 차량에 대한 자원 유지 확률(p)을 업데이트함으로써, 각 차량의 자원 재선택 확률(1 - p)을 조절한다.Thereafter, resource reselection probability (1-p) of each vehicle is adjusted by updating the resource retention probability (p) for each vehicle using the reinforcement learning method.

자원 재선택 제어 장치(IS)는 우선 도 3에 도시된 바와 같이, 다수의 차량에서 전송된 차량 위치 정보를 기반으로 각 차량의 위치를 중심으로 기지정된 기준 거리(d_th) 이내에 위치한 다른 차량의 수를 카운트한다. 자원 재선택 제어 장치(IS)는 위치 정보가 전송된 다수의 차량(i) 각각의 위치를 중심으로 기준 거리(d_th) 이내에 위치한 다른 차량의 수(n_i)를 카운트하고, 이중 최대값(N = max(n_i))과 최소값(n = min(n_i))를 판별한다.Resource re-selection control unit (IS), as illustrated in Figure 3 first, the other vehicle is located within, based on the vehicle position information sent from the plurality of vehicle groups based on a specified distance around the location of each vehicle (d _th) Count the number. The resource reselection control device (IS) counts the number of other vehicles (n _i _{) located within a reference distance (d th} ) with respect to each location of a plurality of vehicles (i) to which the location information is transmitted, and the maximum value ( Determine N = max(n _i )) and minimum value (n = min(n _i )).

그리고 판별된 최대값(N)과 최소값(n)을 이용하여 각 차량(i)의 자원 유지 확률(p)을 기지정된 방식으로 초기화하여 초기 자원 유지 확률(p₀)을 획득한다.Then, the resource maintenance probability (p) of each vehicle (i) is initialized in a known manner using the determined maximum value (N) and the minimum value (n) to obtain _{an initial resource maintenance probability (p 0 ).}

자원 재선택 제어 장치(IS)는 일예로 수학식 1에 따라 초기 자원 유지 확률(p₀)을 획득할 수 있다.The resource reselection control apparatus IS may obtain an _{initial resource maintenance probability p 0 according to Equation 1, for example.}

수학식 1에 따르면, 자원 재선택 제어 장치(IS)는 차량(i)을 중심으로 기준 거리(d_th) 이내에 위치한 다른 차량의 수(n_i)에 따라 초기 자원 유지 확률(p₀)을 0에서 0.8 사이의 값(p₀ = [0, 0.8])으로 정규화한다.According to Equation 1, the resource reselection control device (IS) _zeroes the initial resource maintenance probability (p 0) according to the number of other vehicles (n _i _{) located within the reference distance (d th) from the vehicle (i).} Normalize to a value between 0.8 (p ₀ = [0, 0.8]).

즉 i번째 차량(i)을 중심으로 기준 거리(d_th) 이내에 위치한 다른 차량의 수(n_i)가 최대값(N)에 근접할수록 초기 자원 유지 확률(p₀)을 작게 설정하고, 다른 차량의 수(n_i)가 최소값(n)에 근접할수록 기지정된 초기 자원 유지 확률(p₀)을 최대 유지 확률(여기서는 일예로 0.8)에 가깝게 되도록 설정한다.That is, as the number of other vehicles (n _i _{) located within the reference distance (d th} ) from the i-th vehicle (i) approaches the maximum value (N), the initial resource retention probability (p ₀ ) is set smaller, and other vehicles As the number of n _i approaches the minimum value n, the predetermined initial resource retention probability p ₀ is set to be closer to the maximum retention probability (here, as an example, 0.8).

그리고 다수의 차량(i) 각각에 대응하여 설정된 초기 자원 유지 확률(p₀)을 대응하는 차량으로 전달하여, 각 차량의 자원 재선택 확률(1 - p)을 초기화한다.Then, the initial resource retention probability (p ₀ ) set for each of the plurality of vehicles i is transmitted to the corresponding vehicle, thereby initializing the resource reselection probability (1-p) of each vehicle.

따라서 각 차량의 주변에 다른 차량이 많을수록 자원 유지 확률(p)은 작아지게 되고, 자원 유지 확률(p)로부터 획득되는 자원 재선택 확률(1 - p)은 커지게 된다. 그리고 각 차량의 주변에 다른 차량이 적을수록 자원 유지 확률(p)은 커지게 되고, 자원 재선택 확률(1 - p)은 작아지게 된다.Therefore, as there are more other vehicles around each vehicle, the resource retention probability (p) decreases, and the resource reselection probability (1-p) obtained from the resource retention probability (p) increases. In addition, as there are fewer other vehicles around each vehicle, the resource retention probability (p) increases, and the resource reselection probability (1-p) decreases.

이는 주변에 차량이 많이 위치해 간섭을 주거나 받을 확률이 높은 차량들이 자원을 더 비번하게 재선택하도록 하여 학습 초기에도 각 차량의 자원 재선택 효율성을 높일 수 있도록 하기 위함이다.This is to increase the resource reselection efficiency of each vehicle even at the beginning of learning by allowing vehicles with a high probability of causing or receiving interference because there are many vehicles around them to reselect resources more inconveniently.

그러므로 자원 재선택 제어 장치(IS)는 초기화 과정에서 차량의 위치를 인지하여 자원 재선택 확률(1 - p)을 초기화하는 것으로 볼 수 있다.Therefore, it can be seen that the resource reselection control apparatus IS initializes the resource reselection probability (1-p) by recognizing the location of the vehicle during the initialization process.

이후, 자원 재선택 제어 장치(IS)는 각 차량별 PRR 값을 기반으로 각 차량의 자원 유지 확률(p)을 강화 학습 방식으로 업데이트한다.Thereafter, the resource reselection control apparatus IS updates the resource maintenance probability p of each vehicle based on the PRR value for each vehicle in a reinforcement learning method.

본 실시예에서 자원 재선택 제어 장치(IS)는 각 차량의 PRR을 향상시키기 위해, 자원 유지 확률(p)을 업데이트한다. 즉 자원 재선택 제어 장치(IS)는 수학식 2에 따라 전송된 패킷 대비 수신된 패킷의 비를 나타내는 PRR을 최대로 하는 자원 유지 확률(

)을 획득하는 것을 목적으로 한다.In this embodiment, the resource reselection control apparatus IS updates the resource retention probability p in order to improve the PRR of each vehicle. That is, the resource reselection control apparatus (IS) maximizes the PRR representing the ratio of the received packet to the transmitted packet according to Equation 2 (

It aims to acquire ).

다만 도로 환경이 항시 변화하므로, 자원 재선택 제어 장치(IS)가 수학식 2로 나타나는 PRR을 최대로 하는 자원 유지 확률(

)을 직접 획득하는 것은 현실적으로 매우 어렵다.However, since the road environment is always changing, the resource reselection control device (IS) maximizes the PRR represented by Equation 2 (

) Is very difficult in reality.

이에 본 실시예에 따른 자원 재선택 제어 장치(IS)는 수학식 3에 따른 강화 학습 알고리즘을 이용하여 자원 유지 확률(p)을 업데이트함으로써, 각 차량의 PRR이 향상될 수 있도록 한다.Accordingly, the resource reselection control apparatus IS according to the present embodiment updates the resource retention probability p using the reinforcement learning algorithm according to Equation 3, so that the PRR of each vehicle can be improved.

수학식 3을 참조하면, 자원 재선택 제어 장치(IS)는 각 차량(i)에 대한 자원 유지 확률(p_i)을 가중치(w_i(t))에 기반하여 업데이트한다. 여기서 다음 가중치(w_i(t+1))는 현재 가중치(w_i(t))와 유틸리티 함수(u(t))에 의해 결정되며, 유틸리티 함수(u(t))는 PRR로 설정된다. 다시 말해 다음 가중치(w_i(t+1))는 현재 가중치(w_i(t))와 현재 PRR과 이전 PRR 사이의 차에 미리 지정된 유틸리티 가중 파라미터(α)와 가중치(w_i)에 대한 자원 유지 확률(p_i)의 변화율을 기반으로 획득된다.Referring to Equation 3, the resource reselection control apparatus IS _{updates the resource retention probability p i} for each vehicle i based on the weight w _i (t). Here, the next weight (w _i (t+1)) is determined by the current weight (w _i (t)) and the utility function (u(t)), and the utility function (u(t)) is set to PRR. In other words, the next weight (w _i (t+1)) is the resource for the _{current weight (w i} (t)) and the utility weight parameter (α) and weight (w _{i) pre-specified in the difference between the current PRR and the previous PRR.} It is obtained based on the rate of change of the retention probability (p _{i ).}

자원 재선택 제어 장치(IS)는 수학식 3에 기초하여 자원 유지 확률(p_i)을 업데이트하고, 업데이트된 자원 유지 확률(p_i)을 대응하는 차량(i)로 다운 링크 전송하여, 각 차량(i)의 자원 재선택 확률(1 - p)을 업데이트한다.Resource re-selection control unit (IS) is to update the resource holding probability (p _i) on the basis of the equation (3), and transmitting downlink to a vehicle (i) corresponding to keep updated resource probability (p _i), each vehicle The resource reselection probability (1-p) of (i) is updated.

그리고 각 차량(i)은 업데이트된 자원 재선택 확률(1 - p)에 기초하여 자원의 재선택 여부를 결정하고, 자원을 재선택하는 것으로 판별되면, 선택 윈도우를 설정하고, 센싱 과정에서 센싱 윈도우의 RRI 단위로 각 자원(R)을 모니터링한 결과를 바탕으로, 센싱 윈도우에서 평균 RSSI가 하위 20%에 해당하는 자원 중 하나에 대응하는 자원을 설정된 선택 윈도우 내에서 선택한다. 그리고 선택된 자원을 이용하여 다른 차량으로 데이터를 전송한다.And each vehicle (i) determines whether to reselect a resource based on the updated resource reselection probability (1-p), and when it is determined that the resource is reselected, a selection window is set, and a sensing window is performed in the sensing process. Based on the result of monitoring each resource (R) in RRI units of, a resource corresponding to one of the resources corresponding to the lower 20% of the average RSSI in the sensing window is selected within the set selection window. Then, the data is transmitted to another vehicle using the selected resource.

도 4는 본 발명의 일 실시예에 따른 차량 통신 시스템의 자원 재선택 제어 방법을 나타낸다.4 illustrates a method for controlling resource reselection in a vehicle communication system according to an embodiment of the present invention.

도 2 및 도 3을 참조하여, 도 4의 자원 재선택 제어 방법을 설명하면, 우선 적어도 하나의 차량 각각으로부터 위치 정보와 PRR을 수신하여 획득한다(S10). 상기한 바와 같이, 적어도 하나의 차량 각각은 다양한 센서를 이용하여 자신의 위치 정보를 판별하고, 이전 선택 윈도우로부터 현재까지의 PRR을 측정할 수 있다. 그리고 자원 재선택 제어 장치(IS)는 각 차량에서 업링크 전송되는 위치 정보와 PRR을 수신하여 획득한다.Referring to FIGS. 2 and 3, the method of controlling resource reselection of FIG. 4 is first obtained by receiving location information and PRR from each of at least one vehicle (S10). As described above, each of the at least one vehicle may determine its own location information using various sensors, and measure a PRR from the previous selection window to the present. In addition, the resource reselection control apparatus (IS) receives and acquires the location information and PRR transmitted uplink from each vehicle.

적어도 하나의 차량 각각의 위치 정보가 획득되면, 획득된 위치 정보를 기반로부터 판별되는 각 차량의 주변 차량 수에 따라 각 차량의 자원 유지 확률(p)을 초기 자원 유지 확률(p₀)로 초기화하고, 초기 자원 유지 확률(p₀)을 각 차량으로 다운링크 전송하여 각 차량이 전송된 초기 자원 유지 확률(p₀)에 따라 자원 재선택 확률(1-p)을 초기 자원 재선택 확률(1 - p₀)로 변경하도록 한다(S20).When the location information of each of the at least one vehicle is acquired, the resource retention probability (p) of each vehicle is initialized _{to the initial resource retention probability (p 0) according to the number of surrounding vehicles of each vehicle determined based on the acquired location information.} maintaining the initial resource probability (p ₀₎ of the downlink transmission to the vehicles keep each vehicle is transmitted initial resource probability (p ₀₎ an initial resource reselection probability of a resource reselection probability (1-p) in accordance with (1 - p ₀ ) to be changed (S20).

자원 재선택 제어 장치(IS)는 각 차량의 자원 재선택 확률(1 - p)을 초기 자원 재선택 확률(1 - p₀)로 변경하기 위해, 우선 적어도 하나의 차량(i) 각각을 중심으로 기지정된 기준 거리(d_th) 이내의 주변 차량 수(n_i)를 카운트한다(S21). 그리고 적어도 하나의 차량 각각에 대해 카운트된 주변 차량 수(n_i) 중 최대값(N = max(n_i))과 최소값(n = min(n_i))을 판별한다(S22).In order to change the resource reselection probability (1-p) of each vehicle to the initial resource reselection probability (1-p ₀ ), the resource reselection control device (IS) first focuses on each of at least one vehicle (i). The number of surrounding vehicles (n _i ) within a predetermined reference distance (d _th ) is counted (S21). The maximum value (N = max(n _i )) and the minimum value (n = min(n _i _{)) of the number of surrounding vehicles (n i} ) counted for each of the at least one vehicle are determined (S22).

최대값(N)과 최소값(n)이 판별되면, 판별된 최대값(N)과 최소값(n) 및 적어도 하나의 차량 각각에 대해 카운트된 주변 차량 수(n_i)를 기반으로, 각 차량에 대한 초기 자원 유지 확률(p₀)을 수학식 1과 같이 기지정된 방식으로 획득한다(S23). 이때, 초기 자원 유지 확률(p₀)은 기지정된 범위([0, 0.8])의 값을 갖도록 정규화되어 획득될 수 있으며, 적어도 하나의 차량 각각을 중심으로 기준 거리(d_th) 이내에 위치한 다른 차량의 수(n_i)가 최대값(N)에 근접할수록 초기 자원 유지 확률(p₀)을 작게 설정하고, 다른 차량의 수(n_i)가 최소값(n)에 근접할수록 기지정된 범위 내에서 최대값에 가깝게 되도록 설정할 수 있다.When the maximum value (N) and minimum value (n) are determined, based on the determined maximum value (N) and minimum value (n) and the number of surrounding vehicles (n _i ) counted for each of at least one vehicle, each vehicle is The initial resource maintenance probability (p ₀ ) for is obtained in a predetermined manner as shown in Equation 1 (S23). At this time, the initial resource retention probability (p ₀ ) can be obtained by normalizing to have a value in a predetermined range ([0, 0.8]), and other vehicles located within a _{reference distance (d th) around each of at least one vehicle} As the number of (n _i ) approaches the maximum value (N), the initial resource retention probability (p ₀ ) is set smaller, and the closer the number of other vehicles (n _i ) to the minimum value (n), the maximum within a specified range. It can be set to be close to the value.

그리고 설정된 각 차량에 대한 초기 자원 유지 확률(p₀)을 대응하는 차량으로 전송하여, 각 차량이 자원 재선택 확률(1-p)을 초기 자원 재선택 확률(1 - p₀)로 변경하도록 한다(S24).And, by transmitting the set initial resource retention probability (p ₀ ) for each vehicle to the corresponding vehicle, each vehicle changes the resource reselection probability (1-p) to the initial resource reselection probability (1-p ₀ ). (S24).

초기 자원 유지 확률(p₀)이 획득되면, 이후 각 차량의 PRR을 향상시키기 위해 자원 유지 확률(p)을 강화 학습 방식으로 업데이트한다(S30).When the initial resource retention probability (p ₀ ) is obtained, the resource retention probability (p) is then updated in a reinforcement learning method to improve the PRR of each vehicle (S30).

초기 자원 유지 확률(p₀)을 업데이트하기 위해 자원 재선택 제어 장치(IS)는 각 차량으로부터 PRR을 수신하여 획득한다(S31). 이때 각 차량의 위치 정보를 함께 인가받을 수도 있다. 그리고 수신된 PRR의 변화에 기반하여, 가중치(w_i(t))를 기지정된 방식으로 계산하고, 계산된 가중치(w_i(t))를 이용하여, 자원 유지 확률(p)을 업데이트한다(S32). 자원 유지 확률(p)이 업데이트되면, 업데이트된 자원 유지 확률(p)을 대응하는 차량으로 전송하여, 각 차량이 업데이트된 자원 유지 확률(p)에 따라 자원 재선택 확률(1 - p)을 변경하도록 한다(S24).In order to update the initial resource retention probability p ₀ , the resource reselection control apparatus IS receives and obtains a PRR from each vehicle (S31). At this time, the location information of each vehicle may be authorized together. Then, based on the change in the received PRR, the weight (w _i (t)) is calculated in a known manner, and the resource retention probability (p) is updated using the _{calculated weight (w i (t)) (} S32). When the resource retention probability (p) is updated, the updated resource retention probability (p) is transmitted to the corresponding vehicle, and each vehicle changes the resource reselection probability (1-p) according to the updated resource retention probability (p). Let it be (S24).

도 5는 본 발명의 차량 통신 시스템의 자원 재선택 제어 방법에 따른 패킷 수신율 성능을 나타낸다.5 shows packet reception rate performance according to the resource reselection control method of the vehicle communication system of the present invention.

도 5는 시간의 진행에 따른 차량의 PRR을 측정한 결과로서, (a)는 본 실시예에 따라 차량 위치 인지 기반 자원 유지 확률 초기화 과정 및 강화 학습에 의한 자원 유지 확률 업데이트 과정을 수행한 경우를 나타낸다. (b)는 자원 유지 확률을 랜덤하게 초기화하고, 이후 강화 학습에 의한 자원 유지 확률 업데이트 과정을 수행한 경우를 나타낸다. 그리고 (c)는 기존의 C-V2X 모드4에 지향성 안테나(Directional antenna) 및 빔 포밍(beamforming) 기법을 적용한 경우를 나타낸다.Figure 5 is a result of measuring the PRR of the vehicle according to the progress of time, (a) shows a case in which the vehicle location recognition-based resource maintenance probability initialization process and the resource maintenance probability update process by reinforcement learning are performed according to the present embodiment. Show. (b) shows a case in which the resource maintenance probability is initialized at random, and then the resource maintenance probability update process by reinforcement learning is performed. And (c) shows a case in which a directional antenna and a beamforming technique are applied to the existing C-V2X mode 4.

도 5에서 (a) 내지 (c) 모두 시간이 흐름에 따라 점차 PRR이 향상되는 형태로 나타나지만, (a) 및 (b)와 같이 강화 학습에 의한 자원 유지 확률 업데이트 과정을 수행한 경우에 PRR이 더욱 높게 나타남을 알 수 있다. 또한 (a)와 같이 차량 위치 인지 기반 자원 유지 확률 초기화 과정을 수행하는 경우에 자원 유지 확률을 랜덤하게 초기화한 경우보다 PRR이 더욱 향상됨을 알 수 있다.In FIG. 5, both (a) to (c) appear in a form in which the PRR gradually increases with time, but when the resource retention probability update process by reinforcement learning is performed as shown in (a) and (b), the PRR is It can be seen that it appears even higher. In addition, in the case of performing the process of initializing the resource maintenance probability based on vehicle location recognition as shown in (a), it can be seen that the PRR is further improved than the case of randomly initializing the resource maintenance probability.

이는 주파수에 해당하는 자원의 재선택 효율성을 향상시킨 것으로 각 차량의 PRR과 함께 순간 데이터율(Instantaneous Data Rate)을 향상시킬 수 있다.This improves the reselection efficiency of the resource corresponding to the frequency, and can improve the instantaneous data rate together with the PRR of each vehicle.

본 발명에 따른 방법은 컴퓨터에서 실행시키기 위한 매체에 저장된 컴퓨터 프로그램으로 구현될 수 있다. 여기서 컴퓨터 판독가능 매체는 컴퓨터에 의해 액세스 될 수 있는 임의의 가용 매체일 수 있고, 또한 컴퓨터 저장 매체를 모두 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함하며, ROM(판독 전용 메모리), RAM(랜덤 액세스 메모리), CD(컴팩트 디스크)-ROM, DVD(디지털 비디오 디스크)-ROM, 자기 테이프, 플로피 디스크, 광데이터 저장장치 등을 포함할 수 있다.The method according to the present invention can be implemented as a computer program stored in a medium for execution on a computer. Here, the computer-readable medium may be any available medium that can be accessed by a computer, and may also include all computer storage media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, and ROM (Read Dedicated memory), RAM (random access memory), CD (compact disk)-ROM, DVD (digital video disk)-ROM, magnetic tape, floppy disk, optical data storage device, and the like.

본 발명은 도면에 도시된 실시예를 참고로 설명되었으나 이는 예시적인 것에 불과하며, 본 기술 분야의 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 타 실시예가 가능하다는 점을 이해할 것이다.The present invention has been described with reference to the embodiments shown in the drawings, but these are merely exemplary, and those of ordinary skill in the art will understand that various modifications and equivalent other embodiments are possible therefrom.

따라서, 본 발명의 진정한 기술적 보호 범위는 첨부된 청구범위의 기술적 사상에 의해 정해져야 할 것이다.Therefore, the true technical protection scope of the present invention should be determined by the technical spirit of the appended claims.

Claims

Resource maintenance probability of each vehicle according to the number of neighboring vehicles of each vehicle determined from the location information of each of the received at least one vehicle by receiving location information and packet reception ratio (PRR) from each of at least one vehicle Is initialized to the initial resource retention probability,
Thereafter, the resource retention probability is updated in response to a change in the PRR of each of the received at least one vehicle and transmitted to the corresponding vehicle,
The weight is updated in a predetermined manner according to the change of the PRR received from each of the at least one vehicle, and the updated weight is applied to obtain a resource maintenance probability,
Equation _{of the resource maintenance probability (p i} ) for each of at least one vehicle (i) from the weight (w _i)

(Where t is time, u(t) represents PRR as a utility function of reinforcement learning, and α is a predefined utility weighting parameter)
Resource reselection control device of the vehicle communication system obtained according to.

delete

The method of claim 1, wherein the resource reselection control device
Analyzing the maximum and minimum values from the number of surrounding vehicles of each vehicle determined from the received location information of each of at least one vehicle, and as the number of surrounding vehicles located within a reference distance around each of at least one vehicle approaches the maximum value A resource reselection control apparatus of a vehicle communication system configured to set the initial resource retention probability to be small, and set so that the number of nearby vehicles approaches the minimum value, the closer to the maximum value within a predetermined range.

The method of claim 4, wherein the resource reselection control device
Equation _{of the initial resource retention probability (p 0} _{) from the number of surrounding vehicles (n i} ) for each of at least one vehicle (i) and the determined maximum value (N) and the minimum value (n)

(Where d _th is the reference distance for determining the surrounding vehicle around the vehicle (i), and n _i is the number of other vehicles located within the reference distance around the vehicle (i))
Resource reselection control device of the vehicle communication system obtained according to.

The method of claim 1, wherein each of the at least one vehicle
A resource reselection control device of a vehicle communication system that is updated in the resource reselection control device to change a resource reselection probability according to a resource maintenance probability, and to determine whether to reselect a resource according to the changed resource reselection probability.

The method of claim 1, wherein the resource reselection control device
Resource reselection control device implemented as an infrastructure of a vehicle communication system.

By receiving location information and packet reception rate (hereinafter referred to as PRR) from each of at least one vehicle, the initial resource maintenance probability of each vehicle is maintained according to the number of surrounding vehicles of each vehicle determined from the location information of each of the received at least one vehicle Initializing with probability; And
Thereafter, including the step of updating the resource maintenance probability in response to a change in the PRR of each of the received at least one vehicle,
The updating step
Updating a weight in a predetermined manner according to a change in PRR received from each of the at least one vehicle; And
Comprising the step of obtaining the updated resource retention probability by applying the updated weight,
Updating the weights
Equation _{of the weight (w i} ) for each of at least one vehicle (i)

(Where t is time, u(t) represents PRR as a utility function of reinforcement learning, and α is a predefined utility weighting parameter)
Acquired according to, and
The step of obtaining the updated resource maintenance probability
Equation of resource retention probability (p _i)

Resource reselection control method of the vehicle communication system obtained according to the.

delete

The method of claim 8, wherein the initializing step
Analyzing a maximum value and a minimum value from the number of surrounding vehicles of each vehicle determined from the received location information of each of the at least one vehicle; And
The initial resource retention probability is set smaller as the number of surrounding vehicles located within a reference distance from each of at least one vehicle approaches the maximum value, and the closer the number of surrounding vehicles approaches the minimum value, the closer to the maximum value within a predetermined range. Resource reselection control method of a vehicle communication system comprising the step of setting to be.

The method of claim 12, wherein the setting step
Equation _{of the initial resource retention probability (p 0} _{) from the number of surrounding vehicles (n i} ) for each of at least one vehicle (i) and the determined maximum value (N) and the minimum value (n)

(Where d _th is the reference distance for determining the surrounding vehicle around the vehicle (i), and n _i is the number of other vehicles located within the reference distance around the vehicle (i))
Resource reselection control method of the vehicle communication system set according to.

The method of claim 8, wherein each of the at least one vehicle
Changing a resource reselection probability according to the updated and transmitted resource maintenance probability; And
Resource reselection control method of a vehicle communication system comprising the step of determining whether to reselect resources according to the changed resource reselection probability.