KR20230067113A

KR20230067113A - A method for calculating the estimated time of arrival of a ship and an device for the same

Info

Publication number: KR20230067113A
Application number: KR1020210152853A
Authority: KR
Inventors: 김현; 정우영; 류상천; 김영남; 이상원
Original assignee: 씨제이대한통운 (주)
Priority date: 2021-11-09
Filing date: 2021-11-09
Publication date: 2023-05-16
Also published as: KR102568297B1

Abstract

The present invention relates to a method for calculating an estimated arrival time of a ship and a calculation device thereof, and more specifically to a method for reinforcing learning of past ship routes and using a result obtained as a result of learning in this way to predict the navigation status of ships currently in operation, ultimately calculating the final estimated arrival time of the ship more accurately, and a calculation device thereof.

Description

A method for calculating the estimated time of arrival of a ship and an arithmetic device therefor

본 발명은 선박의 도착예정시간을 연산하는 방법 및 연산장치에 관한 것으로, 구체적으로는 과거 선박의 항로 등을 강화 학습시키고, 이렇게 학습된 결과 얻어진 결과값을 현재 운용 중인 선박의 항해 상황 예측에 활용함으로써 궁극적으로는 선박의 최종 도착예정시간을 더 정확하게 연산해 내는 방법 및 연산장치에 관한 것이다.The present invention relates to a method and an arithmetic device for calculating an estimated time of arrival of a ship, and more specifically, to reinforce and learn the course of a ship in the past, and use the result obtained as a result of this learning to predict the navigation situation of a ship currently in operation. Ultimately, it relates to a method and an arithmetic device that more accurately calculates the final estimated time of arrival of a ship.

해상운송 산업에서 도착예정시간(ETA)은 일반적으로 선박이 평균적인 속도로 이동하였을 시 목적지에 언제 도착할 것인지를 단순히 거리와 속력 간의 관계식 만으로 계산하여 얻어진다. 그러나 최근에는 기술이 발달하면서 도착예정시간을 계산함에 있어 여러 변수들에 의한 영향을 반영하고자 하는 시도가 있으며, 이러한 노력은 앞으로도 계속 될 것으로 예상된다.Estimated time of arrival (ETA) in the maritime transportation industry is generally obtained by simply calculating when a ship will arrive at its destination when moving at an average speed, using only the relational expression between distance and speed. However, recently, with the development of technology, attempts have been made to reflect the influence of various variables in calculating the estimated time of arrival, and such efforts are expected to continue in the future.

그러나 실제 선박이 운항하게 될 경로, 속력 등은 날씨, 계절, 바람, 항구 혼잡도 등 셀 수 없이 많은 변수들의 영향을 받을 수 밖에 없어 여전히 정확한 도착예정시간을 구하기란 매우 어려운 과제라 할 것이다. However, it is still a very difficult task to obtain an accurate estimated time of arrival, as the route and speed that the actual vessel will operate are inevitably affected by countless variables such as weather, season, wind, and port congestion.

본 발명은 이렇듯 선박이 경로를 따라 목적지까지 이동할 때에 받을 수 있는 영향을 실시간으로 반영하여 즉각적인 경로 연산 및 도착예정시간 연산이 가능하게 하는 방법 및 시스템을 제안하고자 한 것으로, 과거 선박의 항로 등의 학습데이터를 강화학습 방식으로 학습시키고 이렇게 학습된 결과물을 참고하여 도착예정시간을 보정함으로써 종래에 비해 선박의 도착 시간을 정확하게 예측할 수 있게 하는 방법 및 시스템에 관한 것이다. The present invention is intended to propose a method and system that enables immediate route calculation and estimated arrival time calculation by reflecting in real time the effects that a ship may receive when moving to a destination along a route, such as past ship route learning It relates to a method and system for accurately predicting the arrival time of a ship compared to the prior art by learning data using a reinforcement learning method and correcting the estimated time of arrival by referring to the learned result.

대한민국 공개특허 제10-2021-0066108 A호(2021.06.07.공개)Republic of Korea Patent Publication No. 10-2021-0066108 A (2021.06.07. Publication)

본 발명은 강화학습 알고리즘을 통해 과거의 선박운행 정보들을 학습하게 하고, 학습된 결과값을 데이터베이스에 저장해 둠으로써 운항 중인 선박의 실시간 도착예정시간 연산에 활용 가능하게 하는 것을 목적으로 한다.An object of the present invention is to learn past ship operation information through a reinforcement learning algorithm and to store the learned result values in a database so that they can be used for calculating the real-time estimated arrival time of a ship in operation.

이를 통해 종래 단순히 주변 환경 정보들만을 정량적으로 반영시켜 도착예정시간을 연산하였던 것에 비해 더 정확한 도착예정시간을 산출해 낼 수 있게 하고, 궁극적으로는 정확도를 80% 이상까지 높임으로써 도착예정시간 오류에 따른 불필요한 비용 및 인력 낭비를 줄이는 것을 목적으로 한다. Through this, it is possible to calculate the ETA more accurately compared to the conventional calculation of the ETA by simply quantitatively reflecting the surrounding environment information, and ultimately raise the accuracy to 80% or more, thereby reducing the ETA error. The purpose is to reduce unnecessary costs and waste of manpower.

또한, 본 발명은 도착예정시간을 정확하게 예측할 뿐만 아니라 출발지로부터 목적지까지의 경로를 가장 효율적인, 현재 상황에 가장 적합한 경로로 선택하게 함으로써 해상운송 중에 소모되는 자원을 크게 줄이는 것을 목적으로 한다. In addition, an object of the present invention is to significantly reduce resources consumed during maritime transportation by not only accurately predicting the expected arrival time but also selecting the most efficient route from the departure point to the destination to the current situation.

한편, 본 발명의 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.On the other hand, the technical problems of the present invention are not limited to the technical problems mentioned above, and other technical problems not mentioned will be clearly understood by those skilled in the art from the description below.

본 발명은 위와 같은 문제점을 해결하기 위한 것으로, 본 발명에 따른 선박 도착예정시간 연산 방법은 정해진 주기마다 선박에 대한 위치정보를 수집하는 단계; 적어도 하나 이상의 동적환경변수를 참고하여 상기 선박이 시작점으로부터 도착점에 이르는 제1 경로를 생성하는 단계; 선박운행 데이터베이스를 참고하여 상기 제1 경로를 적어도 일부 수정한 제2 경로를 생성하는 단계; 생성된 제2 경로가 기 정해진 조건을 만족시키면 상기 제2 경로를 최종경로로 결정하는 단계; 및 상기 최종경로에 기초한 선박 도착예정시간을 산출하는 단계;를 포함할 수 있다.The present invention is intended to solve the above problems, and a method for calculating an estimated time of arrival of a ship according to the present invention includes the steps of collecting location information about a ship at predetermined intervals; generating a first route from a starting point to a destination point by referring to at least one dynamic environment variable; generating a second route by at least partially modifying the first route by referring to a ship operation database; determining the second route as a final route when the generated second route satisfies a predetermined condition; and calculating an estimated time of arrival of the vessel based on the final route.

또한, 상기 선박 도착예정시간 연산 방법에 있어서 상기 동적환경변수는, 기상정보에 관한 기상변수, 파고에 관한 파고변수, 국제유가에 관한 유가변수, 해양사고에 관한 사고변수, 또는 해양교통량에 관한 교통량변수 중 적어도 하나를 포함할 수 있다.In addition, in the method for calculating the estimated time of arrival of the ship, the dynamic environment variables include meteorological variables related to weather information, wave height variables related to wave heights, oil price variables related to international oil prices, accident variables related to marine accidents, or traffic volume related to marine traffic. At least one of the variables may be included.

또한, 상기 선박 도착예정시간 연산 방법은 상기 제1 경로를 생성하는 단계 이전, 상기 시작점으로부터 도착점에 이르기까지의 기본경로를 적어도 하나 이상 생성하는 단계;를 더 포함하고, 상기 제1 경로는, 상기 기본경로에 상기 동적환경변수가 적어도 하나 이상 적용되어 생성된 것을 특징으로 할 수 있다.In addition, the method for calculating the estimated time of arrival of the ship further includes generating at least one basic route from the starting point to the ending point before generating the first route, wherein the first route comprises: It may be characterized in that it is created by applying at least one dynamic environment variable to a basic path.

또한, 상기 선박 도착예정시간 연산 방법에 있어서 상기 선박운행 데이터베이스는, 과거의 선박운행 정보들이 강화 학습된 결과값들이 누적 저장된 데이터베이스인 것을 특징으로 할 수 있다.In addition, in the method for calculating the expected arrival time of the ship, the ship operation database may be a database in which result values obtained by reinforcement learning of past ship operation information are accumulated and stored.

이 때, 상기 선박운행 데이터베이스에 누적 저장된 결과값들은, 복수 개의 노드들로 구성되는 임의의 경로에 있어서, 선박이 제s노드에서 제s+1노드(s는 1이상의 자연수)로 이동할 시 기대되는 보상을 최대화 하는 것을 목표로 하는 함수의 반복 연산에 의해 산출된 결과값들인 것을 특징으로 할 수 있다.At this time, the result values accumulated and stored in the ship operation database are expected when the ship moves from the s-th node to the s+1-th node (s is a natural number of 1 or more) in an arbitrary path composed of a plurality of nodes. It may be characterized in that they are result values calculated by iterative operation of a function aimed at maximizing compensation.

또한 이 때, 상기 함수는 Also at this time, the function

로 정의되되, s는 노드상태, a는 액션, γ는 할인율, Q(s,a)는 특정 노드상태 s에서 액션 a를 취할 경우 받을 보상의 기대값을 산출하기 위한 행동가치함수, a’은 다음 노드상태(s’)에서 행동가치함수의 값을 최대로 만드는 액션인 것을 특징으로 할 수 있다.

, where s is the node state, a is the action, γ is the discount rate, Q(s,a) is the action value function for calculating the expected value of the reward received when action a is taken in the specific node state s, and a' is It may be characterized as an action that maximizes the value of the action value function in the next node state (s').

또한 이 때, 상기 행동가치함수는 Also, at this time, the action value function is

로 정의되되, s는 노드상태, a는 액션, E는 기대값, R은 보상, γ는 할인율, t는 시간이고, Q_π(s,a)는 특정 상태 s에서 액션 a를 취할 경우 받을 보상의 기대값을 산출하기 위한 함수인 것을 특징으로 할 수 있다.

, where s is the node state, a is the action, E is the expected value, R is the reward, γ is the discount rate, t is the time, and Q _π (s,a) is the reward received when action a is taken in a specific state s It may be characterized in that it is a function for calculating the expected value of .

또한, 상기 선박 도착예정시간 연산 방법에 있어서 상기 제2 경로를 생성하는 단계는, 인공지능 알고리즘에 의해 실행되되, 상기 인공지능 알고리즘은 선박운행 데이터베이스를 기초로 강화 학습된 것을 특징으로 할 수 있다.In addition, in the method for calculating the estimated time of arrival of the ship, the step of generating the second route may be performed by an artificial intelligence algorithm, and the artificial intelligence algorithm may be reinforced and learned based on a ship operation database.

또한, 상기 선박 도착예정시간 연산 방법에 있어서 상기 제2 경로를 생성하는 단계는, 복수 개의 입력변수들 중 어느 하나의 입력변수에 가중치를 부여하는지에 따라 복수 개의 제2 경로들을 생성하는 것을 특징으로 할 수 있으며, 이 때 상기 최종경로는, 상기 복수 개의 제2 경로들 중 항해에 소요되는 비용이 가장 적은 제2 경로로 결정되는 것을 특징으로 할 수 있다.In addition, in the method for calculating the estimated time of arrival of the ship, the generating of the second route is characterized in that a plurality of second routes are generated according to which one of the plurality of input variables is weighted. In this case, the final route may be characterized in that the second route having the lowest cost required for navigation among the plurality of second routes is determined.

한편, 본 발명의 또 다른 실시예에 따라 선박의 경로를 생성하기 위한 알고리즘을 강화 학습 시키는 방법은, 과거 선박운행 정보를 로드하는 단계; 및 상기 과거 선박운행 정보에 있어서, 선박의 시작점으로부터 도착점까지의 항해 결과에 따른 보상의 기대값을 산출하는 함수를 복수 개의 가상경로들에 대하여 반복 실행시키는 단계;를 포함할 수 있다.On the other hand, according to another embodiment of the present invention, a method for reinforcement learning of an algorithm for generating a ship's path includes loading past ship operation information; and repeatedly executing a function for calculating an expected value of compensation according to a navigation result from a start point to a destination point of a ship for a plurality of virtual routes in the past ship operation information.

상기 선박의 경로 생성을 위한 알고리즘을 강화 학습 시키는 방법에 있어서, 상기 시작점으로부터 도착점까지의 경로는 복수 개의 노드들을 포함하고, 상기 함수를 반복 실행시키는 단계는, 선박이 제s노드에서 제s+1노드(s는 1 이상의 자연수)로 이동할 시 기대되는 보상을 최대화 하는 것을 목표로 하는 함수를 반복 실행시키는 단계인 것을 특징으로 할 수 있다.In the reinforcement learning method for the algorithm for generating the path of the ship, the path from the starting point to the destination point includes a plurality of nodes, and the step of repeatedly executing the function is that the ship moves from the s-th node to the s+1-th node. It may be characterized in that it is a step of repeatedly executing a function aimed at maximizing an expected reward when moving to a node (s is a natural number of 1 or more).

또한, 상기 선박의 경로 생성을 위한 알고리즘을 강화 학습 시키는 방법은 상기 복수 개의 가상경로들에 대해 함수를 반복 실행시키되, 각각의 반복 실행 단계에서는 상기 보상의 기대값을 최대화 하기 위하여 변수들을 상이하게 적용시키는 것을 특징으로 할 수 있다.In addition, the reinforcement learning method of the algorithm for generating the ship's path repeatedly executes the function for the plurality of virtual paths, but in each iterative execution step, different variables are applied to maximize the expected value of the reward It can be characterized by doing.

상기 선박의 경로 생성을 위한 알고리즘을 강화 학습 시키는 방법은, 상기 복수 개의 가상경로들에 대해 함수를 반복 실행시킨 결과, 산출된 보상의 기대값이 기준치 이상인 가상경로들을 선별하여 데이터베이스화 하는 단계;를 더 포함할 수 있다.The reinforcement learning method for the algorithm for generating the ship's path includes the steps of selecting virtual paths whose expected value of compensation calculated as a result of repeatedly executing a function for the plurality of virtual paths is equal to or greater than a reference value and storing them in a database. can include more.

한편, 본 발명의 또 다른 실시예에 따른 선박 도착예정시간을 연산하는 연산장치는, 선박에 대한 위치정보, 또는 동적환경변수와 관련된 정보들을 수집하는 정보수집부; 적어도 하나 이상의 동적환경변수, 또는 선박운행 데이터베이스를 참고하여 상기 선박이 시작점으로부터 도착점에 이르는 경로를 연산하는 연산부; 상기 연산부에 의해 연산된 복수 개의 경로들 중 어느 하나를 최종경로로 결정하는 결정부; 동적환경변수와 관련된 정보들이 저장된 동적환경 데이터베이스; 및 상기 정보수집부, 연산부, 결정부, 및 동적환경 데이터베이스를 제어하는 제어부;를 포함할 수 있다.On the other hand, the calculation device for calculating the expected time of arrival of the ship according to another embodiment of the present invention includes an information collection unit for collecting location information about the ship or information related to dynamic environment variables; a calculation unit for calculating a path from a starting point to a destination of the ship by referring to at least one dynamic environment variable or a ship operation database; a determination unit which determines one of the plurality of paths calculated by the calculation unit as a final path; a dynamic environment database in which information related to dynamic environment variables is stored; and a control unit controlling the information collection unit, the calculation unit, the determination unit, and the dynamic environment database.

또한, 상기 연산장치는 과거의 선박운행 정보들을 저장하는 선박운행 데이터베이스를 더 포함할 수 있다.In addition, the calculation device may further include a ship operation database for storing past ship operation information.

또한, 상기 연산장치에 있어서 상기 연산부는, 적어도 하나 이상의 동적환경변수를 참고하여 상기 선박이 시작점으로부터 도착점에 이르는 제1 경로를 생성하고, 상기 선박운행 데이터베이스를 참고하여 상기 제1 경로를 적어도 일부 수정한 제2 경로를 생성하는 것을 특징으로 할 수 있다.In addition, in the calculation device, the calculation unit generates a first route from the start point to the destination point of the ship by referring to at least one or more dynamic environment variables, and at least partially corrects the first route by referring to the ship operation database. It may be characterized by generating one second path.

또한, 상기 연산장치에 있어서 상기 선박운행 데이터베이스는, 과거의 선박운행 정보들이 강화 학습된 결과값들을 누적 저장하되, 상기 결과값들은, 복수 개의 노드들로 구성되는 임의의 경로에 있어서, 선박이 제s노드에서 제s+1노드(s는 1이상의 자연수)로 이동할 시 기대되는 보상을 최대화 하는 것을 목표로 하는 함수의 반복 연산에 의해 산출된 결과값들인 것을 특징으로 할 수 있다.In addition, in the calculation device, the ship operation database accumulates and stores result values obtained by reinforcement learning of past ship operation information, and the result values are determined by the ship in an arbitrary path composed of a plurality of nodes. It may be characterized in that they are result values calculated by iterative operation of a function aimed at maximizing an expected reward when moving from the s node to the s+1th node (s is a natural number of 1 or greater).

또한, 상기 연산장치에 있어서 상기 함수는In addition, in the arithmetic device, the function is

, where s is the node state, a is the action, γ is the discount rate, Q(s,a) is the action value function for calculating the expected value of the reward to be received when action a is taken in the specific node state s, and a' is It may be characterized as an action that maximizes the value of the action value function in the next node state (s').

본 발명에 따르면 선박의 도착예정시간을 보다 정확하게 연산해 낼 수 있는 효과가 있다. According to the present invention, there is an effect of calculating the estimated time of arrival of the ship more accurately.

또한, 선박의 도착예정시간을 정확하게 예측할 수 있어 항구에서 이루어지는 물류 작업들을 정시에 진행시킬 수 있으며, 이에 따라 전체 물류 과정에서의 불필요한 비용을 최소화 할 수 있는 효과도 있다. In addition, since the expected arrival time of the ship can be accurately predicted, logistics operations performed at the port can be performed on time, and thus, unnecessary costs in the entire logistics process can be minimized.

한편, 본 발명에 의한 효과는 이상에서 언급한 것들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.On the other hand, the effects of the present invention are not limited to those mentioned above, and other technical effects not mentioned will be clearly understood by those skilled in the art from the description below.

도 1은 본 발명에 따른 선박 도착예정시간 연산 방법을 이용함에 따른 결과물을 이해하기 쉽도록 간략히 도시한 것이다.
도 2는 본 발명에 따른 선박 도착예정시간 연산 방법을 순서에 따라 도시한 것이다.
도 3은 연산 방법의 단계들 중 특히 제2 경로를 생성하는 단계에서 참조되는 경로 맵의 일 예를 도시한 것이다.
도 4는 본 발명에서 활용하게 될 강화학습 알고리즘의 원리를 이해하기 위한 개념도이다.
도 5는 본 발명의 또 다른 실시예에 따른 도착예정시간 연산 방법을 도시한 것이다.
도 6은 본 발명에 따른 연산장치의 세부구성들을 도시한 것이다.1 is a simplified diagram for easy understanding of results obtained by using a method for calculating an estimated time of arrival of a ship according to the present invention.
2 shows a method for calculating the estimated time of arrival of a ship according to the present invention in sequence.
FIG. 3 illustrates an example of a route map referred to in the step of generating a second route among the steps of the calculation method.
4 is a conceptual diagram for understanding the principle of the reinforcement learning algorithm to be utilized in the present invention.
5 illustrates an ETA calculation method according to another embodiment of the present invention.
6 shows detailed configurations of an arithmetic device according to the present invention.

본 발명의 목적과 기술적 구성 및 그에 따른 작용 효과에 관한 자세한 사항은 본 발명의 명세서에 첨부된 도면에 의거한 이하의 상세한 설명에 의해 보다 명확하게 이해될 것이다. 첨부된 도면을 참조하여 본 발명에 따른 실시예를 상세하게 설명한다.Objects and technical configurations of the present invention and details of the operational effects thereof will be more clearly understood by the following detailed description based on the accompanying drawings in the specification of the present invention. An embodiment according to the present invention will be described in detail with reference to the accompanying drawings.

본 명세서에서 개시되는 실시예들은 본 발명의 범위를 한정하는 것으로 해석되거나 이용되지 않아야 할 것이다. 이 분야의 통상의 기술자에게 본 명세서의 실시예를 포함한 설명은 다양한 응용을 갖는다는 것이 당연하다. 따라서, 본 발명의 상세한 설명에 기재된 임의의 실시예들은 본 발명을 보다 잘 설명하기 위한 예시적인 것이며 본 발명의 범위가 실시예들로 한정되는 것을 의도하지 않는다.The embodiments disclosed herein should not be construed or used as limiting the scope of the present invention. It goes without saying that the description, including the embodiments herein, has a variety of applications for those skilled in the art. Therefore, any embodiments described in the detailed description of the present invention are illustrative for better explaining the present invention, and the scope of the present invention is not intended to be limited to the examples.

도면에 표시되고 아래에 설명되는 기능 블록들은 가능한 구현의 예들일 뿐이다. 다른 구현들에서는 상세한 설명의 사상 및 범위를 벗어나지 않는 범위에서 다른 기능 블록들이 사용될 수 있다. 또한, 본 발명의 하나 이상의 기능 블록이 개별 블록들로 표시되지만, 본 발명의 기능 블록들 중 하나 이상은 동일 기능을 실행하는 다양한 하드웨어 및 소프트웨어 구성들의 조합일 수 있다.The functional blocks shown in the drawings and described below are only examples of possible implementations. Other functional blocks may be used in other implementations without departing from the spirit and scope of the detailed description. Also, while one or more functional blocks of the present invention are represented as separate blocks, one or more of the functional blocks of the present invention may be a combination of various hardware and software configurations that perform the same function.

또한, 어떤 구성요소들을 포함한다는 표현은 “개방형”의 표현으로서 해당 구성요소들이 존재하는 것을 단순히 지칭할 뿐이며, 추가적인 구성요소들을 배제하는 것으로 이해되어서는 안 된다.In addition, the expression that certain components are included simply refers to the existence of the corresponding components as an expression of “open type”, and should not be understood as excluding additional components.

나아가 어떤 구성요소가 다른 구성요소에 “연결되어” 있다거나 “접속되어” 있다고 언급될 때에는, 그 다른 구성요소에 직접적으로 연결 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 한다. Furthermore, it should be understood that when a component is referred to as being “connected” or “connected” to another component, it may be directly connected or connected to the other component, but other components may exist in the middle. do.

도 1은 본 발명에 따른 선박 도착예정시간 연산 방법을 이용하였을 시 사용자가 경험할 수 있는 상황을 예시적으로 도시한 것으로, 더 정확하게는 어느 특정 선박의 경로 및 도착예정시간을 연산하는 과정을 이해하기 쉽게 도시한 것이다.1 is an example of a situation that a user may experience when using the method for calculating the estimated time of arrival of a ship according to the present invention. it is easily illustrated.

앞서도 언급하였듯 본 발명의 궁극적인 목적은 해상운송 수단인 선박의 도착예정시간을 보다 정확하게 예측하는 것이며, 이 방법의 세부단계 중에는 강화학습에 의해 학습된 데이터를 참고하는 단계가 포함된다. As mentioned above, the ultimate purpose of the present invention is to more accurately predict the expected arrival time of a ship, which is a means of maritime transportation, and among the detailed steps of this method, a step of referring to data learned by reinforcement learning is included.

도 1에는 특정 선박A가 출발지(S)로부터 목적지(D)까지 화물을 운송할 때에 예상 가능한 경로가 도시되어 있으며, 각각의 경로들에 대해서는 도착예정시간이 연산되어 기재되어 있다. 참고로 선박의 경로 및 도착예정시간을 연산하는 주체는 후술하게 될 연산장치가 될 것이며, 연산장치란 중앙처리유닛 및 메모리를 갖춘 일체의 장치를 모두 포함할 수 있다. 가장 먼저 경로①은 기본경로를 도시한 것으로, 다른 변수들의 고려 없이 선박이 운항할 수 있는 기본 항로를 따라 출발지에서 목적지까지의 경로를 표시한 것이다. 또한, 경로①에 대응되는 도착예상시간 역시 선박의 평균속력, 그리고 경로의 총 길이, 필요한 경우 기착점에서의 정박 시간이 참고되어 연산된 것이다. 한편, 경로②는 연산장치가 몇몇 변수들을 반영하여 생성해 낸 것으로, 이 때 반영되는 변수들의 종류에는 기상, 파고, 유가, 해양사고, 또는 해양교통량 이 포함될 수 있다. 즉, 경로②는 위와 같은 외부 환경적인 요인들이 반영된 것으로, 앞서의 경로①에 비해서는 정확도가 더 높다 할 수 있다. 마지막으로 경로③은 상기 경로②에 사전 강화학습에 의해 구축해 두었던 데이터베이스를 참고하여 수정된 것으로, 본 발명에 따른 선박 도착예정시간 연산 방법 중 가장 중요한 경로생성 단계라 할 수 있다. 상기 경로③은 인공지능 알고리즘, 그 중에서도 강화학습 방식을 택하고 있는 인공지능 알고리즘에 의해 복수 개의 과거 선박운행 정보들이 학습된 상태에서 경로②의 오차를 줄이기 위한 보정 과정을 거쳐 생성된 것으로, 당연히 경로③은 경로②에 비해 더 높은 정확도를 가진다. 참고로 도 1에는 경로③에 대응되는 선박 도착예정시간이 2021년 10월 31일 20시이고, 경로②에 대응되는 도착예정시간은 2021년 11월 1일 9시이며, 기본경로인 경로①에 대응되는 도착예정시간은 15:00로 기재되어 있는데, 이로부터는 인공지능 알고리즘을 활용한 결과물이 반영된 경로③이 가장 정확도가 높으면서도 가장 빠른 시간 내에 해상운송을 마칠 수 있는 경로임을 확인할 수 있다. 즉, 본 발명에 따른 선박 도착예정시간 연산 방법을 활용하는 경우 결과적으로 선박의 도착예정시간을 정확하게 예측할 수 있음은 물론, 선박의 해상 이동시간도 가장 짧게 유도할 수 있게 되므로 해상운송 사업의 다양한 분야에서 긍정적인 효과가 기대된다. 참고로 도 1에 도시되어 있는 경로①, ②, ③은 각각 후술될 상세한 설명에서 “기본경로”, “제1 경로”, “제2 경로”로 명명될 것이며, 특별히 다른 언급이 없는 경우 위 용어를 그대로 사용하는 것으로 이해하기로 한다.Figure 1 shows routes that can be expected when a specific ship A transports cargo from a departure point (S) to a destination (D), and the expected arrival time is calculated and described for each route. For reference, the subject that calculates the ship's route and estimated time of arrival will be an arithmetic device to be described later, and the arithmetic device may include both a central processing unit and any device equipped with a memory. First of all, the route ① shows the basic route, and shows the route from the departure point to the destination along the basic route that the ship can navigate without considering other variables. In addition, the expected arrival time corresponding to route ① is also calculated by referring to the average speed of the ship, the total length of the route, and, if necessary, the anchoring time at the stopping point. On the other hand, route ② is created by reflecting several variables by the computing device, and the types of variables reflected at this time may include weather, wave height, oil price, marine accidents, or marine traffic. In other words, the path ② reflects the above external environmental factors, and it can be said that the accuracy is higher than the previous path ①. Finally, route ③ is modified by referring to the database built by prior reinforcement learning in route ②, and it can be said to be the most important route generation step among the methods for calculating the expected arrival time of the ship according to the present invention. The route ③ was created through a correction process to reduce the error of route ② in the state where a plurality of past ship operation information was learned by an artificial intelligence algorithm, especially an artificial intelligence algorithm that adopts a reinforcement learning method. ③ has higher accuracy than path ②. For reference, in FIG. 1, the estimated time of arrival of the ship corresponding to route ③ is 20:00 on October 31, 2021, and the estimated time of arrival corresponding to route ② is 9:00 on November 1, 2021, and the basic route route ① The corresponding expected arrival time is listed as 15:00, and from this, it can be confirmed that the route ③ that reflects the result using the artificial intelligence algorithm is the route that can complete the maritime transportation in the fastest time with the highest accuracy. In other words, when the method for calculating the estimated time of arrival of a ship according to the present invention is used, as a result, it is possible to accurately predict the expected time of arrival of the ship and to derive the shortest maritime travel time of the ship, so that various fields of the maritime transportation business can be obtained. A positive effect is expected from For reference, routes ①, ②, and ③ shown in FIG. 1 will be named “basic route”, “first route”, and “second route” in the detailed description, respectively, unless otherwise specified. It is understood that the is used as it is.

도 2는 본 발명에 따른 선박 도착예정시간 연산 방법을 순서에 따라 도시한 것이다. 앞서 언급하였든, 이 연산 방법은 후술하게 될 연산장치에 의해 이루어질 수 있다. 이하에서는 각 단계들을 더 구체적으로 살펴보기로 한다.2 shows a method for calculating the estimated time of arrival of a ship according to the present invention in sequence. As mentioned above, this calculation method can be performed by an arithmetic device to be described later. In the following, each step will be examined in more detail.

도 2를 참고할 때, 선박 도착예정시간 연산 방법은 가장 먼저 정해진 주기마다 선박에 대한 위치정보를 수집하는 단계(S10)부터 시작될 수 있다. 선박의 위치정보는 다양한 소스를 통해 수집이 가능한데, 예를 들어 Marine Traffic 또는 Vessel Finder와 같은 국제 선박 위치 추적 서비스를 제공하는 데이터베이스 운영주체에 네트워크 접속을 통하여 선박 위치정보를 확보할 수 있다. 한편, 정해진 주기(period)는 사용자(또는 연산장치를 관리하는 관리자)의 설정에 따라 정해질 수 있는 것으로, 사용자는 예를 들어 6시간마다 현재 운항 중인 선박의 위치정보를 수집하도록 설정할 수 있다. 또한, 본 단계에서 수집되는 선박의 위치정보는 바람직하게는 GPS정보라 할 것이나, 반드시 이에 한정되는 것은 아니다. Referring to FIG. 2 , the method for calculating the estimated time of arrival of the ship may first start with the step of collecting location information about the ship at predetermined intervals (S10). Ship location information can be collected through various sources. For example, ship location information can be obtained through network access to a database operator that provides international ship location tracking services such as Marine Traffic or Vessel Finder. On the other hand, the predetermined period (period) can be determined according to the setting of the user (or the manager who manages the computing device), the user can set to collect the location information of the currently operating ship every 6 hours, for example. In addition, the location information of the ship collected in this step will preferably be referred to as GPS information, but is not necessarily limited thereto.

선박의 위치정보를 수집하는 단계 이후, 연산장치는 시작점으로부터 도착점에 이르는 기본경로를 적어도 하나 이상 생성(S20)할 수 있다. 기본경로란, 쉽게 말해 초기화 된 상태의 경로를 이르는 것으로 향후 알고리즘에 의한 연산의 대상이 될, 또는 수정의 대상이 될 경로를 의미할 수 있다. 상기 기본경로는 시작점 및 도착점이 정해져 있는 하나의 시종점 페어(pair)마다 미리 정해져 있을 수 있는데, 예를 들어 부산(한국)-밴쿠버(캐나다) 간에는 도 1의 경로①과 같은 기본경로가 미리 정해져 있을 수 있다. 또는 미리 정해져 있지 않은 경우에는 시작점으로부터 도착점까지 정해져 있는 항로 중 하나를 임의로 정하여 기본경로로 정의할 수도 있다. 참고로 위 실시예들은 S20 단계에서 기본경로를 생성하는 하나의 실시예일뿐이지 반드시 이에 한정되는 것은 아니며, 본 발명에 따른 선박 도착예정시간 연산 방법의 시작에 필요한 기본경로가 정의될 수 있다면 이로써 족하다 할 것이다.After the step of collecting the location information of the ship, the calculation device may generate at least one basic route from the starting point to the destination point (S20). The default path, simply put, refers to a path in an initialized state, and may mean a path that will be subject to operation or modification by an algorithm in the future. The basic route may be pre-determined for each start-and-end point pair in which a start point and an end point are determined. For example, a default route such as route ① in FIG. 1 is pre-determined between Busan (Korea) and Vancouver (Canada). There may be. Alternatively, if it is not determined in advance, one of the routes determined from the starting point to the destination point may be arbitrarily determined and defined as a basic route. For reference, the above embodiments are only one example of generating a basic route in step S20, but are not necessarily limited thereto, and it is sufficient if the basic route necessary for starting the method for calculating the expected arrival time of a ship according to the present invention can be defined. will be.

기본경로가 생성된 후에는 적어도 하나 이상의 동적환경변수를 참고하여 시작점으로부터 도착점에 이르는 제1 경로를 생성(S30)할 수 있다. 동적환경변수란, 선박이 해상에서 운항을 하는 데에 있어 영향을 미칠 수 있는 요소들을 의미하는 것으로, 상기 동적환경변수는 바람직하게는 숫자로 어느 특정 값, 또는 특정 범위를 나타낸 것이거나, 또는 문자로 나타내어진 것일 수 있다. 또한, 상기 동적환경변수의 종류에는 다양한 것들이 존재할 수 있겠으나, 기상정보에 관한 기상변수, 파고에 관한 파고변수, 국제유가에 관한 유가변수, 해양사고에 관한 사고변수, 또는 해양교통량에 관한 교통량변수는 기본적으로 포함될 수 있다. 위 동적환경변수 역시 연산장치가 종류에 관계 없이 온라인 네트워크를 통하여 수집할 수 있는데, 예를 들어 국제 해양 기상 데이터베이스, 기착점, 도착점 국가에서의 뉴스 데이터베이스, 국제 유가 데이터베이스 서비스를 제공하는 운영주체에 네트워크로 접근하여 정보들을 수집할 수 있고, 이렇게 수집된 정보들은 연산장치에 의해 다시 자체적으로 상기 연산장치 내에, 또는 상기 연산장치가 접근이 가능한 클라우드 저장 공간에 데이터베이스로 구축될 수 있다. 한편, 동적환경변수가 고려된 제1 경로는, 예를 들어 기상변수와 관련하여서는 심한 폭우가 예상되는 지역을 직접 통과하지 않도록 경로 중 일부가 수정되거나, 유가변수와 관련하여서는 경로 중 일부가 기본경로에 비해 더 짧아지도록 수정되거나, 교통량변수 관련하여서는 기착점 주변 교통량이 일정 수준 이하인 시점에 항구 접근이 가능하도록 경로를 일부 길게 수정하는 등의 과정을 거쳐 생성된 경로를 의미하는 것으로 이해될 수 있다. 상기 제1 경로 생성 과정은 바람직하게는 각 동적환경변수를 적용시키기 위한 관계식이 존재하여 각각의 수식이 기본경로 상에 반영이 되는 과정이 대수(對數)적 연산 또는 기하학적 연산에 의해 이루어지도록 구현될 수 있다. After the basic route is created, a first route from the starting point to the destination point may be created with reference to at least one or more dynamic environment variables (S30). Dynamic environmental variables refer to factors that can affect the operation of a ship at sea, and the dynamic environmental variables preferably represent a specific value or specific range in numbers, or characters may be represented by In addition, various types of dynamic environment variables may exist, but weather variables related to weather information, wave height variables related to wave heights, oil price variables related to international oil prices, accident variables related to marine accidents, or traffic volume variables related to marine traffic can be included by default. The above dynamic environmental variables can also be collected through an online network regardless of the type of computing device. It is possible to access and collect information, and the information collected in this way can be built into a database in the computing device itself again by the computing device or in a cloud storage space accessible by the computing device. On the other hand, in the first route considering dynamic environmental variables, for example, in relation to weather variables, some of the routes are modified so as not to directly pass through areas where heavy rain is expected, or in relation to oil price variables, some of the routes are basic routes. It can be understood to mean a route created through a process such as being modified to be shorter than , or partially lengthening the route so that port access is possible when the traffic volume around the stopover is below a certain level in relation to the traffic volume variable. Preferably, the first path generation process is implemented such that a relational expression for applying each dynamic environment variable exists, and a process in which each formula is reflected on the basic path is performed by algebraic or geometrical operation. can

한편, 제1 경로가 생성된 후에는 선박운행 데이터베이스를 참고하여 제1 경로를 적어도 일부 수정한 제2 경로가 생성(S40)될 수 있다. 즉, 본 단계는 이미 동적환경변수가 반영된 상태의 제1 경로가 있음에도 불구하고, 재차 별도의 전용 데이터베이스, 정확하게는 선박운행 데이터베이스를 참고하여 제1 경로를 수정함으로써 제2 경로를 생성하는 단계이며, 이 제2 경로를 생성하는 단계는 본 발명에 따른 연산 방법 중 주요한 단계라 할 수 있다. Meanwhile, after the first route is generated, a second route obtained by at least partially modifying the first route may be generated by referring to the ship operation database (S40). That is, this step is a step of creating a second route by modifying the first route by referring to a separate dedicated database, more precisely, a ship operation database, even though there is already a first route with dynamic environment variables reflected, The step of generating the second path may be regarded as a major step of the calculation method according to the present invention.

동적환경변수가 반영된 상태의 제1 경로라 하더라도, 실제 선박이 예측된 제1 경로를 따라 운항할 확률은 높지 않으며, 당연히 제1 경로를 기초로 산출된 선박의 도착예정시간 역시 정확도가 떨어진다. 이에 본 발명에서는 산출되어 있는 제1 경로를 한번 더 수정하되, 대신 과거의 선박운행 기록들이 강화 학습에 의해 학습된 결과값들이 저장된 선박운행 데이터베이스를 참고하여 수정이 이루어지게 함으로써 선박의 예상 운행 경로 및 도착예정시간을 보다 더 정확하게 예측해 보고자 한 것이다. 즉, 상기 제2 경로는 강화 학습에 의해 학습된 결과값들이 반영된 경로일 수 있다. Even in the first route with the dynamic environment variables reflected, the probability that the actual ship will navigate along the predicted first route is not high, and of course, the estimated arrival time of the ship calculated based on the first route is also less accurate. Therefore, in the present invention, the calculated first route is modified once more, but instead, the ship operation record is modified by referring to the ship operation database in which the result values learned by reinforcement learning are stored, so that the expected ship operation path and It was intended to predict the expected arrival time more accurately. That is, the second path may be a path in which result values learned by reinforcement learning are reflected.

이 때 상기 선박운행 데이터베이스에 저장되어 있는 결과값은, 과거 선박(또는 선박들)의 운항 정보를 강화 학습 하되, 복수 개의 노드들로 구성되는 임의의 운항 경로에 있어서 해당 선박이 제s노드에서 제s+1노드(이 때 s는 1 이상의 자연수)로 이동할 시 얻을 수 있는 보상을 최대화 하는 것을 목표로 하는 함수의 반복 연산에 의해 산출된 결과값일 수 있다. 이 때 보상이란 상기 선박의 운항에 따라 사용자가 얻게 될 이득으로도 정의가 가능할 수 있는데, 예를 들어 선박이 도착점에 안전하게 도착하였다는 사실, 선박이 운항함에 따라 얻게 되는 사용자의 경제적 이득 등이 보상의 한 예시가 될 수 있다. 이들 보상은 발명의 이해를 돕기 위해 문언적으로 설명이 되고 있으나, 실제 알고리즘에 따라 연산이 이루어지기 위해서는 상기 보상들이 당연히 수치화 되어야 함을 이해하기로 한다.At this time, the result value stored in the ship operation database is used to reinforce and learn the navigation information of past ships (or ships), but in an arbitrary navigation route consisting of a plurality of nodes, the ship is the first node at the s node. It may be a result value calculated by iterative operation of a function aimed at maximizing a reward that can be obtained when moving to node s+1 (where s is a natural number greater than or equal to 1). In this case, compensation can also be defined as a benefit that the user will obtain according to the operation of the ship. For example, the fact that the ship has safely arrived at the destination, the user's economic benefits obtained as the ship operates, etc. can be an example of Although these compensations are verbally explained to help the understanding of the invention, it will be understood that the compensations must be digitized in order to perform calculations according to actual algorithms.

한편, 앞서 언급한 함수, 즉 어느 특정 노드에서의 선박이 다음 노드로 진행할 때에 얻을 수 있는 보상을 최대화 하는 것을 목표로 하는 함수는 아래와 같은 수학식으로 나타내어질 수 있다.On the other hand, the aforementioned function, that is, a function aimed at maximizing a reward that can be obtained when a ship at a certain node proceeds to the next node can be expressed by the following equation.

위 수학식에서 s는 노드상태, a는 액션, γ는 할인율, Q(s,a)는 특정 노드상태 s에서 액션 a를 취할 경우 받을 보상의 기대값을 산출하기 위한 행동가치함수, a’은 다음 노드상태(s’)에서 행동가치함수의 값을 최대로 만드는 액션을 의미할 수 있다. S는 다시 말하면 선박이 현재 놓여져 있는 상태, a는 선박의 움직임을 의미할 수 있다. In the above equation, s is the node state, a is the action, γ is the discount rate, Q(s,a) is the action value function for calculating the expected value of the reward to be received when action a is taken in the specific node state s, and a' is the following It may mean an action that maximizes the value of the action value function in the node state (s'). In other words, S is the state where the ship is currently placed, and a may mean the movement of the ship.

또한, 위 수학식 1에 포함되어 있는 행동가치함수란 아래와 같이 정의될 수 있다.In addition, the action value function included in Equation 1 above may be defined as follows.

마찬가지로 위 수학식에서 s는 노드상태, a는 액션, E는 기대값, R은 보상, γ는 할인율, t는 시간을 나타내며, Q_π(s,a)는 특정 상태 s에서 액션 a를 취할 경우 받을 보상의 기대값을 산출하는 함수이다. Similarly, in the above equation, s is the node state, a is the action, E is the expected value, R is the reward, γ is the discount rate, t is the time, and Q _π (s,a) is the value received when action a is taken in a specific state s. It is a function that calculates the expected value of the reward.

도 3은 위 수학식 1에 의해 정의되고 있는 알고리즘을 쉽게 이해하기 위한 도면으로, 어느 특정 선박이 제1 노드로부터 제5 노드에 이르는 경로를 따라 운항 한다고 가정할 때 각 노드에서 선택할 수 있는 움직임, 그리고 그 움직임을 선택하였을 때 얻을 수 있는 보상을 임의로 기재한 것이다. 3 is a diagram for easily understanding the algorithm defined by Equation 1 above. Assuming that a specific ship navigates along a route from a first node to a fifth node, motions that can be selected at each node, And the reward that can be obtained when the movement is selected is arbitrarily described.

먼저 선박이 제4 노드에 놓여 있을 때, 혹은 제3 노드에 놓여 있을 때는 선박이 취할 수 있는 액션, 즉 움직임에 대한 선택의 여지가 없으므로 제3 노드 이후에는 상태 및 액션과 관계 없이 항상 보상이 동일한 것으로 연산될 것이다.First, when the ship is placed on the 4th node or the 3rd node, there is no choice about the action that the ship can take, that is, the movement, so the reward is always the same regardless of the state and action after the 3rd node. will be computed as

한편, 선박이 제2 노드상태에 놓여 있을 때, 선박은 제2-1 노드를 통과하여 제3 노드에 도달할 수 있는 액션, 그리고 제2-2 노드를 통과하여 제3 노드에 도달할 수 있는 액션, 또는 별도의 노드를 거치지 않고 곧바로 제3 노드에 도달할 수 있는 액션 중 하나를 선택할 수 있는데, 각각의 경우에 있어 보상은 (0.5+0.1), (0.2+0.1), (0.7)이 될 것이며, 이 때 보상을 최대화 하기 위한 선택은 0.7 보상이 주어지는 액션이어야 할 것이므로 제2 노드상태에서는 곧바로 제3 노드로 진행하는 선박 움직임이 선택될 것이다. On the other hand, when the ship is in the 2nd node state, the ship can pass through the 2-1 node to reach the 3rd node, and pass through the 2-2 node to reach the 3rd node. You can choose one of the actions or actions that can directly reach the third node without going through a separate node. In each case, the rewards will be (0.5+0.1), (0.2+0.1), (0.7). At this time, since the selection to maximize the reward should be an action given a 0.7 reward, the ship movement directly proceeding to the third node will be selected in the second node state.

다른 한편, 제1 노드상태에 놓여 있을 때에도 마찬가지로 선박은 제1-1 노드를 통과하여 제2 노드에 도달할 수 있는 액션, 그리고 곧바로 제2 노드로 도달할 수 있는 액션 중 하나를 선택할 수 있는데, 제1 노드상태에서는 더 큰 보상이 기대되는 곧바로 제2 노드로 진행하는 액션(보상 0.6)을 선택하게 될 것이다. 이 때 이미 선박이 제2 노드상태에 있을 때에는 0.7이라는 최대 보상이 기대되는 것이 확정된 상태이며, 또한 제1 노드상태에서의 액션(움직임)으로 얻을 수 있는 최대 보상은 0.6이므로, 결국 제1 노드로부터 제3 노드에 이르는 데에 얻을 수 있을 것으로 기대되는 보상은 1.3이 될 것이다. On the other hand, even when placed in the first node state, the ship can select one of the actions that can reach the second node through the 1-1 node and the action that can directly reach the second node. In the first node state, an action (reward 0.6) to proceed to the second node will be selected as soon as a larger reward is expected. At this time, when the ship is already in the second node state, it is confirmed that the maximum reward of 0.7 is expected, and the maximum reward that can be obtained by action (movement) in the first node state is 0.6. The reward expected to be obtained for reaching the third node from V would be 1.3.

한편, 이와 같은 방식으로 상기 수학식 1의 함수에 따른 연산이 반복될 수 있으며, 그 결과값은 앞서 언급한 선박운행 데이터베이스에 저장될 수 있다. 이 때, 저장되는 결과값의 형태는 텍스트 및 숫자의 조합으로 된 것일 수도 있겠으나, 경우에 따라 도 4에 도시되어 있는 것과 같은 경로 맵의 형태를 가질 수도 있다. Meanwhile, in this way, the operation according to the function of Equation 1 may be repeated, and the resultant value may be stored in the aforementioned ship operation database. At this time, the form of the stored result value may be a combination of text and numbers, but may have the form of a route map as shown in FIG. 4 in some cases.

도 4에는 시작점(S)로부터 도착점(D)에 이르는 경로가 표시된 경로 맵이 예시적으로 도시되어 있는데, 선박운행 데이터베이스에는 이와 같은 경로 맵들이 누적적으로 저장이 되어 있을 수 있으며, 상기 제2 경로를 생성하는 단계에서는 이렇게 저장된 경로 맵들 중 어느 하나가 참고용으로 선별되어 실제 제2 경로를 생성하는 데에 활용될 수 있다. 각각의 경로 맵들은 선박이 놓여 있는 노드상태, 선박이 다음 노드상태로 진행하는 데에 취할 수 있는 액션(움직임), 그리고 액션에 따른 보상 등 조건이 상이할 수 있으며, 제2 경로를 생성하는 단계에서는 현재 선박이 놓여 있는 상태(동적환경변수가 고려된 상태도 포함함)와 유사한 상태의 경로 맵을 탐색하여 탐색된 경로 맵을 참고할 수 있다.4 shows an exemplary route map showing a route from the starting point S to the ending point D. Such route maps may be cumulatively stored in the ship operation database, and the second route In the step of generating, any one of the route maps stored in this way may be selected for reference and used to generate the actual second route. Each route map may have different conditions, such as the node state where the ship is located, the action (movement) that the ship can take to proceed to the next node state, and the compensation according to the action. Creating a second route In , a route map in a state similar to the state in which the ship is currently placed (including a state in which dynamic environment variables are considered) can be searched and the searched route map can be referred to.

참고로, 앞서 설명한 S40 단계는 제2 경로를 생성할 때에 강화 학습의 결과값들이 누적 저장된 선박운행 데이터베이스를 참고하여 제2 경로를 생성할 수 있다고 설명하였는데, 이 때 상기 제2 경로를 생성하기 위한 알고리즘 그 자체가 인공지능 알고리즘으로 구현될 수도 있다. 즉, S40 단계에서 제2 경로는 인공지능 알고리즘에 의해 생성되되, 이 때 인공지능 알고리즘은 과거 선박운행 정보들을 강화 학습하여 학습이 되어 있는 상태의 것일 수 있으며, 이를 통해 직접적으로 현재 선박의 상태, 현재 선박이 취할 수 있는 액션, 기대되는 보상 등을 입력으로 하여 결과값인 제2 경로를 산출해 낼 수 있다. For reference, in step S40 described above, when generating the second route, it has been explained that the second route can be generated by referring to the ship operation database in which the results of reinforcement learning are accumulated and stored. The algorithm itself may be implemented as an artificial intelligence algorithm. That is, in step S40, the second path is generated by an artificial intelligence algorithm, and at this time, the artificial intelligence algorithm may be in a state in which it has been learned by reinforcement learning of past ship operation information, and through this, directly the current state of the ship, A second route, which is a result value, can be calculated by taking an action that the current ship can take, an expected reward, and the like as inputs.

한편, 상기 선박운행 데이터베이스를 구축하는 과정에서의 연산, 또는 제2 경로를 직접 생성하는 과정에서의 강화 학습 알고리즘은 복수 회가 반복될 수 있으며, 반복될 때마다 보상의 종류를 달리 함으로써 사용자가 원하는 변수에 가중치가 주어진 상태의 경로가 생성되도록 할 수도 있다.On the other hand, the operation in the process of constructing the ship operation database or the reinforcement learning algorithm in the process of directly generating the second route may be repeated multiple times, and each time it is repeated, the type of compensation is changed so that the user wants It is also possible to generate a path with a variable weighted state.

한편, 제2 경로가 생성된 이후에는 최종경로를 결정하는 단계(S50) 및 도착예정시간 산출 단계(S60)가 실행될 수 있다. 최종경로는 연산장치에 의해 생성된 제2 경로가 사용자가 정한 조건을 만족하는지 여부에 따라 결정될 수 있으며, 생성된 제2 경로가 조건을 만족하는 경우에는 해당 제2 경로가 최종경로가 될 것이고, 그렇지 않다면 재차 제1 경로를 생성하는 단계 또는 제2 경로를 생성하는 단계로 회귀함으로써 다른 경로를 최종경로로 결정하게 할 수도 있다. 한편, 도착예정시간은 최종경로가 결정된 후 그 최종경로에 따라 산출이 될 것이다.Meanwhile, after the second route is generated, the step of determining the final route (S50) and the step of calculating the estimated time of arrival (S60) may be executed. The final route may be determined according to whether the second route generated by the computing device satisfies a condition set by the user, and if the generated second route satisfies the condition, the corresponding second route becomes the final route, Otherwise, another route may be determined as the final route by returning to the step of generating the first route or the step of generating the second route again. Meanwhile, the estimated time of arrival will be calculated according to the final route after the final route is determined.

참고로 도 2에 나열되어 있는 과정들은 사용자가 정한 주기에 따라 반복적으로 실행될 수 있다. 예를 들어 선박의 위치정보를 수집하는 주기에 맞추어 그 수집 시점을 하나의 노드로 간주하고, 그 노드상태에 따라 기본경로, 제1 경로, 또는 제2 경로를 생성하는 단계가 수행될 수 있다.For reference, the processes listed in FIG. 2 may be repeatedly executed according to a user-determined period. For example, a step of considering a collection point as one node according to a cycle of collecting ship location information and generating a basic route, a first route, or a second route according to the state of the node may be performed.

또한, 참고로 상기 결과값을 산출해 내는 데에 사용되는 함수, 즉 알고리즘은 앞서 설명한 함수의 반복 과정을 통해 학습이 이루어질 수 있으며, 이는 상기 제2 경로를 직접 생성해 내는 알고리즘에 대해서도 마찬가지라 할 것이다. 알고리즘을 학습 시키는 방법은 앞서 선박운행 데이터베이스 내 결과값들을 산출해 내는 과정과 실질적으로 동일하므로 자세한 설명은 생략하기로 한다.In addition, for reference, the function used to calculate the result value, that is, the algorithm, can be learned through the iterative process of the function described above, which can be said to be the same for the algorithm that directly generates the second path. will be. Since the method of learning the algorithm is substantially the same as the process of calculating the result values in the ship operation database, a detailed description will be omitted.

이상 도 2를 참고하여 본 발명에 따른 선박 도착예정시간 연산 방법의 일 실시예를 살펴 보았다.Referring to FIG. 2 above, an embodiment of a method for calculating an estimated time of arrival of a ship according to the present invention has been looked at.

도 5는 본 발명에 따른 선박 도착예정시간 연산 방법의 또 다른 실시예를 도시한 것이다. 도 5의 실시예 중 기본경로를 생성하는 단계(S200)는 앞서 도 2에서의 S20 단계와 대응이 되는 것이므로 여기서는 자세한 설명을 생략하기로 한다. 5 shows another embodiment of a method for calculating the estimated time of arrival of a ship according to the present invention. Step S200 of generating a default path in the embodiment of FIG. 5 corresponds to step S20 of FIG. 2, and thus a detailed description thereof will be omitted.

한편, S200 단계 이후에는 연산장치 상에서 동적환경변수가 참고된 제1경로가 생성(S300)될 수 있고, 가중치가 반영된 이동조건 입력(S400)이 있을 수 있으며, 각 경로의 노드 별 비용계산(S450)이 이루어질 수 있다. 예를 들어 시간 또는 유류값 등에 가중치가 반영된 이동조건 입력이 있을 수 있으며, 연산장치는 가중치가 반영된 이동조건에 따라 각 이동경로의 비용을 노드(node) 별로 계산할 수 있다. 한편, 이 단계에서는 앞서 도 2에 대한 설명에서 언급하였던 것처럼 선박운행 데이터베이스에 저장된 결과값들이 참조될 수 있다. 이 중 가중치가 반영된 이동조건의 입력(S400) 단계와 각 경로의 노드 별 비용 계산 단계(S450)는 도 2에서 설명하였던 선박운행 데이터베이스를 참고한 제2 경로 생성 단계(S40)의 예시적인 세부 실시예로 이해될 수 있다. 또한, 도 5의 S400 단계에서는 사용자가 임의로 특정 이동조건을 입력할 수 있도록, 즉 연산장치가 사용자로부터의 입력을 수신하도록 인터페이스가 구현된 상태일 수도 있겠으나, 더 바람직하게는 알고리즘에 의해 자동적으로 특정 가중치가 반영된 이동조건들이 반복적으로 입력되도록 함으로써 반복적인 연산이 이루어지게 할 수 있다. 즉, 알고리즘에 의한 시뮬레이션이 반복적으로, 그리고 자동적으로 이루어지게 할 수 있다. On the other hand, after step S200, a first route with reference to dynamic environment variables may be created on the computing device (S300), there may be a movement condition input with weights reflected (S400), and cost calculation for each node of each route (S450). ) can be made. For example, there may be a movement condition input in which weights are reflected, such as time or oil value, and the computing device may calculate the cost of each movement route for each node according to the weighted movement condition. Meanwhile, in this step, as mentioned above in the description of FIG. 2 , result values stored in the ship operation database may be referred to. Among them, the step of inputting the movement conditions with weights reflected (S400) and the step of calculating the cost for each node of each route (S450) is an exemplary detailed implementation of the step of generating the second route (S40) with reference to the ship operation database described in FIG. 2. can be understood as an example. In addition, in step S400 of FIG. 5, the interface may be implemented so that the user can arbitrarily input a specific movement condition, that is, the calculator receives input from the user, but more preferably automatically by an algorithm. Repetitive calculations can be performed by repeatedly inputting movement conditions in which specific weights are reflected. That is, the simulation by the algorithm can be made repeatedly and automatically.

S450 단계 이후에는 이동경로가 최저비용 조건을 만족하는지 판단(S500)하고, 그 판단 결과에 따라 최종경로를 생성하는 단계(S600), 혹은 다시 S300 단계 또는 S400 단계로 회귀하여 다른 경로를 생성하게 할 수 있다. 이동경로가 최저비용 조건을 만족하는지 여부를 판단하고, 이에 따라 최종경로를 생성하는 단계는 도 2 중 S50 단계의 일 세부실시예로 이해될 수 있다. 이 때 비용이라는 용어는 알고리즘의 실행에 따라 산출된 특정 이동경로에 있어서 해당 이동경로를 따라 선박이 운행되었을 때 소요되는 비용을 모두 포함하는 것으로 이해될 수 있으며, 여기에는 예를 들어 선박운영 및/또는 항해에 필요한 인력을 사용하는 데에 소요되는 비용, 유류 비용, 항만에 정박하는 데에 소요되는 비용, 또는 선박의 유지관리에 소요되는 비용 등 여러 종류의 비용들이 포함될 수 있다. 또한, 상기 S500 단계에서 임의의 이동경로가 최저비용 조건을 만족하는지 판단하는 단계는, 예를 들어 기 산출된 타 이동경로들 대비 비용이 최저인지를 판단하는 과정, 또는 사용자에 의해 임의로 정해진 최저비용 대비 산출된 이동경로의 비용이 더 큰 값을 가지는지를 판단하는 과정 등 다양한 과정으로 구현될 수 있다. 또한, 상기 S500 단계는 반드시 비용을 기준으로 판단하는 과정으로만 구현되는 것은 아니며, 최종경로를 생성함에 있어 사용자가 정한 가중치 항목, 또는 그 밖에 최종경로를 고려함에 있어 필요한 항목들이 기준이 될 수 있다는 점도 이해하기로 한다. After step S450, it is determined whether the moving route satisfies the minimum cost condition (S500), and according to the determination result, a final route is generated (S600), or another route is generated by returning to step S300 or step S400 again. can The step of determining whether the movement path satisfies the minimum cost condition and generating the final path accordingly can be understood as a detailed embodiment of step S50 in FIG. 2 . In this case, the term cost can be understood as including all costs required when the ship is operated along the corresponding movement route in a specific movement route calculated according to the execution of the algorithm, which includes, for example, ship operation and / Alternatively, various types of costs may be included, such as the cost of using manpower required for navigation, the cost of oil, the cost of docking in a port, or the cost of maintaining a ship. In addition, the step of determining whether a certain movement path satisfies the minimum cost condition in step S500 is, for example, a process of determining whether the cost is the lowest compared to other previously calculated movement routes, or a minimum cost arbitrarily set by the user. It can be implemented in various processes, such as a process of determining whether the cost of the travel path calculated in contrast has a larger value. In addition, the step S500 is not necessarily implemented only as a process of determining on the basis of cost, and in generating the final route, the weighted items determined by the user or other items necessary for considering the final route can be the criteria. to understand the point.

도 6은 본 발명에 따른 연산장치(200)를 도시한 것으로, 본 발명에 따른 선박 도착예정시간 연산 방법의 실행 주체가 되는 연산장치(200)는 크게 정보수집부(201), 연산부(202), 결정부(203), 및 제어부(206)를 포함할 수 있으며, 동적환경 데이터베이스(204), 및 선박운행 데이터베이스(205)를 더 포함할 수 있다.6 shows an arithmetic device 200 according to the present invention, and the arithmetic device 200, which is the subject of execution of the method for calculating the estimated time of arrival of a ship according to the present invention, is largely divided into an information collection unit 201 and a calculation unit 202. , a decision unit 203, and a control unit 206, and may further include a dynamic environment database 204 and a ship operation database 205.

정보수집부(201)는 본 발명에 따른 연산에 필요한 정보들을 수집 및 저장하는 구성으로, 선박에 대한 위치정보, 또는 동적환경변수와 관련된 정보들을 제공하는 주체에 접근하여 정보를 수집하는 구성이다. 또한 수집된 정보들은 데이터베이스 내에 저장하여 추후 연산 과정에서 활용이 가능하게 한다.The information collection unit 201 is a component that collects and stores information necessary for calculation according to the present invention, and is a component that collects information by accessing a subject that provides location information about a ship or information related to dynamic environment variables. In addition, the collected information is stored in the database so that it can be used in the future calculation process.

연산부(202)는 적어도 하나 이상의 동적환경변수, 및/또는 선박운행 데이터베이스를 참고하여 상기 선박이 시작점으로부터 도착점에 이르는 경로를 연산하는 구성으로, 위 연산부(202)는 기본경로, 제1 경로, 또는 제2 경로를 생성할 수 있다. 각 경로들의 생성 과정에 대해서는 앞서 설명하였으므로 자세한 설명을 생략하기로 한다.The calculation unit 202 is a component that calculates a route from a starting point to a destination point by referring to at least one dynamic environment variable and/or a ship operation database. The calculation unit 202 calculates a basic route, a first route, or A second route can be created. Since the process of generating each path has been described above, a detailed description thereof will be omitted.

결정부(203)는 상기 연산부(202)에 의해 연산된 복수 개의 경로들 중 어느 하나를 최종경로로 결정하는 구성으로, 이 결정에 있어서는 사용자가 미리 지정해 둔 기준값 등이 존재할 수 있다.The determination unit 203 is a component that determines one of the plurality of paths calculated by the calculation unit 202 as the final path, and in this determination, a reference value or the like previously designated by the user may exist.

동적환경 데이터베이스(204)는 용어 그대로 상기 정보수집부(201)에 의해 수집된 정보들이 저장되는 구성이며, 선박운행 데이터베이스(205)는 강화 학습 함수에 의해 따른 결과값들, 바람직하게는 경로 맵(map)들이 저장될 수 있는 구성이며, 이 때 함수는 앞서 설명한 수학식 1 및 수학식 2에 의해 정의될 수 있다. The dynamic environment database 204, as the term implies, is a configuration in which the information collected by the information collection unit 201 is stored, and the ship operation database 205 is the result values followed by the reinforcement learning function, preferably a route map ( map) is a configuration that can be stored, and at this time, the function can be defined by Equations 1 and 2 described above.

마지막으로 연산장치(200)는 제어부(206)를 포함할 수 있는데, 이 제어부는 실질적으로는 중앙처리유닛에 해당할 수 있다. 중앙처리유닛은 컨트롤러(controller), 마이크로 컨트롤러(microcontroller), 마이크로 프로세서(microprocessor), 마이크로 컴퓨터(microcomputer) 등으로도 불릴 수 있다. 또한 중앙처리유닛은 하드웨어(hardware) 또는 펌웨어(firmware), 소프트웨어, 또는 이들의 결합에 의해 구현될 수 있는데, 하드웨어를 이용하여 구현하는 경우에는 ASIC(application specific integrated circuit) 또는 DSP(digital signal processor), DSPD(digital signal processing device), PLD(programmable logic device), FPGA(field programmable gate array) 등으로, 펌웨어나 소프트웨어를 이용하여 구현하는 경우에는 위와 같은 기능 또는 동작들을 수행하는 모듈, 절차 또는 함수 등을 포함하도록 펌웨어나 소프트웨어가 구성될 수 있다. 또한, 상기 서비스 서버(200)는 메모리도 당연한 구성으로 포함할 수 있는데, 메모리는 ROM(Read Only Memory), RAM(Random Access Memory), EPROM(Erasable Programmable Read Only Memory), EEPROM(Electrically Erasable Programmable Read-Only Memory), 플래쉬(flash) 메모리, SRAM(Static RAM), HDD(Hard Disk Drive), SSD(Solid State Drive) 등으로 구현될 수 있다.Finally, the arithmetic device 200 may include a control unit 206, which may actually correspond to a central processing unit. The central processing unit may also be called a controller, a microcontroller, a microprocessor, a microcomputer, and the like. In addition, the central processing unit may be implemented by hardware, firmware, software, or a combination thereof. When implemented using hardware, an ASIC (application specific integrated circuit) or DSP (digital signal processor) , DSPD (digital signal processing device), PLD (programmable logic device), FPGA (field programmable gate array), etc., when implemented using firmware or software, modules, procedures, or functions that perform the above functions or operations Firmware or software may be configured to include. In addition, the service server 200 may also include a memory as a matter of course. The memory includes read only memory (ROM), random access memory (RAM), erasable programmable read only memory (EPROM), and electrically erasable programmable read (EEPROM) memory. -Only Memory), flash memory, SRAM (Static RAM), HDD (Hard Disk Drive), SSD (Solid State Drive), etc.

이상 본 발명에 따른 선박 도착예정시간 연산 방법 및 연산장치에 대해 살펴보았다. 한편, 본 발명은 상술한 특정의 실시예 및 응용예에 한정되지 아니하며, 청구범위에서 청구하는 본 발명의 요지를 벗어남이 없이 당해 발명이 속하는 기술분야에서 통상의 지식을 가진 자에 의해 다양한 변형실시가 가능한 것은 물론이고, 이러한 변형실시들은 본 발명의 기술적 사상이나 전망으로부터 구별되어 이해되어서는 안 될 것이다.Above, the ship estimated time of arrival calculation method and calculation device according to the present invention have been looked at. On the other hand, the present invention is not limited to the specific embodiments and application examples described above, and various modifications may be carried out by those skilled in the art to which the present invention belongs without departing from the gist of the present invention claimed in the claims. Of course, these modifications should not be understood separately from the technical spirit or perspective of the present invention.

200 연산장치
201 정보수집부
202 연산부
203 결정부
204 동적환경 데이터베이스
205 선박운행 데이터베이스
206 제어부200 arithmetic unit
201 Information collection department
202 calculation unit
203 Determination Department
204 Dynamic environment database
205 Ship Operation Database
206 Control

Claims

In the method of calculating the estimated time of arrival of the ship,
Collecting location information about the vessel at a predetermined period;
generating a first route from a starting point to a destination point by referring to at least one dynamic environment variable;
generating a second route by at least partially modifying the first route by referring to a ship operation database;
determining the second route as a final route when the generated second route satisfies a predetermined condition; and
Calculating an estimated ship arrival time based on the final route;
including,
How to calculate the ship's expected arrival time.

According to claim 1,
The dynamic environment variable,
Characterized in that it includes at least one of weather variables related to weather information, wave height variables related to wave heights, oil price variables related to international oil prices, accident variables related to marine accidents, or traffic volume variables related to marine traffic,
Estimated arrival time calculation method.

According to claim 1,
Before the step of generating the first path,
Generating at least one basic path from the starting point to the destination point; further comprising,
Characterized in that the first path is generated by applying at least one dynamic environment variable to the basic path.
Estimated arrival time calculation method.

According to claim 1,
The ship operation database,
Characterized in that the database in which the results of reinforcement learning of past ship operation information are accumulated and stored,
Estimated arrival time calculation method.

According to claim 4,
The result values accumulated and stored in the ship operation database are,
In an arbitrary path composed of a plurality of nodes, when a ship moves from the s-th node to the s+1-th node (s is a natural number greater than or equal to 1), by iterative operation of a function aimed at maximizing the expected reward Characterized in that the calculated result values are
Estimated arrival time calculation method.

According to claim 5,
The function

defined as,
s is the node state, a is the action, γ is the discount rate, Q(s,a) is the action value function for calculating the expected value of the reward to be received when action a is taken in a specific node state s, and a' is the next node state ( Characterized in that it is an action that maximizes the value of the action value function in s'),
How to calculate the ship's expected arrival time.

According to claim 6,
The action value function is

defined as,
s is node state, a is action, E is expected value, R is reward, γ is discount rate, t is time,
Characterized in that Q _π (s, a) is a function for calculating the expected value of the reward to be received when action a is taken in a specific state s,
Estimated arrival time calculation method.

According to claim 1,
The step of generating the second path,
It is executed by an artificial intelligence algorithm, characterized in that the artificial intelligence algorithm is reinforced based on a ship operation database.
Estimated arrival time calculation method.

According to claim 1,
The step of generating the second path,
Characterized in that a plurality of second paths are generated according to which one of the plurality of input variables is weighted,
Estimated arrival time calculation method.

According to claim 9,
Characterized in that the final route is determined as a second route with the lowest cost required for navigation among the plurality of second routes,
Estimated arrival time calculation method.

In the reinforcement learning method for an algorithm for generating a ship's path,
loading past ship operation information; and
In the past ship operation information, repeatedly executing a function for calculating an expected value of compensation according to a navigation result from a starting point to a destination point of a ship for a plurality of virtual routes;
including,
A method for reinforcement learning of an algorithm for generating a ship's path.

According to claim 11,
The path from the starting point to the ending point includes a plurality of nodes,
The step of repeatedly executing the function is a step of repeatedly executing a function aimed at maximizing an expected reward when the ship moves from the s-th node to the s+1-th node (s is a natural number greater than or equal to 1). doing,
A method for reinforcement learning of an algorithm for generating a ship's path.

According to claim 12,
The function is repeatedly executed for the plurality of virtual paths,
Characterized in that, in each iterative execution step, variables are applied differently to maximize the expected value of the reward.
A method for reinforcement learning of an algorithm for generating a ship's path.

According to claim 13,
As a result of repeatedly executing the function for the plurality of virtual paths,
selecting virtual routes having a calculated expected reward value greater than or equal to a reference value and converting them into a database;
Including more,
A method for reinforcement learning of an algorithm for generating a ship's path.

In the calculation device for calculating the estimated time of arrival of the ship,
an information collection unit that collects location information about the vessel or information related to dynamic environment variables;
a calculation unit for calculating a path from a start point to a destination of the ship by referring to at least one dynamic environment variable or a ship operation database;
a determination unit which determines one of the plurality of paths calculated by the calculation unit as a final path;
a dynamic environment database in which information related to dynamic environment variables is stored;
a control unit controlling the information collection unit, the calculation unit, the determination unit, and the dynamic environment database;
including,
computing device.

According to claim 15,
Further comprising a ship operation database for storing past ship operation information,
computing device.

According to claim 16,
The calculation unit,
Creating a first route from the starting point to the destination by referring to at least one dynamic environment variable, and generating a second route by at least partially modifying the first route by referring to the ship operation database ,
computing device.

According to claim 16,
The ship operation database accumulates and stores result values obtained by reinforcement learning of past ship operation information,
The result values are a function that aims to maximize the reward expected when the ship moves from the s-th node to the s+1-th node (s is a natural number greater than or equal to 1) in an arbitrary path composed of a plurality of nodes. Characterized in that the result values calculated by the iterative operation of
computing device.

According to claim 18,
The function

defined as,
s is the node state, a is the action, γ is the discount rate, Q(s,a) is the action value function for calculating the expected value of the reward to be received when action a is taken in a specific node state s, and a' is the next node state ( Characterized in that it is an action that maximizes the value of the action value function in s'),
computing device.

According to claim 19,
The action value function is

defined as,
s is node state, a is action, E is expected value, R is reward, γ is discount rate, t is time,
Characterized in that Q _π (s, a) is a function for calculating the expected value of the reward to be received when action a is taken in a specific state s,
computing device.