KR102619471B1

KR102619471B1 - Method and apparatus for evaluating product information

Info

Publication number: KR102619471B1
Application number: KR1020210031987A
Authority: KR
Inventors: 김준경
Original assignee: 주식회사 미스터카멜
Priority date: 2021-03-11
Filing date: 2021-03-11
Publication date: 2023-12-28
Also published as: KR20220127552A

Abstract

본 개시의 일 실시예에 따라 적어도 하나의 프로세서를 포함하는 컴퓨팅 장치에 의해 수행되는 상품 정보 평가 방법으로서, 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성하는 단계; 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출하는 단계; 및 상기 가격 정보에 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성하는 단계를 포함할 수 있다.A product information evaluation method performed by a computing device including at least one processor according to an embodiment of the present disclosure, comprising: generating standardized product data corresponding to unstructured product text; Calculating price information based on a product data set including a plurality of product data; And it may include generating evaluation information corresponding to at least one product data included in the product data set based on the price information.

Description

{METHOD AND APPARATUS FOR EVALUATING PRODUCT INFORMATION}

본 발명은 상품 정보를 평가하는 방법에 관한 것으로서, 보다 구체적으로는 컴퓨팅 장치를 이용해 비정형 데이터인 텍스트로부터 상품 정보에 대한 평가 정보를 생성하는 방법에 관한 것이다.The present invention relates to a method of evaluating product information, and more specifically, to a method of generating evaluation information about product information from text, which is unstructured data, using a computing device.

당업계에는 비정형 데이터를 정형 데이터로 가공하고 가공된 정형 데이터로부터 유의미한 결과를 도출하기 위한 다양한 평가 기법이 존재한다.In the industry, there are various evaluation techniques for processing unstructured data into structured data and deriving meaningful results from the processed structured data.

빅데이터에 기반하여 정형 데이터를 평가하기 위한 평가 기법 중에서도 매매 중개 서비스는 판매자와 구매자의 입장 차이를 고려한 평가가 수행되어야 한다는 점에 특징이 있다.Among evaluation techniques for evaluating structured data based on big data, trading brokerage services are characterized by the fact that evaluations must be performed taking into account differences in the positions of sellers and buyers.

그러나, 종래 기술의 경우 판매를 위해 데이터를 생성한 판매자의 입장에서만 주된 평가가 이뤄지는 평가 기법이 존재하였으며 이에 따라 판매자와 구매자의 관점 차이를 좁히고, 보다 실효적인 평가 방법에 대한 수요가 지속적으로 존재해왔다.However, in the case of the prior art, there was an evaluation technique in which the main evaluation was made only from the perspective of the seller who created the data for sale. Accordingly, there has been a continuous demand for a more effective evaluation method that narrows the gap between the perspectives of the seller and the buyer. .

한국등록특허 KR10-2194528은 담보자산 가치평가 방법, 장치 및 프로그램을 개시하고 있다.Korean registered patent KR10-2194528 discloses a collateral asset valuation method, device, and program.

본 개시는 전술한 배경기술에 대응하여 안출 된 것으로, 상품 정보 평가 방법의 제공을 목적으로 한다.The present disclosure has been devised in response to the above-mentioned background technology, and its purpose is to provide a method for evaluating product information.

전술한 바와 같은 과제를 실현하기 위한 본 개시의 일 실시예에 따라 적어도 하나의 프로세서를 포함하는 컴퓨팅 장치에 의해 수행되는 상품 정보 평가 방법으로서, 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성하는 단계; 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출하는 단계; 및 상기 가격 정보에 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성하는 단계를 포함할 수 있다.A product information evaluation method performed by a computing device including at least one processor according to an embodiment of the present disclosure for realizing the above-described problem, comprising the step of generating standardized product data corresponding to unstructured product text. ; Calculating price information based on a product data set including a plurality of product data; And it may include generating evaluation information corresponding to at least one product data included in the product data set based on the price information.

대안적인 실시예에서, 상기 상품 데이터를 생성하는 단계는, 적어도 하나의 평가 인덱스 - 상기 적어도 하나의 평가 인덱스 각각에는 적어도 하나의 비교 대상 토큰이 할당됨 - 를 포함하는 평가 리스트에 기초하여, 상기 비정형 상품 텍스트로부터 정형화된 상품 데이터를 생성하는 단계를 포함할 수 있다.In an alternative embodiment, generating the product data comprises: based on a rating list comprising at least one rating index, each of the at least one rating index being assigned at least one token to be compared; It may include generating standardized product data from product text.

대안적인 실시예에서, 상기 상품 데이터는, 적어도 하나의 비교 결과 토큰을 포함하고, 상기 비교 결과 토큰은, 상기 평가 인덱스에 할당된 비교 대상 토큰을 상기 비정형 상품 텍스트와 비교한 결과에 기초하여 획득될 수 있다.In an alternative embodiment, the product data includes at least one comparison result token, wherein the comparison result token is obtained based on a result of comparing a comparison target token assigned to the evaluation index with the unstructured product text. You can.

대안적인 실시예에서, 상기 방법은, 상기 상품 데이터 셋에 포함된 복수의 상품 데이터 각각의 진위 정보를 산출하는 단계; 및 상기 진위 정보에 기초하여 적어도 하나의 상품 데이터를 상기 상품 데이터 셋에서 제거하는 단계를 더 포함할 수 있다.In an alternative embodiment, the method includes calculating authenticity information for each of a plurality of product data included in the product data set; And it may further include removing at least one product data from the product data set based on the authenticity information.

대안적인 실시예에서, 상기 진위 정보를 산출하는 단계는, 상품 정보 평가를 위한 평가 리스트에 포함된 허위(fake) 인덱스에 기초하여 상기 비정형 상품 텍스트를 검사하는 단계를 포함할 수 있다.In an alternative embodiment, calculating the authenticity information may include inspecting the unstructured product text based on a fake index included in an evaluation list for evaluating product information.

대안적인 실시예에서, 상기 진위 정보를 산출하는 단계는, 상기 가격 정보에 기초하여 수행될 수 있다.In an alternative embodiment, calculating the authenticity information may be performed based on the price information.

대안적인 실시예에서, 상기 가격 정보를 산출하는 단계는, 상품 가격에 따른 상품 수량 분포를 산출하는 단계; 및 상기 상품 수량 분포 및 적어도 하나의 가격 기준값에 기초하여 가격 정보를 산출하는 단계를 포함할 수 있다.In an alternative embodiment, calculating the price information includes calculating a product quantity distribution according to product price; And it may include calculating price information based on the product quantity distribution and at least one price reference value.

대안적인 실시예에서, 상기 적어도 하나의 가격 기준값은, 상기 상품 수량 분포로부터 계산되는 통계값에 기초하여 결정될 수 있다.In an alternative embodiment, the at least one price reference value may be determined based on a statistical value calculated from the product quantity distribution.

대안적인 실시예에서, 상기 적어도 하나의 가격 기준값은, 상품 가격에 따른 실제 판매 수량 분포에 기초하여 결정될 수 있다.In an alternative embodiment, the at least one price reference value may be determined based on a distribution of actual sales quantity according to product price.

대안적인 실시예에서 상기 적어도 하나의 가격 기준값은, 상기 상품 수량 분포로부터 계산되는 통계값에 기초하되, 상품 가격에 따른 실제 판매 수량 분포에 추가적으로 기초하여 결정될 수 있다.In an alternative embodiment, the at least one price reference value may be determined based on a statistical value calculated from the product quantity distribution, but additionally based on the actual sales quantity distribution according to the product price.

대안적인 실시예에서, 상기 방법은, 상기 상품 데이터 셋에 기초하여 상태 정보를 산출하는 단계를 더 포함하고, 상기 평가 정보를 생성하는 단계는, 상기 상태 정보에 추가적으로 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성하는 단계를 포함할 수 있다.In an alternative embodiment, the method further includes calculating status information based on the product data set, and generating the evaluation information further based on the status information included in the product data set. It may include generating evaluation information corresponding to at least one product data.

대안적인 실시예에서, 상기 상태 정보를 산출하는 단계는, 적어도 하나의 상태 인덱스에 기초하여 상기 상품 데이터 셋에 포함된 각각의 상품 데이터에 대한 상태 점수를 연산하는 단계; 및 적어도 하나의 상태 보조 인덱스에 기초하여 상기 연산된 상태 점수를 보정하는 단계를 포함할 수 있다.In an alternative embodiment, calculating the status information may include calculating a status score for each product data included in the product data set based on at least one status index; and correcting the calculated health score based on at least one health assistance index.

대안적인 실시예에서, 상기 상태 보조 인덱스에 포함된 비교 결과 토큰은, 상기 상태 인덱스에 포함된 비교 결과 토큰을 수식하는 문자열일 수 있다.In an alternative embodiment, the comparison result token included in the state auxiliary index may be a string that modifies the comparison result token included in the state index.

대안적인 실시예에서, 상기 상태 점수를 보정하는 단계는, 상기 상태 인덱스에 대응되는 비교 결과 토큰과 상기 상태 보조 인덱스에 대응되는 비교 결과 토큰 사이의 토큰 거리에 기초하여 수행될 수 있다.In an alternative embodiment, correcting the state score may be performed based on a token distance between a comparison result token corresponding to the state index and a comparison result token corresponding to the state auxiliary index.

대안적인 실시예에서, 상기 토큰 거리에 기초하여 상태 점수를 보정하는 단계는, 상기 토큰 거리가 가까울수록 보정으로 인한 상태 점수 변동폭이 증가되는 방식으로 수행될 수 있다.In an alternative embodiment, correcting the status score based on the token distance may be performed in such a way that as the token distance becomes closer, the amount of change in the status score due to correction increases.

대안적인 실시예에서, 상기 방법은, 상기 상품 데이터 셋에 기초하여 판매자 정보를 산출하는 단계를 더 포함하고, 상기 평가 정보를 생성하는 단계는, 상기 판매자 정보에 추가적으로 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성하는 단계를 포함할 수 있다.In an alternative embodiment, the method further includes calculating seller information based on the product data set, and generating the evaluation information is further based on the seller information included in the product data set. It may include generating evaluation information corresponding to at least one product data.

전술한 바와 같은 과제를 실현하기 위한 본 개시의 일 실시예에 따라 컴퓨터 판독가능 저장 매체에 저장된 컴퓨터 프로그램이 개시된다. 상기 컴퓨터 프로그램은 하나 이상의 프로세서에서 실행되는 경우, 상품 정보를 평가하기 위한 이하의 동작들을 수행하도록 하며, 상기 동작들은: 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성하는 동작; 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출하는 동작; 및 상기 가격 정보에 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성하는 동작을 포함할 수 있다.According to an embodiment of the present disclosure for realizing the above-described object, a computer program stored in a computer-readable storage medium is disclosed. When the computer program is executed on one or more processors, it performs the following operations for evaluating product information, which operations include: generating standardized product data corresponding to unstructured product text; An operation of calculating price information based on a product data set including a plurality of product data; and generating evaluation information corresponding to at least one product data included in the product data set based on the price information.

전술한 바와 같은 과제를 실현하기 위한 본 개시의 일 실시예에 따라 상품 정보 평가 장치가 개시된다. 상기 장치는 하나 이상의 프로세서; 메모리; 및 네트워크를 포함하고, 그리고 상기 하나 이상의 프로세서는, 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성하고, 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출하고, 그리고 상기 가격 정보에 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성할 수 있다.A product information evaluation device is disclosed according to an embodiment of the present disclosure for realizing the above-described problem. The device may include one or more processors; Memory; and a network, wherein the one or more processors generate standardized product data corresponding to unstructured product text, calculate price information based on a product data set including a plurality of product data, and the price information. Based on this, evaluation information corresponding to at least one product data included in the product data set can be generated.

본 개시는 보다 직관적이고 효율적인 방식의 상품 정보 평가 방법을 제공할 수 있다.The present disclosure can provide a method for evaluating product information in a more intuitive and efficient manner.

도 1은 본 개시의 일 실시예에 따른 상품 정보를 평가하기 위한 컴퓨팅 장치의 블록 구성도이다.
도 2는 본 개시의 일 실시예에 따른 '가격에 따른 상품 수량 분포' 데이터를 시각적으로 도시한 예시도이다.
도 3은 본 개시의 일 실시예에 따른 '가격에 따른 실제 판매 수량' 데이터를 시각적으로 도시한 예시도이다.
도 4는 본 개시의 일 실시예에 따른 '가격에 따른 상품 수량 분포' 및 '가격에 따른 실제 판매 수량' 데이터를 시각적으로 도시한 예시도이다.
도 5는 본 개시의 일 실시예에 따라 상품 정보를 평가하기 위한 과정을 나타내는 순서도이다.
도 6은 본 개시의 다른 일 실시예에 따라 상품 정보를 평가하기 위한 과정을 나타내는 순서도이다.
도 7은 본 개시의 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도이다.1 is a block diagram of a computing device for evaluating product information according to an embodiment of the present disclosure.
Figure 2 is an example diagram visually illustrating 'product quantity distribution according to price' data according to an embodiment of the present disclosure.
Figure 3 is an example diagram visually illustrating 'actual sales quantity according to price' data according to an embodiment of the present disclosure.
Figure 4 is an example diagram visually illustrating 'product quantity distribution according to price' and 'actual sales quantity according to price' data according to an embodiment of the present disclosure.
Figure 5 is a flowchart showing a process for evaluating product information according to an embodiment of the present disclosure.
Figure 6 is a flowchart showing a process for evaluating product information according to another embodiment of the present disclosure.
7 is a brief, general schematic diagram of an example computing environment in which embodiments of the present disclosure may be implemented.

다양한 실시예들이 이제 도면을 참조하여 설명된다. 본 명세서에서, 다양한 설명들이 본 개시의 이해를 제공하기 위해서 제시된다. 그러나, 이러한 실시예들은 이러한 구체적인 설명 없이도 실행될 수 있음이 명백하다.Various embodiments are now described with reference to the drawings. In this specification, various descriptions are presented to provide an understanding of the disclosure. However, it is clear that these embodiments may be practiced without these specific descriptions.

본 명세서에서 사용되는 용어 "컴포넌트", "모듈", "시스템" 등은 컴퓨터-관련 엔티티, 하드웨어, 펌웨어, 소프트웨어, 소프트웨어 및 하드웨어의 조합, 또는 소프트웨어의 실행을 지칭한다. 예를 들어, 컴포넌트는 프로세서상에서 실행되는 처리과정(procedure), 프로세서, 객체, 실행 스레드, 프로그램, 및/또는 컴퓨터일 수 있지만, 이들로 제한되는 것은 아니다. 예를 들어, 컴퓨팅 장치에서 실행되는 애플리케이션 및 컴퓨팅 장치 모두 컴포넌트일 수 있다. 하나 이상의 컴포넌트는 프로세서 및/또는 실행 스레드 내에 상주할 수 있다. 일 컴포넌트는 하나의 컴퓨터 내에 로컬화 될 수 있다. 일 컴포넌트는 2개 이상의 컴퓨터들 사이에 분배될 수 있다. 또한, 이러한 컴포넌트들은 그 내부에 저장된 다양한 데이터 구조들을 갖는 다양한 컴퓨터 판독가능한 매체로부터 실행할 수 있다. 컴포넌트들은 예를 들어 하나 이상의 데이터 패킷들을 갖는 신호(예를 들면, 로컬 시스템, 분산 시스템에서 다른 컴포넌트와 상호작용하는 하나의 컴포넌트로부터의 데이터 및/또는 신호를 통해 다른 시스템과 인터넷과 같은 네트워크를 통해 전송되는 데이터)에 따라 로컬 및/또는 원격 처리들을 통해 통신할 수 있다.As used herein, the terms “component,” “module,” “system,” and the like refer to a computer-related entity, hardware, firmware, software, a combination of software and hardware, or an implementation of software. For example, a component may be, but is not limited to, a process running on a processor, a processor, an object, a thread of execution, a program, and/or a computer. For example, both an application running on a computing device and the computing device can be a component. One or more components may reside within a processor and/or thread of execution. A component may be localized within one computer. A component may be distributed between two or more computers. Additionally, these components can execute from various computer-readable media having various data structures stored thereon. Components can transmit signals, for example, with one or more data packets (e.g., data and/or signals from one component interacting with other components in a local system, a distributed system, to other systems and over a network such as the Internet). Depending on the data being transmitted, they may communicate through local and/or remote processes.

더불어, 용어 "또는"은 배타적 "또는"이 아니라 내포적 "또는"을 의미하는 것으로 의도된다. 즉, 달리 특정되지 않거나 문맥상 명확하지 않은 경우에, "X는 A 또는 B를 이용한다"는 자연적인 내포적 치환 중 하나를 의미하는 것으로 의도된다. 즉, X가 A를 이용하거나; X가 B를 이용하거나; 또는 X가 A 및 B 모두를 이용하는 경우, "X는 A 또는 B를 이용한다"가 이들 경우들 어느 것으로도 적용될 수 있다. 또한, 본 명세서에 사용된 "및/또는"이라는 용어는 열거된 관련 아이템들 중 하나 이상의 아이템의 가능한 모든 조합을 지칭하고 포함하는 것으로 이해되어야 한다.Additionally, the term “or” is intended to mean an inclusive “or” and not an exclusive “or.” That is, unless otherwise specified or clear from context, “X utilizes A or B” is intended to mean one of the natural implicit substitutions. That is, either X uses A; X uses B; Or, if X uses both A and B, “X uses A or B” can apply to either of these cases. Additionally, the term “and/or” as used herein should be understood to refer to and include all possible combinations of one or more of the related listed items.

또한, "포함한다" 및/또는 "포함하는"이라는 용어는, 해당 특징 및/또는 구성요소가 존재함을 의미하는 것으로 이해되어야 한다. 다만, "포함한다" 및/또는 "포함하는"이라는 용어는, 하나 이상의 다른 특징, 구성요소 및/또는 이들의 그룹의 존재 또는 추가를 배제하지 않는 것으로 이해되어야 한다. 또한, 달리 특정되지 않거나 단수 형태를 지시하는 것으로 문맥상 명확하지 않은 경우에, 본 명세서와 청구범위에서 단수는 일반적으로 "하나 또는 그 이상"을 의미하는 것으로 해석되어야 한다.Additionally, the terms “comprise” and/or “comprising” should be understood to mean that the corresponding feature and/or element is present. However, the terms “comprise” and/or “comprising” should be understood as not excluding the presence or addition of one or more other features, elements and/or groups thereof. Additionally, unless otherwise specified or the context is clear to indicate a singular form, the singular terms herein and in the claims should generally be construed to mean “one or more.”

그리고, “A 또는 B 중 적어도 하나”이라는 용어는, “A만을 포함하는 경우”, “B 만을 포함하는 경우”, “A와 B의 구성으로 조합된 경우”를 의미하는 것으로 해석되어야 한다. And, the term “at least one of A or B” should be interpreted to mean “if it contains only A,” “if it contains only B,” or “if it is a combination of A and B.”

당업자들은 추가적으로 여기서 개시된 실시예들과 관련되어 설명된 다양한 예시적 논리적 블록들, 구성들, 모듈들, 회로들, 수단들, 로직들, 및 알고리즘 단계들이 전자 하드웨어, 컴퓨터 소프트웨어, 또는 양쪽 모두의 조합들로 구현될 수 있음을 인식해야 한다. 하드웨어 및 소프트웨어의 상호교환성을 명백하게 예시하기 위해, 다양한 예시적 컴포넌트들, 블록들, 구성들, 수단들, 로직들, 모듈들, 회로들, 및 단계들은 그들의 기능성 측면에서 일반적으로 위에서 설명되었다. 그러한 기능성이 하드웨어로 또는 소프트웨어로서 구현되는지 여부는 전반적인 시스템에 부과된 특정 어플리케이션(application) 및 설계 제한들에 달려 있다. 숙련된 기술자들은 각각의 특정 어플리케이션들을 위해 다양한 방법들로 설명된 기능성을 구현할 수 있다. 다만, 그러한 구현의 결정들이 본 개시내용의 영역을 벗어나게 하는 것으로 해석되어서는 안된다.Those skilled in the art will additionally recognize that the various illustrative logical blocks, components, modules, circuits, means, logic, and algorithm steps described in connection with the embodiments disclosed herein may be implemented using electronic hardware, computer software, or a combination of both. It must be recognized that it can be implemented with To clearly illustrate the interchangeability of hardware and software, various illustrative components, blocks, configurations, means, logics, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented in hardware or software will depend on the specific application and design constraints imposed on the overall system. A skilled technician can implement the described functionality in a variety of ways for each specific application. However, such implementation decisions should not be construed as causing a departure from the scope of the present disclosure.

제시된 실시예들에 대한 설명은 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 발명을 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이다. 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 발명은 여기에 제시된 실시예 들로 한정되는 것이 아니다. 본 발명은 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다. The description of the presented embodiments is provided to enable anyone skilled in the art to use or practice the present invention. Various modifications to these embodiments will be apparent to those skilled in the art. The general principles defined herein may be applied to other embodiments without departing from the scope of the disclosure. Therefore, the present invention is not limited to the embodiments presented herein. The present invention is to be interpreted in the broadest scope consistent with the principles and novel features presented herein.

도 1은 본 개시의 일 실시예에 따른 상품 정보를 평가하기 위한 컴퓨팅 장치의 블록 구성도이다.1 is a block diagram of a computing device for evaluating product information according to an embodiment of the present disclosure.

도 1에 도시된 컴퓨팅 장치(100)의 구성은 간략화 하여 나타낸 예시일 뿐이다. 본 개시의 일 실시예에서 컴퓨팅 장치(100)는 컴퓨팅 장치(100)의 컴퓨팅 환경을 수행하기 위한 다른 구성들이 포함될 수 있고, 개시된 구성들 중 일부만이 컴퓨팅 장치(100)를 구성할 수도 있다. The configuration of the computing device 100 shown in FIG. 1 is only a simplified example. In one embodiment of the present disclosure, the computing device 100 may include different configurations for performing the computing environment of the computing device 100, and only some of the disclosed configurations may configure the computing device 100.

컴퓨팅 장치(100)는 프로세서(110), 메모리(130), 네트워크부(150)를 포함할 수 있다.The computing device 100 may include a processor 110, a memory 130, and a network unit 150.

프로세서(110)는 하나 이상의 코어로 구성될 수 있으며, 컴퓨팅 장치의 중앙 처리 장치(CPU: central processing unit), 범용 그래픽 처리 장치 (GPGPU: general purpose graphics processing unit), 텐서 처리 장치(TPU: tensor processing unit) 등의 데이터 분석, 딥러닝을 위한 프로세서를 포함할 수 있다. 프로세서(110)는 비정형 상품 텍스트를 처리하여 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성할 수 있다. 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출할 수 있다. 상품 데이터 셋은 적어도 하나의 상품 데이터를 포함할 수 있다. 프로세서(110)는 산출된 가격 정보에 기초하여 상품 데이터 각각에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 상태 정보 또는 판매자 정보 중 적어도 하나를 추가적으로 산출할 수 있다. 프로세서(110)가 상태 정보 또는 판매자 정보 중 적어도 하나를 추가적으로 산출할 경우, 프로세서(110)는 가격 정보에 기초하되, 상태 정보 또는 판매자 정보 중 적어도 하나의 추가적인 정보에 추가적으로 기초하여 상품 데이터에 대응되는 평가 정보를 생성할 수도 있다.The processor 110 may be composed of one or more cores, and may include a central processing unit (CPU), a general purpose graphics processing unit (GPGPU), and a tensor processing unit (TPU) of a computing device. unit) may include a processor for data analysis and deep learning. The processor 110 may process the unstructured product text to generate standardized product data corresponding to the unstructured product text. The processor 110 may calculate price information based on a product data set including a plurality of product data. A product data set may include at least one product data. The processor 110 may generate evaluation information corresponding to each piece of product data based on the calculated price information. The processor 110 may additionally calculate at least one of status information or seller information based on a product data set including a plurality of product data. When the processor 110 additionally calculates at least one of the status information or the seller information, the processor 110 calculates the product data corresponding to the product data based on the price information and additionally based on at least one additional information of the status information or the seller information. Evaluation information can also be generated.

본 개시의 일 실시예에 따르면, 메모리(130)는 프로세서(110)가 생성하거나 결정한 임의의 형태의 정보 및 네트워크부(150)가 수신한 임의의 형태의 정보를 저장할 수 있다. 본 개시의 일 실시예에 따르면, 메모리(130)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(Random Access Memory, RAM), SRAM(Static Random Access Memory), 롬(Read-Only Memory, ROM), EEPROM(Electrically Erasable Programmable Read-Only Memory), PROM(Programmable Read-Only Memory), 자기 메모리, 자기 디스크, 광디스크 중 적어도 하나의 타입의 저장매체를 포함할 수 있다. 컴퓨팅 장치(100)는 인터넷(internet) 상에서 상기 메모리(130)의 저장 기능을 수행하는 웹 스토리지(web storage)와 관련되어 동작할 수도 있다. 전술한 메모리에 대한 기재는 예시일 뿐, 본 개시는 이에 제한되지 않는다.According to an embodiment of the present disclosure, the memory 130 may store any type of information generated or determined by the processor 110 and any type of information received by the network unit 150. According to an embodiment of the present disclosure, the memory 130 is a flash memory type, hard disk type, multimedia card micro type, or card type memory (e.g. (e.g. SD or -Only Memory), and may include at least one type of storage medium among magnetic memory, magnetic disk, and optical disk. The computing device 100 may operate in connection with web storage that performs a storage function of the memory 130 on the Internet. The description of the memory described above is merely an example, and the present disclosure is not limited thereto.

본 개시의 일 실시예에 따른 네트워크부(150)는 임의의 형태의 데이터 및 신호 등을 송수신할 수 있는 임의의 유무선 통신 네트워크가 본 개시 내용에서 표현되는 네트워크에 포함될 수 있다. 본 개시에서 네트워크부(150)는 유선 및 무선 등과 같은 그 통신 양태를 가리지 않고 구성될 수 있으며, 단거리 통신망(PAN: Personal Area Network), 근거리 통신망(WAN: Wide Area Network) 등 다양한 통신망으로 구성될 수 있다. 또한, 상기 네트워크는 공지의 월드와이드웹(WWW: World Wide Web)일 수 있으며, 적외선(IrDA: Infrared Data Association) 또는 블루투스(Bluetooth)와 같이 단거리 통신에 이용되는 무선 전송 기술을 이용할 수도 있다. 본 명세서에서 설명된 기술들은 위에서 언급된 네트워크들뿐만 아니라, 다른 네트워크들에서도 사용될 수 있다. The network unit 150 according to an embodiment of the present disclosure may include any wired or wireless communication network capable of transmitting and receiving arbitrary types of data and signals, etc., as expressed in the present disclosure. In the present disclosure, the network unit 150 may be configured regardless of the communication mode, such as wired or wireless, and may be composed of various communication networks such as a personal area network (PAN) and a wide area network (WAN). You can. Additionally, the network may be the well-known World Wide Web (WWW), or may use wireless transmission technology used for short-distance communication, such as Infrared Data Association (IrDA) or Bluetooth. The techniques described herein can be used in the networks mentioned above, as well as other networks.

도 1에 도시된 컴퓨팅 장치(100)의 구성은 간략화 하여 나타낸 예시일 뿐이다. 본 개시의 일 실시예에서 컴퓨팅 장치(100)는 컴퓨팅 장치(100)의 컴퓨팅 환경을 수행하기 위한 다른 구성들이 포함될 수 있고, 개시된 구성들 중 일부만이 컴퓨팅 장치(100)를 구성할 수도 있다. 도 1에 도시되지 않았으나 본 개시에 따른 컴퓨팅 장치(100)는 사용자의 입력을 수신하기 위한 입력부를 추가적으로 포함할 수 있다.The configuration of the computing device 100 shown in FIG. 1 is only a simplified example. In one embodiment of the present disclosure, the computing device 100 may include different configurations for performing the computing environment of the computing device 100, and only some of the disclosed configurations may configure the computing device 100. Although not shown in FIG. 1, the computing device 100 according to the present disclosure may additionally include an input unit for receiving a user's input.

본 개시에 따른 컴퓨팅 장치(100)는 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성할 수 있다. 본 개시에 따른 프로세서(110)는 비정형 상품 텍스트를 입력 받아 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성할 수 있다.The computing device 100 according to the present disclosure can generate standardized product data corresponding to unstructured product text. The processor 110 according to the present disclosure may receive unstructured product text as input and generate standardized product data corresponding to the unstructured product text.

본 개시내용에 있어서 비정형 상품 텍스트는 상품과 관련된 내용을 포함할 수 있다. 상품과 관련된 내용은 예를 들어, 상품코드, 상품 브랜드, 상품 구매 가격, 상품 판매 가격, 상품과 관련된 설명문 등을 포함할 수 있다. 비정형 상품 텍스트에 포함된 내용은 제목 영역, 본문 영역, 기타 영역으로 구분될 수도 있다. 제목 영역에는 예를 들어, 게시글의 제목, 요약 문장 등이 포함될 수 있다. 본문 영역에는 예를 들어, 상품을 설명하기 위한 문장, 문단 등이 포함될 수 있다. 기타 영역에는 예를 들어, 비정형 상품 텍스트의 작성 시간, 수정 시간, 로그 정보, 메타 정보 등이 포함될 수 있다.In the present disclosure, the unstructured product text may include content related to the product. Content related to the product may include, for example, product code, product brand, product purchase price, product sales price, product-related explanatory text, etc. Content included in unstructured product text may be divided into a title area, body area, and other areas. The title area may include, for example, the title of the post, a summary sentence, etc. The body area may include, for example, sentences and paragraphs to describe the product. Other areas may include, for example, creation time, modification time, log information, meta information, etc. of unstructured product text.

본 개시의 일 실시예에 따른 프로세서(110)는 평가 리스트에 기초하여 비정형 상품 텍스트로부터 정형화된 상품 데이터를 생성할 수 있다. 본 개시내용에 있어서 "정형화된 상품 데이터"라는 용어는 간략히"상품 데이터"라는 용어와 상호 교환적으로 사용될 수 있다. 본 개시의 평가 리스트는 적어도 하나의 평가 인덱스를 포함할 수 있다. 평가 리스트에 둘 이상의 평가 인덱스가 포함될 경우, 하나의 평가 인덱스는 다른 하나의 평가 인덱스에 종속된 평가 인덱스일 수 있다. 이 때, 종속된 평가 인덱스는 '하위 평가 인덱스', 나머지 하나의 평가 인덱스는 '상위 평가 인덱스'와 같이 호칭될 수 있다. 본 개시에 따른 평가 리스트는 복수의 평가 인덱스를 포함할 수 있으며, 복수의 평가 인덱스는, 적어도 하나의 상위 평가 인덱스 및 각 상위 평가 인덱스에 적어도 하나의 하위 평가 인덱스가 포함된 형태인, 트리 형태의 데이터 구조로 이뤄질 수 있다. 본 개시에 따른 평가 리스트에 포함된 적어도 하나의 평가 인덱스에는 적어도 하나의 비교 대상 토큰이 할당될 수 있다. 일례로, 평가 리스트는 A 인덱스, B 인덱스, C 인덱스를 포함할 수 있다. 이 때 A 인덱스는 a-1 비교 대상 토큰, a-2 비교 대상 토큰을 포함할 수 있다. B 인덱스는 b-1 비교 대상 토큰, b-2 비교 대상 토큰을 포함할 수 있다. 구체적 일 예시에서, 평가 리스트는 색상 인덱스, 소재 인덱스, 구매처 인덱스 등이 포함될 수 있다. 이 때 색상 인덱스는 "빨강", "골드", "블랙" 등의 비교 대상 토큰들이 포함될 수 있고 소재 인덱스에는 "벨벳", "나일론", "캐시미어" 등 비교 대상 토큰들이 포함될 수 있으며, 구매처 인덱스에는 "고속터미널 지하상가", "백화점", "아울렛" 등 비교 대상 토큰들이 포함될 수 있다. 전술한 평가 인덱스의 종류 및 비교 대상 토큰의 종류에 관한 예시는 본 개시의 실시를 위한 일 예시일 뿐 본 개시는 다양한 종류의 평가 인덱스 및 비교 대상 토큰들을 포함한다.The processor 110 according to an embodiment of the present disclosure may generate standardized product data from unstructured product text based on the evaluation list. In the present disclosure, the term “standardized product data” may be used interchangeably with the term “product data” for brevity. The evaluation list of the present disclosure may include at least one evaluation index. When the evaluation list includes two or more evaluation indexes, one evaluation index may be an evaluation index dependent on another evaluation index. At this time, the dependent evaluation index may be called a 'lower evaluation index', and the remaining evaluation index may be called a 'higher evaluation index'. The evaluation list according to the present disclosure may include a plurality of evaluation indexes, and the plurality of evaluation indexes are in the form of a tree in which at least one upper evaluation index and each upper evaluation index includes at least one lower evaluation index. It can be made up of data structures. At least one comparison target token may be assigned to at least one evaluation index included in the evaluation list according to the present disclosure. For example, the evaluation list may include the A index, B index, and C index. At this time, the A index may include a-1 comparison target token and a-2 comparison target token. The B index may include b-1 comparison target token and b-2 comparison target token. In a specific example, the evaluation list may include a color index, a material index, a purchase location index, etc. At this time, the color index may include tokens to be compared such as "red", "gold", and "black", the material index may include tokens to be compared such as "velvet", "nylon", and "cashmere", and the purchase place index. may include comparison tokens such as "Express Terminal Underground Shopping Mall", "Department Store", and "Outlet". The above-described examples of types of evaluation indexes and tokens to be compared are merely examples for implementing the present disclosure, and the present disclosure includes various types of evaluation indexes and tokens to be compared.

본 개시의 일 실시예에 따라 프로세서(110)가 생성하는 상품 데이터는 적어도 하나의 비교 결과 토큰을 포함할 수 있다. 상기 비교 결과 토큰은, 평가 리스트에 포함된 적어도 하나의 평가 인덱스에 할당된 비교 대상 토큰을 비정형 상품 텍스트와 비교한 결과에 기초하여 획득될 수 있다. 본 개시내용에 있어서 "비교 대상 토큰"은 비정형 상품 텍스트의 평가를 위해 평가 리스트에 포함된 후보 문자열일 수 있다. 본 개시내용에 있어서 "비교 결과 토큰"은 비정형 상품 텍스트와 비교 대상 토큰을 비교한 결과 비정형 상품 텍스트로부터 획득되는 문자열일 수 있다. 본 개시의 상품 데이터에는 평가 리스트에 포함된 평가 인덱스의 적어도 일부가 동일하게 포함될 수 있다. 상품 데이터에 포함된 평가 인덱스는 평가 리스트에 포함된 평가 인덱스의 부분 집합일 수 있다. 상품 데이터에 포함된 평가 인덱스는 평가 리스트에 포함된 평가 인덱스 중 비정형 상품 텍스트에 존재하는 인덱스들로 구성될 수 있다. 상품 데이터에 포함된 각각의 평가 인덱스는 복수의 비교 결과 토큰을 포함할 수 있다. 상품 데이터에 포함된 비교 결과 토큰들의 집합은 평가 리스트에 포함된 비교 대상 토큰들의 집합의 부분집합일 수 있다. 상품 데이터에 포함된 비교 결과 토큰들은 각각 하나의 단일 평가 인덱스를 가질 수 있다. 즉, 평가 인덱스와 비교 결과 토큰의 데이터 연결관계는 One-to-Many 관계일 수 있다. Product data generated by the processor 110 according to an embodiment of the present disclosure may include at least one comparison result token. The comparison result token may be obtained based on a result of comparing a comparison target token assigned to at least one evaluation index included in the evaluation list with the unstructured product text. In the present disclosure, the “token to be compared” may be a candidate string included in an evaluation list for evaluation of unstructured product text. In the present disclosure, the “comparison result token” may be a string obtained from the unstructured product text as a result of comparing the unstructured product text and the comparison target token. The product data of the present disclosure may equally include at least part of the evaluation index included in the evaluation list. The evaluation index included in the product data may be a subset of the evaluation index included in the evaluation list. The evaluation index included in the product data may be composed of indexes that exist in the unstructured product text among the evaluation indexes included in the evaluation list. Each evaluation index included in product data may include a plurality of comparison result tokens. The set of comparison result tokens included in the product data may be a subset of the set of comparison target tokens included in the evaluation list. Comparison result tokens included in product data may each have a single evaluation index. In other words, the data connection relationship between the evaluation index and the comparison result token may be a One-to-Many relationship.

본 개시에 따른 일 예시에 있어서, 평가 리스트가 '브랜드 인덱스' 및 '상태 인덱스'를 평가 인덱스로서 포함한다고 가정한다. 또한, '브랜드 인덱스'에는 {"샤넬", "구찌", "버버리", "고야드"}로 구성된 비교 대상 토큰들이 포함되고 그리고 '상태 인덱스'에는 {"새상품", "거의 새", "약간 사용감", "생활 기스"}로 구성된 비교 대상 토큰들이 포함된다고 가정한다. 이 때, "새상품 고야드 남성 반지갑 판매합니다."라는 비정형 상품 텍스트가 프로세서(110)에 입력된 경우, 프로세서(110)는 상기 비정형 상품 텍스트와 '브랜드 인덱스'에 포함된 하나 이상의 비교 대상 토큰들을 비교함으로써 상기 비정형 상품 텍스트로부터 "고야드"라는 비교 결과 토큰을 획득할 수 있다. 또한, 프로세서(110)는 상기 비정형 상품 텍스트와 '상태 인덱스'에 포함된 하나 이상의 비교 대상 토큰들을 비교함으로써 상기 비정형 상품 텍스트로부터 "새상품"이라는 비교 결과 토큰을 획득할 수 있다. 그 결과 프로세서(110)는 "고야드"라는 비교 결과 토큰 및 "새상품"이라는 비교 결과 토큰을 포함하는 상기 비정형 상품 텍스트에 대응되는 상품 데이터를 생성할 수 있다. In an example according to the present disclosure, it is assumed that the evaluation list includes 'brand index' and 'status index' as evaluation indices. Additionally, the 'Brand Index' includes comparative tokens consisting of {"Chanel", "Gucci", "Burberry", "Goyard"} and the 'Condition Index' includes {"New", "Nearly New", " It is assumed that comparison target tokens consisting of "slight feeling of use" and "life wear" are included. At this time, when the atypical product text “Selling a new Goyard men’s half wallet” is input to the processor 110, the processor 110 compares the atypical product text with one or more comparison target tokens included in the ‘brand index’. By comparing, a comparison result token called “Goyard” can be obtained from the unstructured product text. Additionally, the processor 110 may obtain a comparison result token called “new product” from the unstructured product text by comparing the unstructured product text with one or more comparison target tokens included in the ‘state index’. As a result, the processor 110 may generate product data corresponding to the unstructured product text, including a comparison result token of “Goyard” and a comparison result token of “New Product.”

본 개시에 따른 다른 일 예시에 있어서, 프로세서(110)는 비정형 상품 텍스트 내에서 가격 인덱스에 포함된 비교 대상 토큰과 매칭되는 비교 결과 토큰을 검출한 후, 검출된 비교 결과 토큰에 후속하는 숫자를 가격 값으로 획득할 수 있다. 예를 들어, 평가 리스트는 '가격 인덱스'를 포함하고, '가격 인덱스'에는 {"판매가", "판매 가격", "희망 가격"}으로 구성된 비교 대상 토큰들이 포함될 수 있다. 이 때, "샤넬백 희망 가격 2,000,000원"라는 비정형 상품 텍스트가 프로세서(110)에 입력된 경우, 프로세서(110)는 가격 값에 "2,000,000"이라는 데이터가 포함된 상품 데이터를 생성할 수 있다. In another example according to the present disclosure, the processor 110 detects a comparison result token that matches the comparison target token included in the price index within the unstructured product text, and then sets the number following the detected comparison result token to the price. It can be obtained by value. For example, the evaluation list includes a 'price index', and the 'price index' may include comparison target tokens consisting of {"sale price", "sale price", and "desired price"}. At this time, when the unstructured product text “Chanel bag desired price 2,000,000 won” is input to the processor 110, the processor 110 may generate product data including data “2,000,000” in the price value.

본 개시의 일 실시예에 있어서 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출할 수 있다. 프로세서(110)는 동일 상품에 관한 상품 데이터를 포함하는 상품 데이터 셋으로부터 가격 정보를 산출할 수 있다. 본 개시내용에 있어서 가격 정보는 예를 들어, 가격 점수 및 가격에 대한 매수의견 등을 포함할 수 있다. 가격 점수는 서로 다른 가격 구간에 대해 부여될 수 있다. 가격 점수는 최소값과 최대값을 갖는 일정 범위 내의 실수(real number)일 수 있다. 가격에 대한 매수의견은 "비쌈", "적정", "저렴", "주의" 등의 내용 및 유사 내용을 포함할 수 있다.In one embodiment of the present disclosure, the processor 110 may calculate price information based on a product data set including a plurality of product data. The processor 110 may calculate price information from a product data set including product data about the same product. In the present disclosure, price information may include, for example, a price score and a purchase opinion regarding the price. Price scores can be awarded for different price ranges. The price score can be a real number within a certain range with a minimum and maximum value. Buying opinions on price may include words such as “expensive,” “reasonable,” “cheap,” “caution,” and similar words.

프로세서(110)는 가격에 따른 상품 수량 분포 및 적어도 하나의 가격 기준값에 기초하여 가격 정보를 산출할 수 있다. 프로세서(110)는 상품 데이터 셋에 포함된 특정 상품에 대한 복수의 가격 값들에 기초하여 가격에 따른 상품 수량 분포를 산출할 수 있다. 이하 도 2를 참조하여 구체적으로 설명한다.The processor 110 may calculate price information based on product quantity distribution according to price and at least one price reference value. The processor 110 may calculate product quantity distribution according to price based on a plurality of price values for a specific product included in the product data set. Hereinafter, it will be described in detail with reference to FIG. 2.

도 2는 본 개시의 일 실시예에 따른 '가격에 따른 상품 수량' 데이터를 시각적으로 도시한 예시도이다. 도 2는 '가격에 따른 상품 수량' 데이터를 이용한 프로세서(110)의 동작을 구체적으로 설명하기 위해 그 데이터를 시각적으로 표현한 것일 뿐 '가격에 따른 상품 수량' 데이터의 형식 또는 구조를 한정하지 않는다. 도 2의 그래프는 가격에 따른 수량 분포를 2차원 좌표에 도시한 도면이다. 도 2의 X축은 가격, Y축은 상품 수량을 의미한다. 상품 데이터의 모수가 충분한 경우, 도 2의 그래프와 같이 '가격에 따른 상품 수량'의 분포는 대략적인 정규 분포 형태를 가질 수 있다. 상품 데이터의 모수가 충분하지 않을 경우, 가격에 따른 상품 수량 분포는 정규 분포가 아닌 임의의 분포 형태를 가질 수도 있다. Figure 2 is an example diagram visually illustrating 'product quantity according to price' data according to an embodiment of the present disclosure. Figure 2 is a visual representation of the data to specifically explain the operation of the processor 110 using the 'product quantity according to price' data and does not limit the format or structure of the 'product quantity according to price' data. The graph in Figure 2 is a diagram showing the quantity distribution according to price in two-dimensional coordinates. The X-axis in Figure 2 represents the price, and the Y-axis represents the product quantity. If the parameters of the product data are sufficient, the distribution of 'product quantity according to price' may have an approximate normal distribution form, as shown in the graph of FIG. 2. If the parameters of product data are not sufficient, the distribution of product quantity according to price may have a random distribution form rather than a normal distribution.

프로세서(110)는 적어도 하나의 가격 기준값을 이용하여 가격 정보를 산출할 수 있다. 이하에서는 우선 본 개시내용에 따라 가격 기준값을 결정하는 방법을 설명한 이후 가격 기준값을 이용하여 가격 정보를 산출하는 방법을 설명한다. 본 개시에 따른 몇몇 실시예에 있어서 프로세서(110)는 상품 수량 분포로부터 계산되는 통계값에 기초하여 적어도 하나의 가격 기준값을 결정할 수 있다. The processor 110 may calculate price information using at least one price reference value. Below, a method of determining a price reference value according to the present disclosure will first be described, and then a method of calculating price information using the price reference value will be described. In some embodiments according to the present disclosure, the processor 110 may determine at least one price reference value based on a statistical value calculated from the product quantity distribution.

본 개시에 따른 가격 기준값을 결정하는 제 1 실시예에 있어서, 프로세서(110)는 상품 수량 분포로부터 전체 수량의 N(N은 양의 실수)퍼센트 및 (100-N)퍼센트를 구분하는 가격값을 가격 기준값으로 결정할 수 있다. 상기 가격 기준값은 복수 개일 수 있다. 다시 도 2를 참조하면, 도 2의 는 복수의 가격 기준값을 나타낸다. 프로세서(110)가 상품 수량 분포로부터 계산되는 통계값에 기초하여 가격 기준값을 결정한 경우, 가격 기준값 은 하위 20%(즉, 상위 80%)와 나머지를 구분하는 가격값일 수 있다. 가격 기준값 는 하위 40%(즉, 상위 60%)와 나머지를 구분하는 가격값일 수 있다. 또한, 가격 기준값 는 하위 60%(즉, 상위 40%)와 나머지를 구분하는 가격값일 수 있다. 계속된 동일 예시에서 가격 기준값 는 하위 80%(즉, 상위 20%)와 나머지를 구분하는 가격값일 수 있다. In a first embodiment of determining a price reference value according to the present disclosure, the processor 110 determines a price value that distinguishes N (N is a positive real number) percent and (100-N) percent of the total quantity from the product quantity distribution. It can be decided based on the price standard. There may be multiple price reference values. Referring again to Figure 2, in Figure 2 represents a plurality of price reference values. When the processor 110 determines the price reference value based on a statistical value calculated from the product quantity distribution, the price reference value may be the price value that distinguishes the bottom 20% (i.e., top 80%) from the rest. price threshold may be the price value that distinguishes the bottom 40% (i.e., top 60%) from the rest. Also, the price reference value may be the price value that distinguishes the bottom 60% (i.e., top 40%) from the rest. Price threshold in the same example continued: may be the price value that distinguishes the bottom 80% (i.e., top 20%) from the rest.

본 개시에 따른 가격 기준값을 결정하는 제 2 실시예에 있어서, 프로세서(110)는 상품 수량 분포로부터 전체 가격값의 중앙값을 결정한 후, 중앙값과의 상대적인 비율에 따라 적어도 하나의 가격 기준값을 결정할 수도 있다. 이 경우 첫번째 가격 기준값은 결정된 중앙값의 16%, 두번째 가격 기준값은 결정된 중앙값의 32%, 세번째 가격 기준값은 결정된 중앙값의 63% 등으로 결정될 수 있다.In the second embodiment of determining the price reference value according to the present disclosure, the processor 110 may determine the median of all price values from the product quantity distribution and then determine at least one price reference value according to the relative ratio with the median. . In this case, the first price reference value may be determined as 16% of the determined median value, the second price reference value may be determined as 32% of the determined median value, the third price reference value may be determined as 63% of the determined median value, etc.

본 개시에 따른 몇몇 실시예에 있어서, 프로세서(110)는 상품 가격에 따른 실제 판매 수량 분포에 기초하여 적어도 하나의 가격 기준값을 결정할 수 있다. 본 개시내용에 따른 "실제 판매 수량"은 실 거래된 상품의 수량을 의미하며 상품 데이터에 포함된 판매 완료 정보에 기초하여 계산될 수 있다. 프로세서(110)는 비정형 상품 텍스트에 포함된 문자열 중 판매 완료를 의미하는 하나 이상의 문자열을 식별하고 이에 기초하여 상품 데이터의 판매 완료 정보를 생성할 수 있다. 판매 완료를 의미하는 하나 이상의 문자열에는 예를 들어, "판매 완료", "판완", "거래종료", "거래완료" 등이 포함될 수 있다. 프로세서(110)는 비정형 상품 텍스트에 포함된 판매 완료 시각에 기초하여 상품 데이터의 판매 완료 정보를 생성할 수도 있다. 비정형 상품 텍스트에 포함된 판매 완료 시각은 비정형 상품 텍스트를 작성한 작성자에 의해 기입된 시간 정보일 수 있다. 프로세서(110)는 복수의 상품 데이터가 서로 중복되고, 중복되는 상품 데이터의 일부에 대해서만 판매 완료 정보가 존재하는 경우, 상기 판매 완료 정보가 존재하는 일부 상품 데이터에 대해서는 실제 판매 수량을 계산하지 않을 수 있다. 이 경우 해당 상품 데이터는, 실제로 판매 완료된 데이터가 아니라 작성자가 비정형 상품 텍스트를 중복하여 작성한 결과 생성된 데이터일 수 있다. 프로세서(110)는 복수의 상품 데이터가 서로 중복되는지 판단하기 위해 제 1 상품 데이터와 제 2 상품 데이터에 포함된 비교 결과 토큰의 중복 비율을 계산할 수 있다. 프로세서(110)는 제 1 상품 데이터와 제 2 상품 데이터에 포함된 비교 결과 토큰의 중복 비율을 사전 결정된 중복 임계 비율과 비교하여 중복 데이터 여부를 결정할 수 있다. In some embodiments according to the present disclosure, the processor 110 may determine at least one price reference value based on actual sales quantity distribution according to product price. “Actual sales quantity” according to the present disclosure refers to the quantity of actually traded products and can be calculated based on sales completion information included in product data. The processor 110 may identify one or more strings indicating completion of sale among the strings included in the unstructured product text and generate sale completion information of the product data based on this. One or more strings indicating the completion of the sale may include, for example, “sale completed”, “sale completed”, “transaction closed”, “transaction completed”, etc. The processor 110 may generate sale completion information of product data based on the sale completion time included in the unstructured product text. The sales completion time included in the unstructured product text may be time information entered by the author of the unstructured product text. If a plurality of product data overlaps with each other and sales completion information exists only for part of the overlapping product data, the processor 110 may not calculate the actual sales quantity for some product data for which the sales completion information exists. there is. In this case, the product data may not be data that has actually been sold, but data created as a result of the author duplicating unstructured product text. The processor 110 may calculate an overlap ratio of comparison result tokens included in the first product data and the second product data to determine whether the plurality of product data overlap each other. The processor 110 may compare the overlap ratio of comparison result tokens included in the first product data and the second product data with a predetermined overlap threshold ratio to determine whether there is duplicate data.

도 3은 본 개시의 일 실시예에 따른 '가격에 따른 실제 판매 수량' 데이터를 시각적으로 도시한 예시도이다. 도 3은 데이터를 이용한 프로세서(110)의 동작을 구체적으로 설명하기 위해 그 데이터를 시각적으로 표현한 것일 뿐, '가격에 따른 실제 판매 수량' 데이터의 형식 또는 구조를 한정하지 않는다. 도 3의 X축은 가격, Y축은 실제 판매 수량을 의미한다. Figure 3 is an example diagram visually illustrating 'actual sales quantity according to price' data according to an embodiment of the present disclosure. Figure 3 is only a visual representation of the data to specifically explain the operation of the processor 110 using the data, and does not limit the format or structure of the 'actual sales quantity according to price' data. The X-axis in Figure 3 represents the price, and the Y-axis represents the actual sales quantity.

본 개시에 따른 가격 기준값을 결정하는 제 3 실시예에 있어서, 프로세서(110)는 가격에 따른 실제 판매 수량이 최대값을 갖는 지점을 기준으로 적어도 하나의 가격 기준값을 결정할 수 있다. 프로세서(110)는 실제 판매 수량이 최대값을 갖는 가격(Max_price)으로부터 사전 결정된 가격 차이(d, d는 양의 실수)만큼 떨어진 두 가격값(e.g. Max_price - d, Max_price + d)을 가격 기준값으로 결정할 수 있다. 도 3의 는 본 개시의 제 3 실시예에 따른 복수의 가격 기준값을 나타낸다. 본 개시내용에 따라 '가격에 따른 실제 판매 수량'을 고려하여 가격 기준값을 설정하고 이에 따라 가격 정보를 결정하는 방법은 실제 판매자와 구매자 간에 발생한 실 거래 정보를 반영하여 가격 정보를 합리적으로 결정하는 장점을 갖는다.In a third embodiment of determining a price reference value according to the present disclosure, the processor 110 may determine at least one price reference value based on a point where the actual sales quantity according to price has a maximum value. The processor 110 uses two price values (eg Max_price - d, Max_price + d) that are separated by a predetermined price difference (d, d is a positive real number) from the price (Max_price) at which the actual sales quantity has the maximum value as the price reference value. You can decide. in Figure 3 represents a plurality of price reference values according to the third embodiment of the present disclosure. According to the present disclosure, the method of setting a price reference value in consideration of 'actual sales quantity according to price' and determining price information accordingly has the advantage of reasonably determining price information by reflecting actual transaction information that occurred between actual sellers and buyers. has

본 개시에 따른 몇몇 실시예에 있어서, 프로세서(110)는 상품 수량 분포로부터 계산되는 통계값에 기초하여 적어도 하나의 가격 기준값을 결정하되, 상품 가격에 따른 실제 판매 수량에 추가적으로 기초하여 적어도 하나의 가격 기준값을 결정할 수 있다. In some embodiments according to the present disclosure, the processor 110 determines at least one price reference value based on a statistical value calculated from the product quantity distribution, but additionally determines at least one price based on the actual sales quantity according to the product price. A standard value can be determined.

본 개시에 따른 가격 기준값을 결정하는 제 4 실시예에 있어서, 프로세서(110)는 가격에 따른 상품 수량과 가격에 따른 실제 판매 수량을 합산하고 합산 수량 분포에 기초하여 적어도 하나의 가격 기준값을 결정할 수 있다. 프로세서(110)는 합산 수량 분포에 기초하되, 전술한 가격 기준값 결정을 위한 제 1 실시예 내지 제 3 실시예 중 적어도 하나의 실시예를 적용하여 가격 기준값을 결정할 수 있다. In the fourth embodiment of determining the price reference value according to the present disclosure, the processor 110 may add the product quantity according to the price and the actual sales quantity according to the price and determine at least one price reference value based on the summed quantity distribution. there is. The processor 110 may determine the price reference value based on the summed quantity distribution by applying at least one embodiment of the first to third embodiments for determining the price reference value described above.

본 개시에 따른 가격 기준값을 결정하는 제 5 실시예에 있어서, 프로세서(110)는 가격에 따른 상품 수량 분포에 기초하여 1차적으로 적어도 하나의 가격 기준값을 결정한 후, 가격에 따른 실제 판매 수량 분포에 기초하여 2차적으로 가격 기준값을 결정할 수 있다. 제 5 실시예에 관하여는 도 4를 참조하여 설명한다. 도 4는 본 개시의 일 실시예에 따른 '가격에 따른 상품 수량'데이터 및 '가격에 따른 실제 판매 수량' 데이터를 시각적으로 도시한 예시도이다. 도 2 및 3과 마찬가지로, 도 4는 복수의 데이터를 이용한 프로세서(110)의 동작을 구체적으로 설명하기 위해 그 데이터를 시각적으로 표현한 것일 뿐, 본 개시에 따른 데이터의 형식 또는 구조를 한정하지 않는다. 도 4의 제 1 그래프(510)는 가격에 따른 상품 수량 분포 함수를 나타낸다. 도 4의 제 2 그래프(530)는 가격에 따른 실제 판매 수량 분포 함수를 나타낸다. 도 4에 있어서, 프로세서(110)는 1차적으로 제 1 그래프(510)에 기초하여 로 표현되는 두 가격 기준값을 결정할 수 있다. 프로세서(110)는 전술한 제 1 실시예 또는 제 2 실시예에 기초하여 제 1 그래프(510)로부터 두 가격 기준값을 결정할 수도 있다. 프로세서(110)는 상기 결정된 두 가격 기준값에 의해 정의되는 구간(i.e. ) 내에서, 제 2 그래프(530)에 기초하여 2차적으로 가격 기준값을 결정할 수 있다. 프로세서(110)는 제 2 그래프(530)에 기초하여 2차적으로 새로운 가격 기준값을 추가하거나 기존의 가격 기준값을 변경할 수 있다. 도 5의 k는 가격에 따른 실제 판매 수량을 나타내는 제 2 그래프(530)와 관련하여 프로세서(110)에 의해 산정된 실제 판매 수량의 임계값일 수 있다. 프로세서(110)는 제 2 그래프(530)에서 상기 결정된 두 가격 기준값에 의해 정의되는 구간(i.e. ) 내의 최대값과 최소값에 기초하여 실제 판매 수량의 임계값(e.g. k)을 결정할 수 있다. 예를 들어, 도 5의 제 2 그래프(530)에서 구간 내 실제 판매 수량 최대값은 100, 실제 판매 수량 최소값은 70일 수 있다. 이 때 프로세서(110)는 최대값과 최소값의 차이에 사전 결정된 비율을 곱한 후, 실제 판매 수량의 최소값에 더함으로써 실제 판매 수량의 임계값을 결정할 수 있다. 전술한 예시는 실시를 위한 일 예시일 뿐 본 개시는 제 2 그래프(530)에 기초하여 실제 판매 수량의 임계값을 결정하는 다양한 방법을 제한없이 포함한다. In the fifth embodiment of determining the price reference value according to the present disclosure, the processor 110 primarily determines at least one price reference value based on the product quantity distribution according to price and then determines the price reference value according to the actual sales quantity distribution according to price. Based on this, the price standard can be determined secondarily. The fifth embodiment will be described with reference to FIG. 4. Figure 4 is an example diagram visually showing 'product quantity according to price' data and 'actual sales quantity according to price' data according to an embodiment of the present disclosure. Like Figures 2 and 3, Figure 4 is only a visual representation of data to specifically explain the operation of the processor 110 using a plurality of data, and does not limit the format or structure of the data according to the present disclosure. The first graph 510 in FIG. 4 shows the product quantity distribution function according to price. The second graph 530 in FIG. 4 shows the actual sales quantity distribution function according to price. In Figure 4, the processor 110 primarily based on the first graph 510 Two price reference values expressed as can be determined. The processor 110 may determine two price reference values from the first graph 510 based on the first or second embodiment described above. The processor 110 is a section defined by the two price reference values determined above (ie ), the price reference value can be determined secondarily based on the second graph 530. The processor 110 may secondarily add a new price reference value or change an existing price reference value based on the second graph 530. k in FIG. 5 may be a threshold value of the actual sales quantity calculated by the processor 110 in relation to the second graph 530 showing the actual sales quantity according to price. The processor 110 selects a section defined by the two determined price reference values in the second graph 530 (ie ) The threshold value (eg k) of the actual sales quantity can be determined based on the maximum and minimum values within ). For example, in the second graph 530 of FIG. 5 The maximum actual sales quantity within a section may be 100, and the minimum actual sales quantity may be 70. At this time, the processor 110 may determine the threshold value of the actual sales quantity by multiplying the difference between the maximum and minimum values by a predetermined ratio and adding it to the minimum value of the actual sales quantity. The above-mentioned example is only an example for implementation, and the present disclosure includes various methods for determining the threshold value of the actual sales quantity based on the second graph 530 without limitation.

프로세서(110)가 제 2 그래프(530)에서 실제 판매 수량의 임계값(e.g. k)을 결정할 경우, 프로세서(110)는 제 1 그래프(510)로부터 1차적으로 결정된 가격 기준값에 기초하여 2차적으로 가격 기준값을 결정할 수 있다. 프로세서(110)는 1차적으로 결정된 두 가격 기준값 (e.g.) 내에서 도 5에 도시된 바와 같이 새로운 가격 기준값()을 추가할 수 있다. 본 개시에 따를 경우, 가격에 따른 상품 수량만을 고려하는 것이 아닌, 가격에 따른 실제 판매 수량을 추가적으로 고려하여 가격 기준값을 결정할 수 있게 된다. 이는 판매자의 희망 판매 가격 측면이 고려한, 현재 판매되고 있는, "상품 수량"과 구매자의 희망 구매 가격 측면을 고려한, 거래가 성사된, "실제 판매 수량"을 모두 고려하여 가격 기준값을 결정할 수 있는 효과를 갖는다. 이하에서는 전술한 몇몇 실시예들을 통해 산출되는 가격 기준값을 이용하여 프로세서(110)가 가격 정보를 산출하는 방법에 대해 서술한다.When the processor 110 determines the actual sales quantity threshold (eg k) in the second graph 530, the processor 110 secondarily determines the price reference value primarily determined from the first graph 510. A price standard can be determined. The processor 110 uses the two primarily determined price reference values (eg ), as shown in Figure 5, the new price reference value ( ) can be added. According to the present disclosure, the price reference value can be determined by additionally considering the actual sales quantity according to the price, rather than only considering the quantity of the product according to the price. This has the effect of determining the price standard by considering both the “product quantity” that is currently being sold, which is considered in terms of the seller's desired sales price, and the “actual sales quantity,” which is the transaction that has been concluded, which is taken into account in terms of the buyer's desired purchase price. have Hereinafter, a method for the processor 110 to calculate price information using the price reference value calculated through some of the above-described embodiments will be described.

본 개시에 따른 프로세서(110)는 가격에 따른 상품 수량 분포 및 적어도 하나의 가격 기준값에 기초하여 가격 정보를 산출할 수 있다. 본 개시에 따른 가격에 따른 상품 수량 분포는 가격 최대값 및 가격 최소값을 포함할 수 있다. 프로세서(110)는 가격 최대값 및 가격 최소값으로 이루어진 '전체 가격 구간'에서 적어도 하나의 가격 기준값을 기준으로 구분되는 서로 상이한 '부분 가격 구간' 각각에 대해, 가격 점수를 할당할 수 있다. 부분 가격 구간을 설정하는 기준이 되는 적어도 하나의 가격 기준값은 전술한 몇몇 실시예에 기초하여 계산될 수 있다. 프로세서(110)는 부분 가격 구간에 포함된 상품 데이터에 대해 해당하는 부분 가격 구간에 할당된 가격 점수를 부여할 수 있다. 일 실시예에 있어서 가격 최소값이 10,000원, 가격 최대값이 1,000,000원이고, 가격 기준값이 100,000원인 경우, 프로세서(110)는 (10,000, 100,000) 구간에 포함된 상품 데이터들과 (100,000, 1,000,000) 구간에 포함되는 상품 데이터들에 각각 서로 다른 가격 점수를 부여할 수 있다. 경계값(e.g. 100,000)을 가격으로 하는 상품 데이터들에 대해 프로세서(110)는 사전 결정된 방식에 따라 가격 점수를 부여할 수 있다. 일례로, 가격 구간의 경계값을 가격으로 하는 상품 데이터에 대한 가격 점수는 아랫부분 가격 구간의 가격 점수 또는 윗부분 가격 구간의 가격 점수일 수 있다. 다른 일 실시예에 있어서 특정 상품에 대한 가격 최소값이 10,000원, 가격 최대값이 1,000,000원이고, 프로세서(110)에 의해 산출된 가격 기준값이 50,000, 200,000인 경우, 프로세서(110)는 가격 기준값에 의해 생성된 구간들(즉, (10,000, 50,000) 구간, (50,000, 200,000) 구간, (200,000, 1,000,000) 구간) 각각에 2점, 3점, 1점의 가격 점수를 부여할 수 있다. 본 개시에 따른 가격 정보 산출 방법은 판매되는 상품 수량 및 실제 판매된 상품 수량 모두에 기초하여 가격 기준값을 산출하고 이에 기초하여 가격 정보를 산출함으로써 보다 효과적인 가격 정보를 산출할 수 있다.The processor 110 according to the present disclosure may calculate price information based on product quantity distribution according to price and at least one price reference value. Product quantity distribution according to price according to the present disclosure may include a maximum price value and a minimum price value. The processor 110 may assign a price score to each of the different 'partial price sections' divided based on at least one price reference value in the 'total price section' consisting of the maximum price value and the minimum price value. At least one price reference value that serves as a standard for setting a partial price range may be calculated based on some of the above-described embodiments. The processor 110 may grant a price score allocated to the corresponding partial price range to product data included in the partial price range. In one embodiment, when the minimum price value is 10,000 won, the maximum price value is 1,000,000 won, and the price reference value is 100,000 won, the processor 110 processes the product data included in the (10,000, 100,000) section and the (100,000, 1,000,000) section. Different price scores can be assigned to each product data included in . For product data whose price is a boundary value (e.g. 100,000), the processor 110 may assign a price score according to a predetermined method. For example, the price score for product data whose price is the boundary value of the price range may be the price score of the lower price range or the price score of the upper price range. In another embodiment, when the minimum price for a specific product is 10,000 won and the maximum price is 1,000,000 won, and the price reference value calculated by the processor 110 is 50,000 and 200,000, the processor 110 determines the price based on the price reference value. A price score of 2 points, 3 points, and 1 point can be assigned to each of the created sections (i.e., (10,000, 50,000) section, (50,000, 200,000) section, and (200,000, 1,000,000) section). The price information calculation method according to the present disclosure can calculate more effective price information by calculating a price reference value based on both the quantity of products sold and the actual quantity of products sold, and calculating price information based on this.

본 개시의 일 실시예에 따른 프로세서(110)는 전체 가격의 평균 가격값보다 작은 가격 범위에 포함되는 복수의 가격 구간들 중, 최대값을 포함하는 구간에 대해 가장 큰 가격 점수를 부여할 수 있다. 예를 들어, 전체 가격의 평균 가격값이 10,000원이고, 적어도 하나의 가격 기준값에 따라 산정된 복수의 가격 구간이 [2,000, 4,000], (4,000, 6,000), (6,000, 8,000], (8,000, 12,000], (12,000, 16,000] 인 경우, 프로세서(110)는 평균 가격값인 10,000원 보다 작은 가격 범위에 있어서, 가장 큰 가격 범위를 포함하는 '(6,000, 8,000] 구간'에 대해 다른 구간들에 비해 큰 가격 점수를 부여할 수 있다.The processor 110 according to an embodiment of the present disclosure may assign the highest price score to the section containing the maximum value among a plurality of price sections included in a price range that is smaller than the average price value of all prices. . For example, the average price of all prices is 10,000 won, and multiple price ranges calculated according to at least one price standard are [2,000, 4,000], (4,000, 6,000), (6,000, 8,000], (8,000, 12,000], (12,000, 16,000], the processor 110 is in a price range smaller than 10,000 won, which is the average price, and provides A higher price score can be given compared to other products.

본 개시에 따른 프로세서(110)는 산출된 가격 정보에 기초하여 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 각각의 상품 데이터에 대한 가격 점수를 정규화하여 각 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 정규화된 평가 정보 순위에 기초하여 평가 정보를 생성할 수도 있다. 예를 들어, 프로세서(110)는 수학식 1에 나타난 정규화 수식에 기초하여 가격 정보로부터 평가 정보를 생성할 수 있다.The processor 110 according to the present disclosure may generate evaluation information corresponding to at least one product data included in the product data set based on the calculated price information. The processor 110 may normalize the price score for each product data and generate evaluation information corresponding to each product data. The processor 110 may generate evaluation information based on the normalized evaluation information ranking. For example, the processor 110 may generate evaluation information from price information based on the normalization formula shown in Equation 1.

[수학식 1][Equation 1]

z는 정규화 결과 생성된 평가 점수를 나타낸다. 는 전체 상품 데이터의 집합을 나타낸다. 는 각 상품 데이터의 가격 점수를 나타낸다. 는 가격 점수들의 평균값을 나타낸다. 는 가격 점수들의 표준 편차를 나타낸다. z represents the evaluation score generated as a result of normalization. represents the set of entire product data. represents the price score of each product data. represents the average value of the price scores. represents the standard deviation of price scores.

프로세서(110)는 가격 정보에 기초하여 상품 데이터에 대응되는 평가 정보를 생성함으로써, 가격 정보를 안정적으로 제공할 수 있다. 가격 정보에 대한 정규화를 수행하지 않을 경우, 가격 정보는 전체 가격 구간의 끝값과 같은 특이값에 쉽게 영향을 받을 수 있다. 즉, 가격 정보는 아웃라이어(outlier) 데이터에 민감한 정보일 수 있다. 이 때, 본 개시내용과 같이 가격 점수를 정규화하여 평가 정보를 생성하게 되면 아웃라이어(outlier) 데이터에 대한 민감도를 낮출 수 있다. 이는 특이값에 의한 각 점수들의 변화량을 감소시키므로 사용자로 하여금 신뢰할 수 있는 평가 정보를 안정적으로 제공하는 효과를 갖는다.The processor 110 can stably provide price information by generating evaluation information corresponding to product data based on the price information. If normalization is not performed on the price information, the price information can be easily affected by outliers, such as the end values of the entire price range. In other words, price information may be information that is sensitive to outlier data. At this time, if evaluation information is generated by normalizing price scores as in the present disclosure, sensitivity to outlier data can be reduced. This has the effect of stably providing users with reliable evaluation information by reducing the amount of change in each score due to outliers.

본 개시에 따른 프로세서(110)는 상품 데이터 셋에 포함된 복수의 상품 데이터 각각의 진위 정보를 산출할 수 있다. 본 개시내용에 있어서 진위 정보는 상품 데이터에 대한 신뢰도를 구매자에게 제공하기 위한 정보일 수 있다. 진위 정보는 상품 데이터에 대한 신뢰도를 점수화한 신뢰 점수를 포함할 수 있다. 진위 정보는 구매자에게 신뢰도를 설명하기 위한 설명 태그를 포함할 수 있다. 설명 태그에는 예를 들어 '가품', '가품주의', '진품' 등의 내용 또는 이와 유사한 내용의 태그들이 포함될 수 있다. 진위 정보에 포함된 설명 태그는 진위 정보에 포함된 신뢰 점수에 기초하여 산출될 수 있다.The processor 110 according to the present disclosure can calculate authenticity information for each of a plurality of product data included in the product data set. In the present disclosure, authenticity information may be information for providing reliability of product data to buyers. Authenticity information may include a trust score that scores the reliability of product data. Authenticity information may include a description tag to explain reliability to buyers. Description tags may include, for example, content such as 'counterfeit product', 'caution of counterfeit product', 'genuine product', or tags with similar content. The description tag included in the authenticity information may be calculated based on the trust score included in the authenticity information.

프로세서(110)는 비정형 상품 텍스트로부터 상기 비정형 상품 텍스트에 대응되는 상품 데이터에 대한 진위 정보를 산출할 수 있다. 상품 정보 평가를 위한 평가 리스트는 허위(fake) 인덱스를 포함할 수 있다. 프로세서(110)는 평가 리스트에 포함된 허위(fake) 인덱스에 기초하여 비정형 상품 텍스트를 검사할 수 있다. 상기 허위 인덱스는 복수의 하위 인덱스를 포함할 수 있다. 본 개시에 따른 허위 인덱스에 포함된 복수의 하위 인덱스에는 가품 패턴 인덱스, 구매 인덱스 등이 포함될 수 있다. 본 개시내용에 따른 가품 패턴 인덱스는 물건이 가품인 경우에 나타날 수 있는 언어 표현에 관한 하나 이상의 비교 대상 토큰을 포함할 수 있다. 예를 들어, 가품 패턴 인덱스는 “이미테이션”, “정품이랑 거의 같”, “정품이랑 구분”, “느낌”, “스타일” 등의 비교 대상 토큰을 포함할 수 있다. 전술한 예시는 설명을 위한 예시일 뿐, 본 개시는 물건이 가품인 경우에 텍스트에 포함되는 다양한 언어 표현을 포함한다. 본 개시내용에 따른 구매 인덱스는 물건을 판매하기 위한 것이 아니라 구매하기 위한 언어 표현에 관한 하나 이상의 비교 대상 토큰을 포함할 수 있다. 예를 들어, 구매 인덱스는 “삽니다.”, “매입”, “매입 희망” 등의 비교 대상 토큰들을 포함할 수 있다. 구매 인덱스는 판매를 위해 작성된 것이 아닌 구매를 위해 작성된 비정형 상품 텍스트를 검출하기 위한 인덱스로, 상품을 평가하기 위한 대상이 되는 정보가 아닐 수 있다. The processor 110 may calculate authenticity information about product data corresponding to the unstructured product text from the unstructured product text. The evaluation list for evaluating product information may include fake indices. The processor 110 may inspect the unstructured product text based on a fake index included in the evaluation list. The false index may include multiple sub-indexes. A plurality of sub-indexes included in the false index according to the present disclosure may include a fake product pattern index, a purchase index, etc. The counterfeit product pattern index according to the present disclosure may include one or more comparison target tokens related to language expressions that may appear when the product is counterfeit. For example, the counterfeit pattern index may include comparative tokens such as “imitation,” “almost identical to the genuine product,” “distinguished from the genuine product,” “feel,” and “style.” The above-mentioned example is only an example for explanation, and the present disclosure includes various language expressions included in the text when the product is a counterfeit product. A purchase index according to the present disclosure may include one or more comparison tokens that relate to a linguistic expression for purchasing, but not for selling, an item. For example, the purchase index may include comparative tokens such as “buy,” “buy,” and “want to buy.” The purchase index is an index for detecting unstructured product text written for purchase rather than for sale, and may not be information used to evaluate a product.

프로세서(110)는 허위 인덱스에 할당된 하나 이상의 비교 대상 토큰에 기초하여 비정형 상품 텍스트를 검사한 후, 검사 결과에 따라 진위 정보를 산출할 수 있다. 프로세서(110)는 허위 인덱스에 할당된 하나 이상의 비교 대상 토큰이 비정형 상품 텍스트 내에 존재하는 검출 횟수에 기초하여 진위 정보를 산출할 수 있다. 예를 들어, 상기 검출 횟수가 0회 인 경우 해당 상품 데이터의 신뢰 점수는 3점이고 설명 태그는 “진품”일 수 있다. 계속된 실시예에서 상기 검출 횟수가 1 내지 3회인 경우, 해당 상품 데이터의 신뢰 점수는 2점이고 설명 태그는 “가품주의”일 수 있다. 또한, 검출 횟수가 4회 이상인 경우 신뢰 점수는 1점이며 설명 태그는 “가품”일 수 있다. 전술한 예시는 설명을 위한 일 예시에 불과할 뿐 본 개시는 비정형 상품 텍스트에 기초하여 허위 인덱스를 이용한 진위 정보 산출의 다양한 방법을 제한없이 포함한다.The processor 110 may inspect the unstructured product text based on one or more comparison target tokens assigned to the false index and then calculate authenticity information according to the inspection result. The processor 110 may calculate authenticity information based on the number of detections in which one or more comparison target tokens assigned to a false index are present in the unstructured product text. For example, if the number of detections is 0, the trust score of the product data may be 3 and the description tag may be “genuine.” In the continued embodiment, if the number of detections is 1 to 3, the trust score of the corresponding product data may be 2 and the description tag may be “Beware of counterfeit product.” Additionally, if the number of detections is 4 or more, the trust score is 1 and the description tag may be “fake.” The above-mentioned example is only an example for explanation, and the present disclosure includes, without limitation, various methods of calculating authenticity information using a false index based on unstructured product text.

프로세서(110)는 상품 데이터 셋으로부터 산출된 가격 정보에 기초하여 상품 데이터에 대한 진위 정보를 산출할 수 있다. 가격 정보 산출 방법의 경우 상술한 바, 이하에서는 그 자세한 설명은 생략한다. 본 개시에 따른 가격 정보에는 상품 데이터의 가격에 따른 수량, 상품 데이터의 가격 구간별 점수 등이 포함될 수 있다. 프로세서(110)는 가격 정보에 포함된 복수의 가격 구간들 중 적어도 일부에 기초하여 진위 정보를 산출할 수 있다. 예를 들어, 프로세서(110)는 복수의 가격 구간들 중 최하위 가격 구간에 포함된 상품 데이터들을 “가품” 또는 “가품주의” 등으로 결정할 수 있다. 도 2를 다시 참조하면, 프로세서(110)는 도 2의 가격 기준값 보다 작은 가격을 갖는 상품 데이터들을 “가품” 또는 “가품주의”로 결정할 수 있다. 본 개시에 따른 프로세서(110)는 해당 상품의 가격과 상품의 평균가격의 차이를 고려하여 진위 정보를 생성할 수 있다.The processor 110 may calculate authenticity information about product data based on price information calculated from the product data set. The price information calculation method has been described above, and detailed description thereof will be omitted below. Price information according to the present disclosure may include quantity according to price of product data, scores for each price section of product data, etc. The processor 110 may calculate authenticity information based on at least some of the plurality of price ranges included in the price information. For example, the processor 110 may determine product data included in the lowest price range among a plurality of price ranges as “counterfeit products” or “caution of counterfeit products.” Referring again to FIG. 2, the processor 110 determines the price reference value of FIG. 2 Product data with a smaller price can be determined as “counterfeit” or “counterfeit.” The processor 110 according to the present disclosure can generate authenticity information by considering the difference between the price of the product and the average price of the product.

본 개시에 따른 프로세서(110)는 허위 인덱스 및 가격 정보 모두에 기초하여 진위 정보를 산출할 수도 있다. 프로세서(110)는 전술한 바와 같이, 허위 인덱스에 할당된 적어도 하나의 비교 대상 토큰에 기초하여 비정형 상품 텍스트를 검사한 후 각 상품 데이터에 대한 신뢰 점수를 산출할 수 있다. 또한, 프로세서(110)는 가격 정보에 기초하여 가품이 의심되는 구간을 결정할 수 있다. 프로세서(110)는 산출된 신뢰 점수들에 대해 가격 구간에 따른 가중치를 곱하여 진위 정보를 산출할 수 있다. 예를 들어, 허위 인덱스에 기초하여 산출된 복수의 상품 데이터 각각에 대한 신뢰 점수 및 가격이 {(X1: 10000원, 2점), (X2: 30000원, 3점), (X3: 50000원, 1점)}와 같이 나타난다고 가정하자. 계속하여, 가격 구간에 따른 가중치는 20000원 이하에서 0.5, 20000원 초과에서 1이라고 가정하자. 이 때 가중 신뢰 점수는 {(X1: 10000원, 1점), (X2: 30000원, 3점), (X3: 50000원, 1점)}과 같이 나타날 수 있다. 프로세서(110)는 X1 상품 데이터에 대하여 “가품”, X2 상품 데이터에 대하여 “진품”, X3 상품 데이터에 대하여 “가품”이라는 설명 태그를 부여할 수 있다. 프로세서(110)는 산출된 진위 정보에 기초하여 적어도 하나의 상품 데이터를 상품 데이터 셋에서 제거할 수 있다. 상품이 가품일 경우 해당 데이터는 노이즈인 바, 본 개시는 노이즈 데이터를 사용자에게 제공하지 않고 나아가 전체 데이터에 대한 통계를 계산함에 있어서 노이즈를 제거하는 효과를 갖는다.The processor 110 according to the present disclosure may calculate authentic information based on both false index and price information. As described above, the processor 110 may calculate a trust score for each product data after examining the unstructured product text based on at least one comparison target token assigned to the false index. Additionally, the processor 110 may determine a section in which a counterfeit product is suspected based on price information. The processor 110 may calculate authenticity information by multiplying the calculated trust scores by a weight according to the price range. For example, the trust score and price for each of the multiple product data calculated based on the false index are {(X1: 10,000 won, 2 points), (X2: 30,000 won, 3 points), (X3: 50,000 won, 1 point)} Let’s assume it appears like this. Continuing, let us assume that the weight according to the price range is 0.5 for less than 20,000 won and 1 for more than 20,000 won. At this time, the weighted trust score may appear as {(X1: 10,000 won, 1 point), (X2: 30,000 won, 3 points), (X3: 50,000 won, 1 point)}. The processor 110 may assign a description tag of “counterfeit product” to the X1 product data, “genuine product” to the X2 product data, and “counterfeit product” to the X3 product data. The processor 110 may remove at least one product data from the product data set based on the calculated authenticity information. If the product is a counterfeit product, the data is noise, and the present disclosure has the effect of not providing noise data to the user and further removing noise when calculating statistics on the entire data.

본 개시의 일 실시예에 따른 프로세서(110)는 상품 데이터 셋에 기초하여 상태 정보를 산출할 수 있다. 본 개시내용에 있어서 상품에 대한 '상태 정보'는 상품의 '상태를 나타내는 점수'와 상호 교환적으로 사용될 수 있다. 프로세서(110)는 비정형 상품 텍스트에서 상품의 상태(e.g. 보존상태, 구매시기, 구입처, 구성품 유무 등)를 나타내는 문자열을 인식함으로써 상품 데이터를 생성한 후, 생성된 상품 데이터에 기초하여 상태 정보를 산출할 수 있다. 상태 정보는 상품 데이터에 포함된 상태 인덱스들 및 각각의 상태 인덱스에 포함되는 비교 결과 토큰에 기초하여 산출될 수 있다. 상품 데이터에 포함된 상태 인덱스들의 집합은 평가 리스트에 포함된 상태 인덱스들의 집합에 대한 부분 집합일 수 있다.The processor 110 according to an embodiment of the present disclosure may calculate status information based on the product data set. In the present disclosure, 'state information' for a product may be used interchangeably with 'score indicating the state' of the product. The processor 110 generates product data by recognizing a string representing the status of the product (e.g. preservation status, purchase time, place of purchase, presence or absence of components, etc.) of the product in the unstructured product text, and then calculates status information based on the generated product data. can do. State information may be calculated based on state indexes included in product data and comparison result tokens included in each state index. The set of condition indices included in the product data may be a subset of the set of condition indices included in the evaluation list.

본 개시의 일 실시예에 따른 프로세서(110)는 상태 정보를 산출하기 위해, 적어도 하나의 상태 인덱스에 기초하여 상품 데이터 셋에 포함된 각각의 상품 데이터에 대한 상태 점수를 연산한 후, 적어도 하나의 상태 보조 인덱스에 기초하여 연산된 상태 점수를 보정할 수 있다.In order to calculate state information, the processor 110 according to an embodiment of the present disclosure calculates a state score for each product data included in the product data set based on at least one state index, and then calculates the state score for each product data included in the product data set. The calculated state score can be corrected based on the state auxiliary index.

본 개시에 따른 상태 점수는 상태 정보에 포함된 데이터로서 상품 데이터에 포함된 상태 인덱스에 기초하여 계산될 수 있다. 상품 데이터에 포함된 상태 인덱스는 복수의 하위 상태 인덱스를 포함할 수 있다. 각각의 하위 상태 인덱스는 적어도 하나의 비교 결과 토큰을 포함할 수 있다. 상태 인덱스는 보존 상태 인덱스, 연식 인덱스, 구입처 인덱스 등을 하위 상태 인덱스로서 포함할 수 있다. 예를 들어, 제 1 상품 데이터가 “올해 초에 현대백화점에서 구입했습니다. 포장 뜯지도 않은 완전 새상품입니다.”라는 비정형 상품 텍스트에 대해 생성된 데이터일 경우, 프로세서(110)가 생성한 제 1 상품 데이터의 보존 상태 인덱스에는 '새상품'이라는 비교 결과 토큰이 포함될 수 있다. 또한, 제 1 상품 데이터의 연식 인덱스에는 '올해 초'라는 비교 결과 토큰이 포함될 수 있고 제 1 상품 데이터의 구입처 인덱스에는 '백화점'이라는 비교 결과 토큰이 포함될 수 있다. 다른 예를 들어, 제 2 상품 데이터가 “작년에 해외 직구했습니다. 약간의 사용감이 있습니다.”라는 비정형 상품 텍스트에 대해 생성된 데이터일 경우, 프로세서(110)가 생성한 제 2 상품 데이터의 보존 상태 인덱스에는 '사용감'이라는 비교 결과 토큰이 포함될 수 있다. 또한, 제 2 상품 데이터의 연식 인덱스에는 '작년'이라는 비교 결과 토큰이 포함될 수 있고, 제 2 상품 데이터의 구입처 인덱스에는 '해외 직구' 라는 비교 결과 토큰이 포함될 수 있다. The condition score according to the present disclosure is data included in status information and can be calculated based on a status index included in product data. The state index included in product data may include a plurality of sub-state indexes. Each substate index may include at least one comparison result token. The state index may include a preservation state index, an age index, a place of purchase index, etc. as a sub-state index. For example, the first product data is “I purchased it at Hyundai Department Store early this year. In the case of data generated for an unstructured product text such as “This is a completely new product without unpacking,” the preservation status index of the first product data generated by the processor 110 may include a comparison result token called ‘New product’. . Additionally, the year index of the first product data may include a comparison result token of 'early this year', and the purchase place index of the first product data may include a comparison result token of 'department store'. For another example, the second product data is “I bought it directly from overseas last year. In the case of data generated for the unstructured product text “There is a slight feeling of use,” the preservation status index of the second product data generated by the processor 110 may include a comparison result token called “Feeling of use.” Additionally, the year index of the second product data may include a comparison result token of 'last year', and the purchase place index of the second product data may include a comparison result token of 'overseas direct purchase'.

프로세서(110)는 각 인덱스에 할당된 비교 결과 토큰에 기초하여 각 상품 데이터에 대한 상태 점수를 연산할 수 있다. 프로세서(110)는 상태 인덱스에 포함된 비교 결과 토큰의 수에 비례하여 상품 데이터에 대한 상태 점수를 연산할 수 있다. 프로세서(110)는 상태 인덱스에 포함된 비교 결과 토큰의 수가 클수록 높은 점수를 부여할 수 있다. 예를 들어, 제 i 상품 데이터에 포함된 상태 인덱스 관련 비교 결과 토큰의 수가 10이고, 제 j 상품 데이터에 포함된 상태 인덱스 관련 비교 결과 토큰의 수가 5인 경우, 프로세서(110)는 제 i 상품 데이터에 대해 높은 점수를 부여할 수 있다.The processor 110 may calculate a status score for each product data based on the comparison result token assigned to each index. The processor 110 may calculate a status score for product data in proportion to the number of comparison result tokens included in the status index. The processor 110 may assign a higher score as the number of comparison result tokens included in the state index increases. For example, if the number of status index-related comparison result tokens included in the i product data is 10 and the number of status index-related comparison result tokens included in the j product data is 5, the processor 110 processes the i product data A high score can be given for .

프로세서(110)는 적어도 하나의 인덱스에 대한 비교 결과 토큰별로 가중치를 두어 상품 데이터에 대한 상태 점수를 연산할 수 있다. 상기 제 1 상품 데이터와 제 2 상품 데이터에 관한 계속된 실시예에서, 프로세서(110)는 보존 상태 인덱스에 할당된 비교 결과 토큰들에 대해 사전 결정된 점수를 부여할 수 있다. 보존 상태 인덱스의 사전 결정된 점수는 보존 상태가 좋은 것으로 판단되는 문자열에 보다 높은 점수를 주도록 설정될 수 있다. 예를 들어, “새상품”> “새 것과 다름 없”> “사용감”과 같이 점수의 크기에 차등을 두어 하나 이상의 비교 결과 토큰들에 사전 결정된 점수를 부여할 수 있다. 또한, 프로세서(110)는 연식 인덱스에 할당된 비교 결과 토큰이 가까운 과거를 의미할수록 높은 점수를 부여할 수 있다. 예를 들어, “올해 초”> “작년”> “몇 년 전” 과 같이 연식 인덱스에 부여되는 점수에 차등을 줄 수 있다. 또한 프로세서(110)는 구입처 인덱스에 할당된 비교 결과 토큰들에 대해 사전 결정된 점수를 부여할 수 있다. 예를 들어, “백화점”> “아울렛”> “해외 직구” 와 같이 구입처 인덱스에 부여되는 점수에 차등을 줄 수 있다. 위의 예시와 같이 점수를 환산할 경우, 프로세서(110)는 제 1 상품 데이터에 대해 제 2 상품 데이터보다 높은 상태 점수를 부여할 수 있다. 전술한 실시예는 상태 인덱스에 기초하여 상품 데이터에 대한 상태 점수를 연산하는 본 개시의 일 실시예를 구체적으로 서술하기 위한 것에 지나지 않으며, 본 개시는 다양한 실시예를 제한없이 포함한다.The processor 110 may calculate a status score for product data by weighting each token as a result of comparing at least one index. In a continued embodiment relating to the first product data and the second product data, processor 110 may assign predetermined scores to the comparison result tokens assigned to the retention status index. The predetermined score of the preservation status index may be set to give a higher score to a string that is judged to have a good preservation status. For example, a predetermined score can be assigned to one or more comparison result tokens by varying the size of the score, such as “new product” > “just like new” > “feel of use.” Additionally, the processor 110 may assign a higher score as the comparison result token assigned to the age index indicates a closer past. For example, the scores given to the year index can be differentiated, such as “early this year” > “last year” > “a few years ago.” Additionally, the processor 110 may grant predetermined scores to the comparison result tokens assigned to the purchase destination index. For example, the scores given to the purchase place index can be differentiated, such as “Department Store” > “Outlet” > “Overseas Direct Purchase”. When converting scores as in the example above, the processor 110 may grant a higher status score to the first product data than to the second product data. The above-described embodiment is merely intended to specifically describe one embodiment of the present disclosure that calculates the status score for product data based on the status index, and the present disclosure includes various embodiments without limitation.

본 개시에 따른 프로세서(110)는 적어도 하나의 상태 보조 인덱스에 기초하여 전술한 바와 같이 연산된 상태 점수를 보정할 수 있다. 본 개시의 상태 보조 인덱스에 포함된 비교 결과 토큰은 상태 인덱스에 포함된 비교 결과 토큰을 수식하는 문자열일 수 있다. 예를 들어, 상태 보조 인덱스에 포함된 비교 결과 토큰은 부정어 또는 부사어일 수 있다. 부정어는 “없다”,”없는”, “아닌”, “아니지만” 등을 포함할 수 있다. 부사어는 “조금”, “약간”, “많이”, “완전” 등을 포함할 수 있다. 본 개시에 따른 상태 보조 인덱스는 상태 인덱스에 포함된 비교 결과 토큰이 나타내는 언어적 의미의 강도를 증가시키거나 감소시키는 토큰들로 구성될 수 있다. 프로세서(110)는 상태 보조 인덱스에 기초하여, 상태 인덱스에 기초하여 산출된 상태 점수를 보정할 수 있다. The processor 110 according to the present disclosure may correct the state score calculated as described above based on at least one state auxiliary index. The comparison result token included in the state auxiliary index of the present disclosure may be a string that modifies the comparison result token included in the state index. For example, the comparison result token included in the status auxiliary index may be a negative or an adverb. Negative words can include “no”, “without”, “not”, “but”, etc. Adverbs can include “a little,” “a little,” “a lot,” and “completely.” The status auxiliary index according to the present disclosure may be composed of tokens that increase or decrease the strength of the linguistic meaning indicated by the comparison result token included in the status index. The processor 110 may correct the state score calculated based on the state index based on the state assistance index.

전술한 제 1 상품 데이터를 이용하여 상태 점수 보정 방법의 일 실시예를 서술한다. “올해 초에 현대백화점에서 구입했습니다. 포장 뜯지도 않은 완전 새상품입니다.”에 관한 제 1 상품 데이터에 있어서, 상품 데이터에 포함된 상태 인덱스의 비교 결과 토큰은 “새상품”을 포함할 수 있다. 이 때 “새상품”을 수식하는 토큰으로서 상태 보조 인덱스에는 “완전”이라는 비교 결과 토큰이 포함될 수 있다. 일 예시를 위한 “완전”이라는 비교 결과 토큰은 그 수식되는 비교 결과 토큰의 언어적 의미의 강도를 증가시키는 토큰일 수 있다. 프로세서(110)는 수식하는 비교 결과 토큰 및 수식되는 비교 결과 토큰에 기초하여 상태 점수를 보정할 수 있다. “새상품”이라는 비교 결과 토큰에 부여된 점수가 10점일 경우, 프로세서(110)는 “완전”이라는 수식어를 고려하여 점수의 절대값을 증가시킬 수 있다. 또한 프로세서(110)는 “완전”이라는 수식어를 고려하여 절대값이 1 이상인 수를 곱함으로써 기존에 부여된 상태 점수의 절대값을 증가시킬 수도 있다. 프로세서(110)에 의한 보정 결과, “새상품”이라는 비교 결과 토큰에 부여된 점수는 11점으로 조정될 수 있다. An example of a condition score correction method will be described using the above-described first product data. “I bought it at Hyundai Department Store early this year. In the first product data regarding “This is a completely new product without even opening the packaging,” the token as a result of comparing the condition index included in the product data may include “New product.” In this case, as a token that modifies “new product,” the status auxiliary index may include a comparison result token called “complete.” For example, a comparison result token called “complete” may be a token that increases the strength of the linguistic meaning of the comparison result token it modifies. The processor 110 may correct the status score based on the modifying comparison result token and the modifying comparison result token. If the score given to the token as a result of the comparison of “new product” is 10 points, the processor 110 may increase the absolute value of the score by considering the modifier “complete.” Additionally, the processor 110 may consider the modifier “complete” and increase the absolute value of the previously assigned status score by multiplying the number by a number whose absolute value is 1 or more. As a result of correction by the processor 110, the score given to the comparison result token as “new product” can be adjusted to 11 points.

전술한 제 2 상품 데이터를 이용하여 상태 점수 보정 방법의 다른 일 실시예를 서술한다. “작년에 해외 직구했습니다. 약간의 사용감이 있습니다.”에 관한 제 2 상품 데이터에 있어서, 상품 데이터에 포함된 상태 인덱스의 비교 결과 토큰은 “사용감”을 포함할 수 있다. 이 때 “사용감”을 수식하는 토큰으로서 상태 보조 인덱스에는 “약간”라는 비교 결과 토큰이 포함될 수 있다. 일 예시를 위한 “약간”이라는 비교 결과 토큰은 그 수식되는 비교 결과 토큰의 언어적 의미의 강도를 감소시키는 토큰일 수 있다. 프로세서(110)에 의해 “사용감”이라는 비교 결과 토큰에 부여된 점수가 5점인 경우, 프로세서(110)는 “약간”이라는 수식어를 고려하여 점수의 절대값을 감소시킬 수 있다. 또한 프로세서(110)는 “약간”이라는 수식어를 고려하여 절대값이 1 미만인 수를 곱함으로써 기존에 부여된 상태 점수의 절대값을 감소시킬 수도 있다. 프로세서(110)에 의한 보정 결과, “사용감”이라는 비교 결과 토큰에 부여된 점수는 4점으로 조정될 수 있다. 전술한 실시예는 상태 보조 인덱스에 추가적으로 기초하여 상품 데이터에 대해 기 연산된 상태 점수를 보정하는 본 개시의 일 실시예를 구체적으로 개시하기 위한 서술에 지나지 않으며, 본 개시는 다양한 실시예를 제한없이 포함한다.Another embodiment of the condition score correction method will be described using the above-described second product data. “I bought it directly from overseas last year. In the second product data regarding “There is a slight feeling of use.”, the token as a result of comparison of the condition index included in the product data may include “a feeling of use.” In this case, as a token that modifies “feel of use,” the status auxiliary index may include a comparison result token called “a little.” For example, a comparison result token of “a little” may be a token that reduces the strength of the linguistic meaning of the comparison result token it modifies. If the score given to the token by the processor 110 as a result of the comparison of “feel of use” is 5 points, the processor 110 may reduce the absolute value of the score by considering the modifier “slightly.” Additionally, the processor 110 may reduce the absolute value of the previously assigned status score by considering the modifier “slightly” and multiplying the number by a number whose absolute value is less than 1. As a result of correction by the processor 110, the score given to the token as a result of the comparison of “feel of use” may be adjusted to 4 points. The above-described embodiment is nothing more than a description to specifically disclose an embodiment of the present disclosure that corrects the previously calculated status score for product data based additionally on the status auxiliary index, and the present disclosure is intended to describe various embodiments without limitation. Includes.

본 개시의 일 실시예에 따라 상태 점수를 보정하는 단계는, 상태 인덱스에 대응되는 비교 결과 토큰과 상기 상태 보조 인덱스에 대응되는 비교 결과 토큰 사이의 토큰 거리에 기초하여 수행될 수 있다. 본 개시내용에 있어서 토큰 거리는 문장 내에서의 단어 거리를 의미할 수 있다. 예를 들어 “Lorem ipsum dolor sit amet”이라는 문장에서 Lorem과 dolor의 토큰 거리는 2일 수 있다. 또한 sit과 amet 사이의 토큰 거리는 1일 수 있다.According to an embodiment of the present disclosure, the step of correcting the status score may be performed based on the token distance between the comparison result token corresponding to the status index and the comparison result token corresponding to the status auxiliary index. In the present disclosure, token distance may mean word distance within a sentence. For example, in the sentence “Lorem ipsum dolor sit amet” the token distance between Lorem and dolor might be 2. Also, the token distance between sit and amet can be 1.

본 개시의 일 실시예에 있어서, 프로세서(110)는 상태 인덱스에 대응되는 비교 결과 토큰과 상태 보조 인덱스에 대응되는 비교 결과 토큰 사이의 토큰 거리가 사전 결정된 임계 토큰 거리 이하일 경우 상태 점수를 보정할 수 있다. 일 실시예에서 상태 인덱스에 대응되는 비교 결과 토큰이 “스크래치”이고 상태 보조 인덱스에 대응되는 비교 결과 토큰이 “없”이라고 가정하자. 그리고 사전 결정된 임계 토큰 거리는 2일 수 있다. 이 때, “스크래치가 약간 있지만, 딱히 없는 편입니다.”라는 텍스트에 기초하여 생성된 제 3 상품 데이터와 “스크래치가 없습니다.”라는 텍스트에 기초하여 생성된 제 4 상품 데이터가 존재할 수 있다. 제 3 상품 데이터에서 “스크래치”와 “없” 사이의 토큰 거리는 4이다. 제 3 상품 데이터에서 프로세서(110)는 “스크래치”라는 토큰에 기초하여 산출된 상태 점수를 보정하지 않을 수 있다. 반면, 제 4 상품 데이터에서 “스크래치”와 “없” 사이의 토큰 거리는 1이며, 프로세서(110)는 “스크래치”라는 토큰에 기초하여 산출된 상태 점수를 “없”이라는 비교 결과 토큰에 기초하여 보정할 수 있다. 위와 같이 본 개시에 따른 프로세서(110)는 상태 보조 인덱스와 상태 인덱스 사이의 거리를 고려하여, 같은 수식이라도 수식의 강도가 다른 점을 고려하여 합리적으로 텍스트를 분석하고 최종적으로 상태점수를 산출할 수 있다.In one embodiment of the present disclosure, the processor 110 may correct the state score when the token distance between the comparison result token corresponding to the state index and the comparison result token corresponding to the state auxiliary index is less than or equal to a predetermined threshold token distance. there is. In one embodiment, assume that the comparison result token corresponding to the state index is “scratch” and the comparison result token corresponding to the state auxiliary index is “missing.” And the predetermined threshold token distance may be 2. At this time, there may be third product data created based on the text “There are a few scratches, but not really,” and fourth product data created based on the text “There are no scratches.” In the third product data, the token distance between “scratch” and “no” is 4. In the third product data, the processor 110 may not correct the status score calculated based on the token “scratch”. On the other hand, in the fourth product data, the token distance between “scratch” and “nothing” is 1, and the processor 110 corrects the status score calculated based on the token “scratch” based on the comparison result token of “nothing”. can do. As described above, the processor 110 according to the present disclosure can reasonably analyze the text and finally calculate the state score by considering the distance between the state auxiliary index and the state index, and considering the fact that the strength of the formula is different even if the formula is the same. there is.

본 개시의 추가적인 실시예서 프로세서(110)는, 토큰 거리가 가까울수록 보정으로 인한 상태 점수 변동폭이 증가되는 방식으로 상태 점수를 보정할 수 있다. 예를 들어, 같은 토큰 쌍(상태 인덱스에 포함된 토큰 - 상태 보조 인덱스에 포함된 토큰)에 기초한 토큰 거리가 1일 때 상태 점수의 변동폭은 토큰 거리가 3일 때 상태 점수의 변동폭보다 클 수 있다. 전술한 제 3 및 제 4 상품 데이터를 다시 참조하되, “스크래치”에 관하여 부여된 상태 점수가 '-5'라고 가정하자. 이 때 토큰 거리가 가까울수록 상태 점수의 보정으로 인한 변동폭을 증가하기 위해, 프로세서(110)는 “없”이라는 부정어에 의한 보정 방식으로 을 곱하는 방식을 취할 수 있다. 즉, 제 3 상품 데이터에 있어서, “스크래치”에 부여된 상태 점수에는 이 곱해질 수 있다. 그 결과 보정된 상태 점수는 일 수 있다. 마찬가지로, 제 4 상품 데이터에 있어서, “스크래치”에 부여된 상태 점수에는 이 곱해질 수 있다. 그 결과 보정된 상태 점수는 5일 수 있다. 이처럼 본 개시에 따른 프로세서(110)는 상태 보조 인덱스에 대한 비교 결과 토큰에 기초하여 상태 점수를 보정하되, 추가적으로 거리에 반비례하여 상태 점수를 보정함으로써, 같은 수식 관계를 갖는 단어라도 그 거리에 따라 의미의 강도가 달라지는 점을 고려할 수 있다. 예컨대 위 “스크래치가 딱히 없는 편”이라는 표현보다 “스크래치가 없”다는 표현이 보다 명확하게 “스크래치”의 유무를 전달하는 바, 본 개시에 따르면 위와 같은 언어적 의미를 반영한 상태 점수의 차이를 토큰 거리와 변동폭의 비율에 기초하여 효과적으로 산출할 수 있게 된다. 전술한 예시는 본 개시를 구체적으로 서술하기 위한 일 실시예에 지나지 않으며, 본 개시는 다양한 실시예를 제한없이 포함한다.In an additional embodiment of the present disclosure, the processor 110 may correct the state score in such a way that the range of change in the state score due to correction increases as the token distance becomes closer. For example, based on the same token pair (tokens included in the state index - tokens included in the state secondary index), the change in state score when the token distance is 1 may be greater than the change in state score when the token distance is 3. . Refer again to the third and fourth product data described above, but assume that the status score given for “scratch” is '-5'. At this time, in order to increase the variation due to correction of the status score as the token distance becomes closer, the processor 110 uses a correction method using the negative word “no”. You can take the multiplication method. That is, in the third product data, the status score given to “scratch” has This can be multiplied. As a result, the corrected health score is It can be. Similarly, in the fourth product data, the status score assigned to “scratch” has This can be multiplied. The resulting corrected health score may be 5. In this way, the processor 110 according to the present disclosure corrects the status score based on the token as a result of comparison with the status auxiliary index, and additionally corrects the status score in inverse proportion to the distance, so that even words with the same modifier relationship have meaning according to the distance. It can be taken into account that the intensity varies. For example, the expression “there are no scratches” conveys the presence or absence of “scratches” more clearly than the expression “there are no scratches” above, and according to the present disclosure, the difference in status scores reflecting the above linguistic meaning is expressed as a token. It can be effectively calculated based on the ratio of distance and fluctuation width. The above-described example is merely an example for specifically describing the present disclosure, and the present disclosure includes various embodiments without limitation.

본 개시의 일 실시예에 따른 프로세서(110)는 가격 정보에 기초하되, 상태 정보에 추가적으로 기초하여 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 상태 정보에 포함된 상태 점수를 정규화하여 정규화된 상태 점수를 생성할 수 있다. 정규화된 상태 점수는 상태 점수의 평균, 분산 및 표준 편차 등의 통계량에 기초하여 산출될 수 있다. 정규화된 상태 점수는 전술한 수학식 1에 기초하여 각 상태 점수를 정규 분포로 변환하는 동작에 기초하여 산출될 수 있다. 프로세서(110)는 정규화된 상태 점수와 전술한 정규화된 가격 점수를 가중합하여 평가 정보를 생성할 수 있다. 상태 점수와 가격 점수의 가중합을 위한 가중 비율은 개인화를 위해 임의의 비율로 사전 결정될 수 있다. 상태 점수와 가격 점수의 가중합을 위한 가중치는 1:1 일 수도 있다. 일 실시예에 따라 상태 점수와 가격 점수가 각각 6점, 8점이라고 가정하자. 상태 점수와 가격 점수의 가중합을 위한 가중 비율이 3:7인 경우, 평가 점수는 6*0.3 + 8*0.7 = 0.74일 수 있다. 반면, 상태 점수와 가격 점수의 가중합을 위한 가중 비율이 6:4인 경우, 평가 점수는 6*0.6 + 8*0.4 = 0.68일 수 있다. 즉, 프로세서(110)는 동일하게 산출된 가격 점수 및 상태 점수라도 최종적인 가중 비율에 따라 상이한 평가 정보를 산출할 수 있다.The processor 110 according to an embodiment of the present disclosure may generate evaluation information corresponding to at least one product data included in the product data set based on price information and additionally based on status information. The processor 110 may generate a normalized state score by normalizing the state score included in the state information. The normalized status score can be calculated based on statistics such as the mean, variance, and standard deviation of the status score. The normalized state score can be calculated based on the operation of converting each state score into a normal distribution based on Equation 1 described above. The processor 110 may generate evaluation information by weighting the normalized status score and the above-described normalized price score. The weighting ratio for the weighted sum of the status score and the price score can be predetermined at an arbitrary ratio for personalization. The weight for the weighted sum of the status score and price score may be 1:1. According to one embodiment, let's assume that the status score and price score are 6 and 8, respectively. If the weighting ratio for the weighted sum of the status score and the price score is 3:7, the evaluation score may be 6*0.3 + 8*0.7 = 0.74. On the other hand, if the weighting ratio for the weighted sum of the status score and the price score is 6:4, the evaluation score may be 6*0.6 + 8*0.4 = 0.68. That is, the processor 110 may calculate different evaluation information according to the final weighting ratio even if the price score and status score are calculated identically.

본 개시의 일 실시예에 따른 프로세서(110)는 상품 데이터 셋에 기초하여 판매자 정보를 산출하고, 판매자 정보에 추가적으로 기초하여 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. The processor 110 according to an embodiment of the present disclosure calculates seller information based on the product data set and generates evaluation information corresponding to at least one product data included in the product data set based on the seller information additionally. You can.

프로세서(110)는 상품 데이터에 포함된 판매자의 활동 기간, 판매자 점수 등에 기초하여 판매자 정보를 산출할 수 있다. 프로세서(110)는 상품 데이터 셋으로부터 특정 판매자의 상품 데이터 최초 업로드 시각을 획득할 수 있다. 프로세서(110)는 상품 데이터 셋으로부터 특정 판매자의 상품 데이터 마지막 업로드 시각을 획득할 수 있다. 프로세서(110)는 특정 판매자의 최초 업로드 시각 및 마지막 업로드 시각의 차이에 기초하여 판매자의 활동 기간을 산출할 수 있다. 프로세서(110)는 판매자의 활동 기간이 길수록 높은 점수를 부여하고 이에 기초하여 판매자 정보를 산출할 수 있다. 프로세서(110)는 비정형 상품 텍스트의 메타 정보에 포함된 판매자별 판매자 점수를 획득하고 이에 기초하여 판매자 정보를 산출할 수 있다.The processor 110 may calculate seller information based on the seller's activity period, seller score, etc. included in the product data. The processor 110 may obtain the first upload time of product data of a specific seller from the product data set. The processor 110 may obtain the last upload time of product data of a specific seller from the product data set. The processor 110 may calculate the seller's activity period based on the difference between the first upload time and the last upload time of a specific seller. The processor 110 may assign a higher score the longer the seller's activity period is and calculate seller information based on this. The processor 110 may obtain a seller score for each seller included in the meta information of the unstructured product text and calculate seller information based on this.

프로세서(110)는 판매자 정보에 추가적으로 기초하여 상품 데이터 셋에 포함된 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 판매자 정보에 기초하여 평가 정보를 생성하기 위해 판매자 점수를 정규화할 수 있다. 정규화된 판매자 점수는 전술한 가격 점수 또는 상태 점수와 유사한 방법으로 산출될 수 있는 바 이하 설명을 생략한다. 프로세서(110)는 상태 점수, 가격 점수 및 판매자 점수의 상대적인 비율에 기초하여 최종 평가 정보를 산출할 수 있다. 프로세서(110)는 상태 점수, 가격 점수 및 판매자 점수 각각에 가중치를 할당하고, 할당된 가중치에 기초하여 평가 정보를 산출할 수 있다. 일례로, 상태 점수, 가격 점수 및 판매자 점수에 대한 가중치는 각각 1:1:1의 비율로 설정될 수 있다. 본 개시의 평가 정보를 생성하기 위한 적어도 두 점수 사이의 상대적인 비율(즉, 가중치)은 임의의 비율로 사전 결정될 수 있다. 프로세서(110)는 점수 각각에 할당되는 가중치를 임의로 결정하고 할당된 가중치에 기초하여 평가 정보를 산출할 수 있다. 여기서 임의로 가중치를 결정하는 방식은 예를 들어, 컴퓨팅 장치(100)의 입력부를 통한 사용자의 입력에 기초하여 수행될 수 있다. 프로세서(110)는 동일하게 산출된 가격 점수, 상태 점수 및 판매자 점수에 있어서, 가중 비율에 따라 상이한 평가 정보를 산출할 수 있다. 이러한 본 개시는 특정 사용자의 가격 선호, 상태 선호 또는 판매자 선호 등과 같은 성향을 고려하여 최종적으로 평가 정보를 산출할 수 있는 장점을 갖는다.The processor 110 may generate evaluation information corresponding to product data included in the product data set based additionally on the seller information. Processor 110 may normalize seller scores to generate evaluation information based on seller information. Since the normalized seller score can be calculated in a similar way to the above-described price score or status score, the description below is omitted. Processor 110 may calculate final evaluation information based on the relative ratio of condition score, price score, and seller score. The processor 110 may assign weights to each of the status score, price score, and seller score, and calculate evaluation information based on the assigned weights. For example, the weights for the condition score, price score, and seller score may each be set at a ratio of 1:1:1. The relative ratio (i.e., weight) between at least two scores for generating evaluation information of the present disclosure may be predetermined at an arbitrary ratio. The processor 110 may randomly determine the weight assigned to each score and calculate evaluation information based on the assigned weight. Here, the method of arbitrarily determining the weight may be performed, for example, based on a user's input through the input unit of the computing device 100. The processor 110 may calculate different evaluation information according to the weighting ratio in the price score, condition score, and seller score calculated equally. This disclosure has the advantage of being able to finally calculate evaluation information by considering a specific user's tendencies such as price preference, status preference, or seller preference.

도 5는 본 개시의 일 실시예에 따라 상품 정보를 평가하기 위한 과정을 나타내는 순서도이다. 본 개시에 따른 프로세서(110)는 단계 S610에서 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성할 수 있다. 프로세서(110)는 평가 인덱스를 포함하는 평가 리스트에 기초하여 정형화된 상품 데이터를 생성할 수 있다. 평가 인덱스에는 각각 하나 이상의 비교 대상 토큰이 할당되며, 프로세서(110)에 의한 비교 결과, 비교 대상 토큰들 중 비정형 상품 텍스트 내에 포함된 것으로 결정된 토큰은 '비교 결과 토큰'으로서 상품 데이터에 포함될 수 있다. 단계 S630에서 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출할 수 있다. 프로세서(110)는 가격에 따른 상품 수량 분포 및 적어도 하나의 가격 기준값에 기초하여 가격 정보를 산출할 수 있다. 가격 기준값은 상품 수량 분포로부터 계산되는 통계값에 기초하여 결정될 수 있다. 가격 기준값은 상품 가격에 따른 실제 판매 수량에 기초하여 결정될 수 있다. 가격 기준값은 상품 수량 분포로부터 계산되는 통계값에 기초하되, 상품 가격에 따른 실제 판매 수량에 추가적으로 기초하여 결정될 수도 있다. 단계 S650에서 프로세서(110)는 산출된 가격 정보에 기초하여 상기 상품 데이터 셋에 포함된 적어도 하나의 상품 데이터에 대응되는 평가 정보를 생성할 수 있다. 프로세서(110)는 가격 정보에 포함된 가격 점수를 정규화하여 전체적인 분포를 고려한 평가 점수를 생성할 수 있다.Figure 5 is a flowchart showing a process for evaluating product information according to an embodiment of the present disclosure. The processor 110 according to the present disclosure may generate standardized product data corresponding to unstructured product text in step S610. The processor 110 may generate standardized product data based on an evaluation list including an evaluation index. One or more comparison target tokens are assigned to each evaluation index, and as a result of comparison by the processor 110, a token determined to be included in the unstructured product text among the comparison target tokens may be included in the product data as a 'comparison result token'. In step S630, the processor 110 may calculate price information based on a product data set including a plurality of product data. The processor 110 may calculate price information based on product quantity distribution according to price and at least one price reference value. The price reference value may be determined based on statistical values calculated from product quantity distribution. The price reference value may be determined based on the actual sales quantity according to the product price. The price reference value is based on a statistical value calculated from the product quantity distribution, but may be determined additionally based on the actual sales quantity according to the product price. In step S650, the processor 110 may generate evaluation information corresponding to at least one product data included in the product data set based on the calculated price information. The processor 110 may normalize the price scores included in the price information to generate an evaluation score that takes the overall distribution into consideration.

도 6은 본 개시의 다른 일 실시예에 따라 상품 정보를 평가하기 위한 과정을 나타내는 순서도이다. 프로세서(110)는 단계 S710에서 비정형 상품 텍스트에 대응되는 정형화된 상품 데이터를 생성할 수 있다. 단계 S730에서 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 가격 정보를 산출할 수 있다. 상기 S710 및 S730은 도 5의 S610 및 S630과 유사한 단계들로 구성될 수 있다. 단계 S750에서 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 상태 정보를 산출할 수 있다. 상태 정보는 상품 데이터에서 상태 인덱스에 해당하는 비교 결과 토큰에 기초하여 산출될 수 있다. 프로세서(110)는 상품 데이터 내에서 상태 보조 인덱스에 해당하는 비교 결과 토큰에 기초하여 산출된 상태 점수를 보정할 수 있다. 상태 보조 인덱스에 포함된 비교 결과 토큰은 상태 인덱스에 포함된 비교 결과 토큰을 수식하기 위한 문자열일 수 있다. 프로세서(110)는 상태 인덱스 및 상태 보조 인덱스 각각에 해당하는 비교 결과 토큰 사이의 토큰 거리에 기초하여 상태 점수를 보정할 수 있다. 단계 S770에서 프로세서(110)는 복수의 상품 데이터를 포함하는 상품 데이터 셋에 기초하여 판매자 정보를 산출할 수 있다. 판매자 정보는 판매자의 활동 기간, 판매자 점수 등에 기초하여 산출될 수 있다. 단계 S790에서 프로세서(110)는 가격 정보, 상태 정보 및 판매자 정보에 기초하여 평가 정보를 생성할 수 있다. 프로세서(110)는 산출된 가격 점수, 상태 점수 및 판매자 점수를 각각 정규화하여 합산 가능한 수치 데이터로 변환할 수 있다. 프로세서(110)는 정규화된 가격 점수, 상태 점수 및 판매자 점수를 사전 결정된 비율로 가중합하여 최종 평가 점수를 산출할 수 있다. 사전 결정된 비율은 사용자에 따라 서로 다른 비율로 결정될 수 있다.Figure 6 is a flowchart showing a process for evaluating product information according to another embodiment of the present disclosure. The processor 110 may generate standardized product data corresponding to the unstructured product text in step S710. In step S730, the processor 110 may calculate price information based on a product data set including a plurality of product data. S710 and S730 may consist of steps similar to S610 and S630 of FIG. 5. In step S750, the processor 110 may calculate status information based on a product data set including a plurality of product data. Status information can be calculated based on a comparison result token corresponding to the status index in the product data. The processor 110 may correct the status score calculated based on the comparison result token corresponding to the status auxiliary index in the product data. The comparison result token included in the state auxiliary index may be a string for modifying the comparison result token included in the state index. The processor 110 may correct the state score based on the token distance between the comparison result tokens corresponding to each of the state index and the state auxiliary index. In step S770, the processor 110 may calculate seller information based on a product data set including a plurality of product data. Seller information may be calculated based on the seller's activity period, seller score, etc. In step S790, the processor 110 may generate evaluation information based on price information, status information, and seller information. The processor 110 may normalize the calculated price score, status score, and seller score, respectively, and convert them into numerical data that can be summed. The processor 110 may calculate the final evaluation score by weighting the normalized price score, condition score, and seller score at a predetermined ratio. The predetermined ratio may be determined to be a different ratio depending on the user.

도 7은 본 개시의 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도이다. 본 개시가 일반적으로 컴퓨팅 장치에 의해 구현될 수 있는 것으로 전술되었지만, 당업자라면 본 개시가 하나 이상의 컴퓨터 상에서 실행될 수 있는 컴퓨터 실행가능 명령어 및/또는 기타 프로그램 모듈들과 결합되어 및/또는 하드웨어와 소프트웨어의 조합으로써 구현될 수 있다는 것을 잘 알 것이다.7 is a brief, general schematic diagram of an example computing environment in which embodiments of the present disclosure may be implemented. Although the present disclosure has generally been described above as being capable of being implemented by a computing device, those skilled in the art will understand that the present disclosure can be implemented in combination with computer-executable instructions and/or other program modules that can be executed on one or more computers and/or in hardware and software. It will be well known that it can be implemented as a combination.

일반적으로, 프로그램 모듈은 특정의 태스크를 수행하거나 특정의 추상 데이터 유형을 구현하는 루틴, 프로그램, 컴포넌트, 데이터 구조, 기타 등등을 포함한다. 또한, 당업자라면 본 개시의 방법이 단일-프로세서 또는 멀티프로세서 컴퓨터 시스템, 미니컴퓨터, 메인프레임 컴퓨터는 물론 퍼스널 컴퓨터, 핸드헬드 컴퓨팅 장치, 마이크로프로세서-기반 또는 프로그램가능 가전 제품, 기타 등등(이들 각각은 하나 이상의 연관된 장치와 연결되어 동작할 수 있음)을 비롯한 다른 컴퓨터 시스템 구성으로 실시될 수 있다는 것을 잘 알 것이다.Typically, program modules include routines, programs, components, data structures, etc. that perform specific tasks or implement specific abstract data types. Additionally, those skilled in the art will understand that the methods of the present disclosure are applicable to single-processor or multiprocessor computer systems, minicomputers, mainframe computers, as well as personal computers, handheld computing devices, microprocessor-based or programmable consumer electronics, and the like (each of which may be It will be appreciated that the system may be implemented with other computer system configurations, including those capable of operating in conjunction with one or more associated devices.

본 개시의 설명된 실시예들은 또한 어떤 태스크들이 통신 네트워크를 통해 연결되어 있는 원격 처리 장치들에 의해 수행되는 분산 컴퓨팅 환경에서 실시될 수 있다. 분산 컴퓨팅 환경에서, 프로그램 모듈은 로컬 및 원격 메모리 저장 장치 둘 다에 위치할 수 있다.The described embodiments of the disclosure can also be practiced in distributed computing environments where certain tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.

컴퓨터는 통상적으로 다양한 컴퓨터 판독가능 매체를 포함한다. 컴퓨터에 의해 액세스 가능한 매체는 그 어떤 것이든지 컴퓨터 판독가능 매체가 될 수 있고, 이러한 컴퓨터 판독가능 매체는 휘발성 및 비휘발성 매체, 일시적(transitory) 및 비일시적(non-transitory) 매체, 이동식 및 비-이동식 매체를 포함한다. 제한이 아닌 예로서, 컴퓨터 판독가능 매체는 컴퓨터 판독가능 저장 매체 및 컴퓨터 판독가능 전송 매체를 포함할 수 있다. 컴퓨터 판독가능 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보를 저장하는 임의의 방법 또는 기술로 구현되는 휘발성 및 비휘발성 매체, 일시적 및 비-일시적 매체, 이동식 및 비이동식 매체를 포함한다. 컴퓨터 판독가능 저장 매체는 RAM, ROM, EEPROM, 플래시 메모리 또는 기타 메모리 기술, CD-ROM, DVD(digital video disk) 또는 기타 광 디스크 저장 장치, 자기 카세트, 자기 테이프, 자기 디스크 저장 장치 또는 기타 자기 저장 장치, 또는 컴퓨터에 의해 액세스될 수 있고 원하는 정보를 저장하는 데 사용될 수 있는 임의의 기타 매체를 포함하지만, 이에 한정되지 않는다.Computers typically include a variety of computer-readable media. Computer-readable media can be any medium that can be accessed by a computer, and such computer-readable media includes volatile and non-volatile media, transitory and non-transitory media, removable and non-transitory media. Includes removable media. By way of example, and not limitation, computer-readable media may include computer-readable storage media and computer-readable transmission media. Computer-readable storage media refers to volatile and non-volatile media, transient and non-transitory media, removable and non-removable, implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Includes media. Computer readable storage media may include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital video disk (DVD) or other optical disk storage, magnetic cassette, magnetic tape, magnetic disk storage or other magnetic storage. This includes, but is not limited to, a device, or any other medium that can be accessed by a computer and used to store desired information.

컴퓨터 판독가능 전송 매체는 통상적으로 반송파(carrier wave) 또는 기타 전송 메커니즘(transport mechanism)과 같은 피변조 데이터 신호(modulated data signal)에 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터 등을 구현하고 모든 정보 전달 매체를 포함한다. 피변조 데이터 신호라는 용어는 신호 내에 정보를 인코딩하도록 그 신호의 특성들 중 하나 이상을 설정 또는 변경시킨 신호를 의미한다. 제한이 아닌 예로서, 컴퓨터 판독가능 전송 매체는 유선 네트워크 또는 직접 배선 접속(direct-wired connection)과 같은 유선 매체, 그리고 음향, RF, 적외선, 기타 무선 매체와 같은 무선 매체를 포함한다. 상술된 매체들 중 임의의 것의 조합도 역시 컴퓨터 판독가능 전송 매체의 범위 안에 포함되는 것으로 한다.A computer-readable transmission medium typically implements computer-readable instructions, data structures, program modules, or other data on a modulated data signal, such as a carrier wave or other transport mechanism. Includes all information delivery media. The term modulated data signal refers to a signal in which one or more of the characteristics of the signal have been set or changed to encode information within the signal. By way of example, and not limitation, computer-readable transmission media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media. Combinations of any of the above are also intended to be included within the scope of computer-readable transmission media.

컴퓨터(1102)를 포함하는 본 개시의 여러가지 측면들을 구현하는 예시적인 환경(1100)이 나타내어져 있으며, 컴퓨터(1102)는 처리 장치(1104), 시스템 메모리(1106) 및 시스템 버스(1108)를 포함한다. 시스템 버스(1108)는 시스템 메모리(1106)(이에 한정되지 않음)를 비롯한 시스템 컴포넌트들을 처리 장치(1104)에 연결시킨다. 처리 장치(1104)는 다양한 상용 프로세서들 중 임의의 프로세서일 수 있다. 듀얼 프로세서 및 기타 멀티프로세서 아키텍처도 역시 처리 장치(1104)로서 이용될 수 있다.An example environment 1100 is shown that implements various aspects of the present disclosure, including a computer 1102, which includes a processing unit 1104, a system memory 1106, and a system bus 1108. do. System bus 1108 couples system components, including but not limited to system memory 1106, to processing unit 1104. Processing unit 1104 may be any of a variety of commercially available processors. Dual processors and other multiprocessor architectures may also be used as processing unit 1104.

시스템 버스(1108)는 메모리 버스, 주변장치 버스, 및 다양한 상용 버스 아키텍처 중 임의의 것을 사용하는 로컬 버스에 추가적으로 상호 연결될 수 있는 몇 가지 유형의 버스 구조 중 임의의 것일 수 있다. 시스템 메모리(1106)는 판독 전용 메모리(ROM)(1110) 및 랜덤 액세스 메모리(RAM)(1112)를 포함한다. 기본 입/출력 시스템(BIOS)은 ROM, EPROM, EEPROM 등의 비휘발성 메모리(1110)에 저장되며, 이 BIOS는 시동 중과 같은 때에 컴퓨터(1102) 내의 구성요소들 간에 정보를 전송하는 일을 돕는 기본적인 루틴을 포함한다. RAM(1112)은 또한 데이터를 캐싱하기 위한 정적 RAM 등의 고속 RAM을 포함할 수 있다.System bus 1108 may be any of several types of bus structures that may further be interconnected to a memory bus, peripheral bus, and local bus using any of a variety of commercial bus architectures. System memory 1106 includes read only memory (ROM) 1110 and random access memory (RAM) 1112. The basic input/output system (BIOS) is stored in non-volatile memory 1110, such as ROM, EPROM, and EEPROM, and is a basic input/output system that helps transfer information between components within the computer 1102, such as during startup. Contains routines. RAM 1112 may also include high-speed RAM, such as static RAM, for caching data.

컴퓨터(1102)는 또한 내장형 하드 디스크 드라이브(HDD)(1114)(예를 들어, EIDE, SATA)－이 내장형 하드 디스크 드라이브(1114)는 또한 적당한 섀시(도시 생략) 내에서 외장형 용도로 구성될 수 있음－, 자기 플로피 디스크 드라이브(FDD)(1116)(예를 들어, 이동식 디스켓(1118)으로부터 판독을 하거나 그에 기록을 하기 위한 것임), 및 광 디스크 드라이브(1120)(예를 들어, CD-ROM 디스크(1122)를 판독하거나 DVD 등의 기타 고용량 광 매체로부터 판독을 하거나 그에 기록을 하기 위한 것임)를 포함한다. 하드 디스크 드라이브(1114), 자기 디스크 드라이브(1116) 및 광 디스크 드라이브(1120)는 각각 하드 디스크 드라이브 인터페이스(1124), 자기 디스크 드라이브 인터페이스(1126) 및 광 드라이브 인터페이스(1128)에 의해 시스템 버스(1108)에 연결될 수 있다. 외장형 드라이브 구현을 위한 인터페이스(1124)는 USB(Universal Serial Bus) 및 IEEE 1394 인터페이스 기술 중 적어도 하나 또는 그 둘 다를 포함한다.Computer 1102 may also include an internal hard disk drive (HDD) 1114 (e.g., EIDE, SATA)—the internal hard disk drive 1114 may also be configured for external use within a suitable chassis (not shown). Yes - a magnetic floppy disk drive (FDD) 1116 (e.g., for reading from or writing to a removable diskette 1118), and an optical disk drive 1120 (e.g., a CD-ROM for reading the disk 1122 or reading from or writing to other high-capacity optical media such as DVDs). Hard disk drive 1114, magnetic disk drive 1116, and optical disk drive 1120 are connected to system bus 1108 by hard disk drive interface 1124, magnetic disk drive interface 1126, and optical drive interface 1128, respectively. ) can be connected to. The interface 1124 for implementing an external drive includes at least one or both of Universal Serial Bus (USB) and IEEE 1394 interface technologies.

이들 드라이브 및 그와 연관된 컴퓨터 판독가능 매체는 데이터, 데이터 구조, 컴퓨터 실행가능 명령어, 기타 등등의 비휘발성 저장을 제공한다. 컴퓨터(1102)의 경우, 드라이브 및 매체는 임의의 데이터를 적당한 디지털 형식으로 저장하는 것에 대응한다. 상기에서의 컴퓨터 판독가능 매체에 대한 설명이 HDD, 이동식 자기 디스크, 및 CD 또는 DVD 등의 이동식 광 매체를 언급하고 있지만, 당업자라면 집 드라이브(zip drive), 자기 카세트, 플래쉬 메모리 카드, 카트리지, 기타 등등의 컴퓨터에 의해 판독가능한 다른 유형의 매체도 역시 예시적인 운영 환경에서 사용될 수 있으며 또 임의의 이러한 매체가 본 개시의 방법들을 수행하기 위한 컴퓨터 실행가능 명령어를 포함할 수 있다는 것을 잘 알 것이다.These drives and their associated computer-readable media provide non-volatile storage of data, data structures, computer-executable instructions, and the like. For computer 1102, drive and media correspond to storing any data in a suitable digital format. Although the description of computer-readable media above refers to removable optical media such as HDDs, removable magnetic disks, and CDs or DVDs, those skilled in the art will also recognize removable optical media such as zip drives, magnetic cassettes, flash memory cards, cartridges, etc. It will be appreciated that other types of computer-readable media may also be used in the example operating environment, and that any such media may contain computer-executable instructions for performing the methods of the present disclosure.

운영 체제(1130), 하나 이상의 애플리케이션 프로그램(1132), 기타 프로그램 모듈(1134) 및 프로그램 데이터(1136)를 비롯한 다수의 프로그램 모듈이 드라이브 및 RAM(1112)에 저장될 수 있다. 운영 체제, 애플리케이션, 모듈 및/또는 데이터의 전부 또는 그 일부분이 또한 RAM(1112)에 캐싱될 수 있다. 본 개시가 여러가지 상업적으로 이용가능한 운영 체제 또는 운영 체제들의 조합에서 구현될 수 있다는 것을 잘 알 것이다.A number of program modules may be stored in drives and RAM 1112, including an operating system 1130, one or more application programs 1132, other program modules 1134, and program data 1136. All or portions of the operating system, applications, modules and/or data may also be cached in RAM 1112. It will be appreciated that the present disclosure may be implemented on various commercially available operating systems or combinations of operating systems.

사용자는 하나 이상의 유선/무선 입력 장치, 예를 들어, 키보드(1138) 및 마우스(1140) 등의 포인팅 장치를 통해 컴퓨터(1102)에 명령 및 정보를 입력할 수 있다. 기타 입력 장치(도시 생략)로는 마이크, IR 리모콘, 조이스틱, 게임 패드, 스타일러스 펜, 터치 스크린, 기타 등등이 있을 수 있다. 이들 및 기타 입력 장치가 종종 시스템 버스(1108)에 연결되어 있는 입력 장치 인터페이스(1142)를 통해 처리 장치(1104)에 연결되지만, 병렬 포트, IEEE 1394 직렬 포트, 게임 포트, USB 포트, IR 인터페이스, 기타 등등의 기타 인터페이스에 의해 연결될 수 있다.A user may enter commands and information into computer 1102 through one or more wired/wireless input devices, such as a keyboard 1138 and a pointing device such as mouse 1140. Other input devices (not shown) may include microphones, IR remote controls, joysticks, game pads, stylus pens, touch screens, etc. These and other input devices are connected to the processing unit 1104 through an input device interface 1142, which is often connected to the system bus 1108, but may also include a parallel port, an IEEE 1394 serial port, a game port, a USB port, an IR interface, It can be connected by other interfaces, etc.

모니터(1144) 또는 다른 유형의 디스플레이 장치도 역시 비디오 어댑터(1146) 등의 인터페이스를 통해 시스템 버스(1108)에 연결된다. 모니터(1144)에 부가하여, 컴퓨터는 일반적으로 스피커, 프린터, 기타 등등의 기타 주변 출력 장치(도시 생략)를 포함한다.A monitor 1144 or other type of display device is also connected to system bus 1108 through an interface, such as a video adapter 1146. In addition to monitor 1144, computers typically include other peripheral output devices (not shown) such as speakers, printers, etc.

컴퓨터(1102)는 유선 및/또는 무선 통신을 통한 원격 컴퓨터(들)(1148) 등의 하나 이상의 원격 컴퓨터로의 논리적 연결을 사용하여 네트워크화된 환경에서 동작할 수 있다. 원격 컴퓨터(들)(1148)는 워크스테이션, 컴퓨팅 디바이스 컴퓨터, 라우터, 퍼스널 컴퓨터, 휴대용 컴퓨터, 마이크로프로세서-기반 오락 기기, 피어 장치 또는 기타 통상의 네트워크 노드일 수 있으며, 일반적으로 컴퓨터(1102)에 대해 기술된 구성요소들 중 다수 또는 그 전부를 포함하지만, 간략함을 위해, 메모리 저장 장치(1150)만이 도시되어 있다. 도시되어 있는 논리적 연결은 근거리 통신망(LAN)(1152) 및/또는 더 큰 네트워크, 예를 들어, 원거리 통신망(WAN)(1154)에의 유선/무선 연결을 포함한다. 이러한 LAN 및 WAN 네트워킹 환경은 사무실 및 회사에서 일반적인 것이며, 인트라넷 등의 전사적 컴퓨터 네트워크(enterprise-wide computer network)를 용이하게 해주며, 이들 모두는 전세계 컴퓨터 네트워크, 예를 들어, 인터넷에 연결될 수 있다.Computer 1102 may operate in a networked environment using logical connections to one or more remote computers, such as remote computer(s) 1148, via wired and/or wireless communications. Remote computer(s) 1148 may be a workstation, computing device computer, router, personal computer, portable computer, microprocessor-based entertainment device, peer device, or other conventional network node, and is generally connected to computer 1102. For simplicity, only memory storage device 1150 is shown, although it includes many or all of the components described. The logical connections depicted include wired/wireless connections to a local area network (LAN) 1152 and/or a larger network, such as a wide area network (WAN) 1154. These LAN and WAN networking environments are common in offices and companies and facilitate enterprise-wide computer networks, such as intranets, all of which can be connected to a worldwide computer network, such as the Internet.

LAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 유선 및/또는 무선 통신 네트워크 인터페이스 또는 어댑터(1156)를 통해 로컬 네트워크(1152)에 연결된다. 어댑터(1156)는 LAN(1152)에의 유선 또는 무선 통신을 용이하게 해줄 수 있으며, 이 LAN(1152)은 또한 무선 어댑터(1156)와 통신하기 위해 그에 설치되어 있는 무선 액세스 포인트를 포함하고 있다. WAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 모뎀(1158)을 포함할 수 있거나, WAN(1154) 상의 통신 컴퓨팅 디바이스에 연결되거나, 또는 인터넷을 통하는 등, WAN(1154)을 통해 통신을 설정하는 기타 수단을 갖는다. 내장형 또는 외장형 및 유선 또는 무선 장치일 수 있는 모뎀(1158)은 직렬 포트 인터페이스(1142)를 통해 시스템 버스(1108)에 연결된다. 네트워크화된 환경에서, 컴퓨터(1102)에 대해 설명된 프로그램 모듈들 또는 그의 일부분이 원격 메모리/저장 장치(1150)에 저장될 수 있다. 도시된 네트워크 연결이 예시적인 것이며 컴퓨터들 사이에 통신 링크를 설정하는 기타 수단이 사용될 수 있다는 것을 잘 알 것이다.When used in a LAN networking environment, computer 1102 is connected to local network 1152 through wired and/or wireless communication network interfaces or adapters 1156. Adapter 1156 may facilitate wired or wireless communication to LAN 1152, which also includes a wireless access point installed thereon for communicating with wireless adapter 1156. When used in a WAN networking environment, the computer 1102 may include a modem 1158 or be connected to a communicating computing device on the WAN 1154 or to establish communications over the WAN 1154, such as via the Internet. Have other means. Modem 1158, which may be internal or external and a wired or wireless device, is coupled to system bus 1108 via serial port interface 1142. In a networked environment, program modules described for computer 1102, or portions thereof, may be stored in remote memory/storage device 1150. It will be appreciated that the network connections shown are exemplary and that other means of establishing a communications link between computers may be used.

컴퓨터(1102)는 무선 통신으로 배치되어 동작하는 임의의 무선 장치 또는 개체, 예를 들어, 프린터, 스캐너, 데스크톱 및/또는 휴대용 컴퓨터, PDA(portable data assistant), 통신 위성, 무선 검출가능 태그와 연관된 임의의 장비 또는 장소, 및 전화와 통신을 하는 동작을 한다. 이것은 적어도 Wi-Fi 및 블루투스 무선 기술을 포함한다. 따라서, 통신은 종래의 네트워크에서와 같이 미리 정의된 구조이거나 단순하게 적어도 2개의 장치 사이의 애드혹 통신(ad hoc communication)일 수 있다.Computer 1102 may be associated with any wireless device or object deployed and operating in wireless communications, such as a printer, scanner, desktop and/or portable computer, portable data assistant (PDA), communications satellite, wirelessly detectable tag. Performs actions to communicate with any device or location and telephone. This includes at least Wi-Fi and Bluetooth wireless technologies. Accordingly, communication may be a predefined structure as in a conventional network or may simply be ad hoc communication between at least two devices.

Wi-Fi(Wireless Fidelity)는 유선 없이도 인터넷 등으로의 연결을 가능하게 해준다. Wi-Fi는 이러한 장치, 예를 들어, 컴퓨터가 실내에서 및 실외에서, 즉 기지국의 통화권 내의 아무 곳에서나 데이터를 전송 및 수신할 수 있게 해주는 셀 전화와 같은 무선 기술이다. Wi-Fi 네트워크는 안전하고 신뢰성 있으며 고속인 무선 연결을 제공하기 위해 IEEE 802.11(a, b, g, 기타)이라고 하는 무선 기술을 사용한다. 컴퓨터를 서로에, 인터넷에 및 유선 네트워크(IEEE 802.3 또는 이더넷을 사용함)에 연결시키기 위해 Wi-Fi가 사용될 수 있다. Wi-Fi 네트워크는 비인가 2.4 및 5GHz 무선 대역에서, 예를 들어, 11Mbps(802.11a) 또는 54 Mbps(802.11b) 데이터 레이트로 동작하거나, 양 대역(듀얼 대역)을 포함하는 제품에서 동작할 수 있다.Wi-Fi (Wireless Fidelity) allows connection to the Internet, etc. without wires. Wi-Fi is a wireless technology, like cell phones, that allows these devices, such as computers, to send and receive data indoors and outdoors, anywhere within the coverage area of a base station. Wi-Fi networks use wireless technology called IEEE 802.11 (a, b, g, etc.) to provide secure, reliable, and high-speed wireless connections. Wi-Fi can be used to connect computers to each other, the Internet, and wired networks (using IEEE 802.3 or Ethernet). Wi-Fi networks can operate in the unlicensed 2.4 and 5 GHz wireless bands, for example, at data rates of 11 Mbps (802.11a) or 54 Mbps (802.11b), or in products that include both bands (dual band). .

본 개시의 기술 분야에서 통상의 지식을 가진 자는 정보 및 신호들이 임의의 다양한 상이한 기술들 및 기법들을 이용하여 표현될 수 있다는 것을 이해할 것이다. 예를 들어, 위의 설명에서 참조될 수 있는 데이터, 지시들, 명령들, 정보, 신호들, 비트들, 심볼들 및 칩들은 전압들, 전류들, 전자기파들, 자기장들 또는 입자들, 광학장들 또는 입자들, 또는 이들의 임의의 결합에 의해 표현될 수 있다.Those skilled in the art will understand that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols and chips that may be referenced in the above description include voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields. It can be expressed by particles or particles, or any combination thereof.

본 개시의 기술 분야에서 통상의 지식을 가진 자는 여기에 개시된 실시예들과 관련하여 설명된 다양한 예시적인 논리 블록들, 모듈들, 프로세서들, 수단들, 회로들 및 알고리즘 단계들이 전자 하드웨어, (편의를 위해, 여기에서 소프트웨어로 지칭되는) 다양한 형태들의 프로그램 또는 설계 코드 또는 이들 모두의 결합에 의해 구현될 수 있다는 것을 이해할 것이다. 하드웨어 및 소프트웨어의 이러한 상호 호환성을 명확하게 설명하기 위해, 다양한 예시적인 컴포넌트들, 블록들, 모듈들, 회로들 및 단계들이 이들의 기능과 관련하여 위에서 일반적으로 설명되었다. 이러한 기능이 하드웨어 또는 소프트웨어로서 구현되는지 여부는 특정한 애플리케이션 및 전체 시스템에 대하여 부과되는 설계 제약들에 따라 좌우된다. 본 개시의 기술 분야에서 통상의 지식을 가진 자는 각각의 특정한 애플리케이션에 대하여 다양한 방식들로 설명된 기능을 구현할 수 있으나, 이러한 구현 결정들은 본 개시의 범위를 벗어나는 것으로 해석되어서는 안 될 것이다.Those skilled in the art will understand that the various illustrative logical blocks, modules, processors, means, circuits and algorithm steps described in connection with the embodiments disclosed herein may be used in electronic hardware, (for convenience) It will be understood that it may be implemented by various forms of program or design code (referred to herein as software) or a combination of both. To clearly illustrate this interoperability of hardware and software, various illustrative components, blocks, modules, circuits and steps have been described above generally with respect to their functionality. Whether this functionality is implemented as hardware or software depends on the specific application and design constraints imposed on the overall system. A person skilled in the art of this disclosure may implement the described functionality in various ways for each specific application, but such implementation decisions should not be construed as departing from the scope of this disclosure.

여기서 제시된 다양한 실시예들은 방법, 장치, 또는 표준 프로그래밍 및/또는 엔지니어링 기술을 사용한 제조 물품(article)으로 구현될 수 있다. 용어 제조 물품은 임의의 컴퓨터-판독가능 저장장치로부터 액세스 가능한 컴퓨터 프로그램, 캐리어, 또는 매체(media)를 포함한다. 예를 들어, 컴퓨터-판독가능 저장매체는 자기 저장 장치(예를 들면, 하드 디스크, 플로피 디스크, 자기 스트립, 등), 광학 디스크(예를 들면, CD, DVD, 등), 스마트 카드, 및 플래쉬 메모리 장치(예를 들면, EEPROM, 카드, 스틱, 키 드라이브, 등)를 포함하지만, 이들로 제한되는 것은 아니다. 또한, 여기서 제시되는 다양한 저장 매체는 정보를 저장하기 위한 하나 이상의 장치 및/또는 다른 기계-판독가능한 매체를 포함한다.The various embodiments presented herein may be implemented as a method, apparatus, or article of manufacture using standard programming and/or engineering techniques. The term article of manufacture includes a computer program, carrier, or media accessible from any computer-readable storage device. For example, computer-readable storage media include magnetic storage devices (e.g., hard disks, floppy disks, magnetic strips, etc.), optical disks (e.g., CDs, DVDs, etc.), smart cards, and flash. Includes, but is not limited to, memory devices (e.g., EEPROM, cards, sticks, key drives, etc.). Additionally, various storage media presented herein include one or more devices and/or other machine-readable media for storing information.

제시된 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조는 예시적인 접근들의 일례임을 이해하도록 한다. 설계 우선순위들에 기반하여, 본 개시의 범위 내에서 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조가 재배열될 수 있다는 것을 이해하도록 한다. 첨부된 방법 청구항들은 샘플 순서로 다양한 단계들의 엘리먼트들을 제공하지만 제시된 특정한 순서 또는 계층 구조에 한정되는 것을 의미하지는 않는다.It is to be understood that the specific order or hierarchy of steps in the processes presented is an example of illustrative approaches. It is to be understood that the specific order or hierarchy of steps in processes may be rearranged within the scope of the present disclosure, based on design priorities. The appended method claims present elements of the various steps in a sample order but are not meant to be limited to the particular order or hierarchy presented.

제시된 실시예들에 대한 설명은 임의의 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 개시를 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이며, 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 개시는 여기에 제시된 실시예들로 한정되는 것이 아니라, 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다.The description of the presented embodiments is provided to enable any person skilled in the art to make or use the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments without departing from the scope of the disclosure. Thus, the present disclosure is not limited to the embodiments presented herein but is to be interpreted in the broadest scope consistent with the principles and novel features presented herein.

Claims

A product information evaluation method performed by a computing device including at least one processor, comprising:
Generating standardized product data corresponding to unstructured product text;
Calculating price information based on a product data set including a plurality of product data;
calculating status information based on the product data set; and
generating evaluation information corresponding to at least one product data included in the product data set based on the price information and additionally based on the status information;
Including,
The step of calculating the status information is,
calculating a state score for each product data included in the product data set based on at least one state index; and
correcting the calculated health score based on at least one health assistance index;
Including,
How to evaluate product information.

According to claim 1,
The step of generating the product data is,
generating standardized product data from the unstructured product text based on an evaluation list including at least one evaluation index, each of the at least one evaluation index being assigned at least one token to be compared;
Including,
How to evaluate product information.

According to claim 2,
The product data is,
Contains at least one comparison result token,
The token as a result of the comparison is,
Obtained based on the results of comparing the comparison target token assigned to the evaluation index with the unstructured product text,
How to evaluate product information.

According to claim 1,
calculating authenticity information for each of a plurality of product data included in the product data set; and
removing at least one product data from the product data set based on the authenticity information;
Containing more,
How to evaluate product information.

According to claim 4,
The step of calculating the authenticity information is,
inspecting the unstructured product text based on a fake index included in an evaluation list for evaluating product information;
Including,
How to evaluate product information.

According to claim 4,
The step of calculating the authenticity information is,
Carried out based on the above price information,
How to evaluate product information.

According to claim 1,
The step of calculating the price information is,
Calculating product quantity distribution according to product price; and
calculating price information based on the product quantity distribution and at least one price reference value;
Including,
How to evaluate product information.

According to claim 7,
The at least one price reference value is,
Determined based on statistical values calculated from the product quantity distribution,
How to evaluate product information.

According to claim 7,
The at least one price reference value is,
Determined based on the actual sales quantity distribution according to product price,
How to evaluate product information.

According to claim 7,
The at least one price reference value is,
Based on statistical values calculated from the product quantity distribution,
Determined additionally based on the actual sales quantity distribution according to product price,
How to evaluate product information.

delete

According to claim 1,
The comparison result token included in the state auxiliary index is,
A string that modifies the comparison result token included in the status index,
How to evaluate product information.

According to claim 1,
The step of correcting the status score is,
Performed based on the token distance between the comparison result token corresponding to the state index and the comparison result token corresponding to the state auxiliary index,
How to evaluate product information.

According to claim 14,
The step of correcting the status score based on the token distance is,
This is performed in such a way that the closer the token distance is, the greater the change in status score due to correction.
How to evaluate product information.

According to claim 1,
calculating seller information based on the product data set;
It further includes,
The step of generating the evaluation information is,
generating evaluation information corresponding to at least one product data included in the product data set based additionally on the seller information;
Including,
How to evaluate product information.

A computer program stored in a computer-readable storage medium, wherein the computer program, when executed on one or more processors, performs the following operations for evaluating product information, the operations being:
An operation of generating standardized product data corresponding to unstructured product text;
An operation of calculating price information based on a product data set including a plurality of product data;
An operation of calculating status information based on the product data set; and
generating evaluation information corresponding to at least one product data included in the product data set based on the price information and additionally based on the status information;
Including,
The operation of calculating the status information is,
Computing a state score for each product data included in the product data set based on at least one state index; and
Correcting the calculated health score based on at least one health assistance index;
Including,
A computer program stored on a computer-readable storage medium.

As a product information evaluation device,
One or more processors;
one or more memories; and
network department;
Includes, and
The one or more processors:
Generate standardized product data corresponding to unstructured product text,
Calculate price information based on a product data set containing multiple product data,
Calculate status information based on the product data set, and
Generating evaluation information corresponding to at least one product data included in the product data set based on the price information, but additionally based on the status information,
The process of calculating the status information includes calculating a status score for each product data included in the product data set based on at least one status index; and
correcting the calculated state score based on at least one state auxiliary index;
Including,
Product information evaluation device.