KR102591481B1

KR102591481B1 - AI-based sales product recommendation system using trend analysis

Info

Publication number: KR102591481B1
Application number: KR1020230050801A
Authority: KR
Inventors: 최환석
Original assignee: 에디토 주식회사
Priority date: 2023-04-18
Filing date: 2023-04-18
Publication date: 2023-10-19

Abstract

According to one embodiment, in a sales product recommendation system based on artificial intelligence-based trends, a system including a server for recommending products for sale based on artificial intelligence-based trends is provided. It has the effect of recommending products by category, which may predict an increase in sales based on sales volume, number of reviews, number of searches, and social media data.

Description

AI-based sales product recommendation system using trend analysis}

본 발명은 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템을 제공하기 위한 기술에 관한 것이다.The present invention relates to technology for providing a sales product recommendation system based on artificial intelligence-based trends.

종래의 판매제품 추천 시스템은, 이전에 판매한 판매 기록을 바탕으로 앞으로의 판매 제품을 추천할 수 있었다. 하지만 이러한 판매제품 추천 시스템은, 이전의 판매 기록만을 기반으로 제품을 추천하기 때문에, 빠르게 바뀌는 트렌드에 대응하는 판매 제품을 추천하기 어려운 문제점이 있었다.The conventional sales product recommendation system was able to recommend future sales products based on previously sold sales records. However, because this sales product recommendation system recommends products based only on previous sales records, it has the problem of making it difficult to recommend sales products that respond to rapidly changing trends.

또한, 판매자가 이미 판매한 제품만을 고려하기 때문에, 판매자가 다양한 제품을 판매하지 않을 경우, 시장 동향과 판매 트렌드를 정확하게 파악할 수 없는 문제점이 있었다.In addition, since only products already sold by the seller are considered, there is a problem in that market trends and sales trends cannot be accurately identified if the seller does not sell a variety of products.

그리고 이전 판매 기록만을 기반으로 제품을 추천하기 때문에, 이전 판매 기록에서의 잘못된 판매 행동을 반복할 수 있다. 예를 들면, 판매자가 이전에 판매했던 특정 제품의 판매량이 떨어졌던 데이터가 존재하면, 해당 제품을 다시는 추천하지 않을 수 있다. 그러나 이전의 판매량 데이터와 다른 추세를 보이는 경우도 있기에, 올바르지 않은 추천이 반복될 경우 잠재적인 매출 기회를 놓칠 수 있는 문제점이 있었다.And because products are recommended only based on previous sales records, incorrect sales behavior from previous sales records may be repeated. For example, if there is data showing that sales of a specific product previously sold by a seller have dropped, the seller may not recommend that product again. However, since there are cases where trends differ from previous sales data, there is a problem that potential sales opportunities may be missed if incorrect recommendations are repeated.

또한, 판매자는 여러 가격비교 사이트 서버와 소셜미디어 서버를 통해 획득한 대량의 데이터 중에서 필요한 데이터를 얻기 어려운 문제점이 있었다.Additionally, sellers had the problem of obtaining the necessary data from the large amount of data obtained through various price comparison site servers and social media servers.

한국공개특허 제10-2021-0099690호Korean Patent Publication No. 10-2021-0099690 한국등록특허 제10-2425811호Korean Patent No. 10-2425811 한국등록특허 제10-2147649호Korean Patent No. 10-2147649 한국등록특허 제10-2363615호Korean Patent No. 10-2363615

일실시예에 따르면, 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템에 있어서, 인공지능 기반 트렌드에 기초한 판매제품을 추천하는 서버를 포함하는 시스템을 제공하기 위한 것을 그 목적으로 한다.According to one embodiment, the purpose of the system is to provide a system for recommending products for sale based on artificial intelligence-based trends, including a server that recommends products for sale based on artificial intelligence-based trends.

또한, 상기 서버는, 웹사이트 또는 모바일앱을 운용하고, 복수의 가격비교 사이트 서버를 통해, 카테고리별 제품의 판매량, 후기수 및 검색수를 포함하는 제1 데이터를 주 단위로 수집하고, 판매량, 후기수 및 검색수를 학습데이터로 판매량의 증가를 예측하도록 기학습된 제1 인공지능 모델에, 상기 제1 데이터를 입력하여 제1 출력을 획득하되, 상기 제1 인공지능 모델은, 회귀 분석 알고리즘(regression analysis)을 기반으로 하고, 복수의 소셜미디어 서버를 통해, 텍스트를 포함하는 제2 데이터를 격일 수집하고, 긍정적인 언급수가 증가한 제품을 출력하도록 기학습된 제2 인공지능 모델에, 상기 제2 데이터를 입력하여 제2 출력을 획득하되, 상기 제2 인공지능 모델은, 텍스트 마이닝을 위한 TF-IDF, Word2Vec, LDA 중 하나의 알고리즘, 감성 분석을 위한 Deep Learning 알고리즘, 시계열 분석을 위한 ARIMA 또는 Prophet 알고리즘을 기반으로 하고, 상기 텍스트 마이닝을 위한 키워드는, 총 판매량이 가장 많은 가격비교 사이트의 카테고리 대분류, 중분류, 소분류 명칭이고, 상기 서버는, 총 판매량이 가장 많은 가격비교 사이트 서버를 통해, 최하위 카테고리별 판매량 1위 상품의 리뷰, 해시태그, 상품설명, 판매량 및 검색수를 포함하는 제3 데이터를 주 단위로 수집하고, 기학습된 제3 인공지능 모델에, 상기 판매량 1위 상품의 리뷰를 감정 분석하고, 감정 분석 결과가 긍정적 중립적, 부정적인 리뷰의 개수를 출력하되, 상기 제3 인공지능 모델은, KoBERT, KoELECTRA, HanBERT 모델 중 하나를 이용하고, 상기 웹사이트 또는 모바일앱을 통해 사용자 계정이 로그인되면, 사용자 계정 정보에서 사용자 단말을 통해 기설정된 관심 카테고리를 획득하고, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제1 출력을 바탕으로 카테고리별 예상 판매량의 증가율이 높은 순서로 제품을 정렬하여 표시하되, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시하고, 사용자 단말을 통해, 상기 예상 판매량의 증가가 큰 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 판매량, 후기수, 검색수 추이 및 예상 판매량을 그래프로 시각화하여 표시하고, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제2 출력을 바탕으로 카테고리별 긍정적 언급수 증가 제품을 언급수가 많은 순서로 정렬하여 표시하되, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시하고, 사용자 단말을 통해, 상기 언급수가 많은 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 언급수 추이를 그래프로 시각화하여 표시하고, 상기 사용자 단말을 통해, 상기 웹사이트 또는 모바일앱에서 최하위 카테고리가 선택되면, 상기 제3 데이터와 상기 제3 인공지능 모델의 출력을 바탕으로, 상기 웹사이트 또는 모바일앱에 상기 선택된 최하위 카테고리의 판매량 1위 상품의 해시태그, 상품설명을 게시하고, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 긍정적, 중립적, 부정적인 리뷰 개수를 시각화하여 표시하고, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 판매량과 검색수 추이를 시각화하여 표시하는 시스템을 제공할 수 있다.In addition, the server operates a website or mobile app and collects first data including sales volume, number of reviews, and number of searches of products by category on a weekly basis through a plurality of price comparison site servers, sales volume, The first data is input to a first artificial intelligence model that has been previously trained to predict an increase in sales using the number of reviews and searches as learning data to obtain a first output, and the first artificial intelligence model uses a regression analysis algorithm. Based on regression analysis, second data including text is collected every other day through a plurality of social media servers, and a second artificial intelligence model is pre-trained to output products with an increased number of positive mentions. 2 Input data to obtain a second output, wherein the second artificial intelligence model is one of TF-IDF, Word2Vec, and LDA algorithms for text mining, Deep Learning algorithm for sentiment analysis, ARIMA for time series analysis, or Based on the Prophet algorithm, the keywords for text mining are the category major, medium, and small categories of the price comparison site with the highest total sales, and the server is the lowest ranking through the price comparison site server with the highest total sales. Third-party data, including reviews, hashtags, product descriptions, sales volume, and number of searches, of the top-selling product by category are collected on a weekly basis, and the reviews of the top-selling product are generated by a pre-trained third artificial intelligence model. Sentiment analysis is performed, and the sentiment analysis result outputs the number of positive, neutral, and negative reviews. The third artificial intelligence model uses one of the KoBERT, KoELECTRA, and HanBERT models, and the user account is registered through the website or mobile app. When logged in, a preset interest category is obtained through the user terminal from the user account information, and products are ordered in order of the highest growth rate of expected sales volume for each category based on the first output to the user terminal through the website or mobile app. are sorted and displayed so that the preset interest category is exposed first, and when one of the products posted in the order of the largest increase in expected sales is selected through the user terminal, the sales volume and number of reviews of the selected product are displayed. , visualizing and displaying the search count trend and expected sales volume in a graph, and sorting products with an increase in the number of positive mentions by category in order of the number of mentions based on the second output to the user terminal through the website or mobile app. Displayed so that the preset interest category is exposed first, and when one of the products posted in the order of the number of mentions is selected through the user terminal, the trend of the number of mentions of the selected product is visualized and displayed in a graph. , When the lowest category is selected on the website or mobile app through the user terminal, based on the third data and the output of the third artificial intelligence model, the sales volume of the lowest category selected on the website or mobile app Post the hashtag and product description of the No. 1 product, visualize and display the number of positive, neutral, and negative reviews of the No. 1 product in sales in the selected lowest category, and display the sales volume and number of searches for the No. 1 product in sales in the selected lowest category. A system that visualizes and displays trends can be provided.

또한, 상기 서버는, 사용자 단말을 통해 생성된, B2B 공급자가 공급하는 제품의 판매, 취소, 교환, 반품 데이터를 포함하는 제4 데이터를 수집하고, 상기 B2B 공급자의 제품의 마진율, 취소율, 교환율, 반품율을 산출하고, 사용자 단말을 통해 생성된, B2C 판매자가 판매하는 제품의 판매, 취소, 교환, 반품 데이터를 포함하는 제5 데이터를 수집하고, 상기 B2C 판매자의 제품의 마진율, 취소율, 교환율, 반품율을 산출하고, 상기 제4 데이터와 상기 제5 데이터를 바탕으로, 기지정된 기준에 따라 문제 제품과 추천 제품을 추출하고, 상기 웹사이트 또는 모바일앱에 상기 추출한 문제 제품과 추천 제품을 게시하고, 상기 문제 제품과 추천 제품을 추출하기 위한 점수는, [수학식 1]을 기반으로 산출하고,In addition, the server collects fourth data generated through the user terminal, including sales, cancellation, exchange, and return data of products supplied by the B2B supplier, and margin rate, cancellation rate, and exchange of the products of the B2B supplier. Calculate the rate and return rate, collect fifth data generated through the user terminal, including sales, cancellation, exchange, and return data of products sold by B2C sellers, and calculate the margin rate, cancellation rate, and Calculate the exchange rate and return rate, extract problem products and recommended products according to predetermined criteria based on the fourth data and fifth data, and post the extracted problem products and recommended products on the website or mobile app. The score for posting and extracting the problem products and recommended products is calculated based on [Equation 1],

[수학식 1][Equation 1]

상기 score는 문제 제품과 추천 제품을 추출하기 위한 점수이고, m는 마진율이며, p는 판매율이고, s는 취소율이며, t는 교환율이고, q는 반품율이고, 상기 score가 카테고리별 score 분포에서 상위 15% 이상이면 문제 제품, 하위 15% 이하이면 추천 제품으로 추출하는 시스템을 제공할 수 있다.The score is a score for extracting problem products and recommended products, m is the margin rate, p is the sales rate, s is the cancellation rate, t is the exchange rate, q is the return rate, and the score is from the score distribution for each category. A system can be provided to extract problem products if they are in the top 15% or more, and recommended products if they are in the bottom 15% or less.

상기와 같이, 인공지능 기반 트렌드에 기초한 판매제품을 추천 시스템을 제공함으로써, 판매량, 후기수, 검색수, 소셜미디어 데이터를 바탕으로 판매량의 증가를 예측한 제품을 카테고리별로 추천할 수 있고, 사용자 단말에 판매량, 후기수, 검색수, 긍정적 언급수 추이 및 예상 판매량을 시각화하여 표시할 수 있으며, 최하위 카테고리별 판매량 1위 상품의 데이터를 시각화하여 제공할 수 있고, B2B 공급자가 공급하는 제품의 데이터와 B2C 판매자가 판매하는 제품의 데이터를 바탕으로 문제 제품과 추천 제품을 추출하여 사용자 단말에 제공할 수 있는 효과가 있다.As described above, by providing a recommendation system for sales products based on artificial intelligence-based trends, products whose sales volume is predicted to increase based on sales volume, number of reviews, number of searches, and social media data can be recommended by category, and the user terminal You can visualize and display the sales volume, number of reviews, number of searches, trends in the number of positive mentions, and expected sales volume. You can visualize and provide data on the product with the highest sales volume for each lowest category, and data on products supplied by B2B suppliers and It has the effect of extracting problem products and recommended products based on data on products sold by B2C sellers and providing them to the user terminal.

본 발명의 목적은 이상에서 언급한 목적으로 제한되지 않으며, 언급되지 않은 또 다른 목적들은 아래의 기재로부터 명확하게 이해될 수 있을 것이다.The object of the present invention is not limited to the object mentioned above, and other objects not mentioned can be clearly understood from the description below.

일실시예에 따르면, 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템에 있어서, 인공지능 기반 트렌드에 기초한 판매제품을 추천하는 서버를 포함하는 시스템을 제공한다.According to one embodiment, in a system for recommending products for sale based on artificial intelligence-based trends, a system including a server for recommending products for sale based on artificial intelligence-based trends is provided.

또한, 상기 서버는, 웹사이트 또는 모바일앱을 운용하고, 복수의 가격비교 사이트 서버를 통해, 카테고리별 제품의 판매량, 후기수 및 검색수를 포함하는 제1 데이터를 주 단위로 수집하고, 판매량, 후기수 및 검색수를 학습데이터로 판매량의 증가를 예측하도록 기학습된 제1 인공지능 모델에, 상기 제1 데이터를 입력하여 제1 출력을 획득하되, 상기 제1 인공지능 모델은, 회귀 분석 알고리즘(regression analysis)을 기반으로 하고, 복수의 소셜미디어 서버를 통해, 텍스트를 포함하는 제2 데이터를 격일 수집하고, 긍정적인 언급수가 증가한 제품을 출력하도록 기학습된 제2 인공지능 모델에, 상기 제2 데이터를 입력하여 제2 출력을 획득하되, 상기 제2 인공지능 모델은, 텍스트 마이닝을 위한 TF-IDF, Word2Vec, LDA 중 하나의 알고리즘, 감성 분석을 위한 Deep Learning 알고리즘, 시계열 분석을 위한 ARIMA 또는 Prophet 알고리즘을 기반으로 하고, 상기 텍스트 마이닝을 위한 키워드는, 총 판매량이 가장 많은 가격비교 사이트의 카테고리 대분류, 중분류, 소분류 명칭이고, 상기 서버는, 총 판매량이 가장 많은 가격비교 사이트 서버를 통해, 최하위 카테고리별 판매량 1위 상품의 리뷰, 해시태그, 상품설명, 판매량 및 검색수를 포함하는 제3 데이터를 주 단위로 수집하고, 기학습된 제3 인공지능 모델에, 상기 판매량 1위 상품의 리뷰를 감정 분석하고, 감정 분석 결과가 긍정적 중립적, 부정적인 리뷰의 개수를 출력하되, 상기 제3 인공지능 모델은, KoBERT, KoELECTRA, HanBERT 모델 중 하나를 이용하고, 상기 웹사이트 또는 모바일앱을 통해 사용자 계정이 로그인되면, 사용자 계정 정보에서 사용자 단말을 통해 기설정된 관심 카테고리를 획득하고, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제1 출력을 바탕으로 카테고리별 예상 판매량의 증가율이 높은 순서로 제품을 정렬하여 표시하되, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시하고, 사용자 단말을 통해, 상기 예상 판매량의 증가가 큰 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 판매량, 후기수, 검색수 추이 및 예상 판매량을 그래프로 시각화하여 표시하고, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제2 출력을 바탕으로 카테고리별 긍정적 언급수 증가 제품을 언급수가 많은 순서로 정렬하여 표시하되, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시하고, 사용자 단말을 통해, 상기 언급수가 많은 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 언급수 추이를 그래프로 시각화하여 표시하고, 상기 사용자 단말을 통해, 상기 웹사이트 또는 모바일앱에서 최하위 카테고리가 선택되면, 상기 제3 데이터와 상기 제3 인공지능 모델의 출력을 바탕으로, 상기 웹사이트 또는 모바일앱에 상기 선택된 최하위 카테고리의 판매량 1위 상품의 해시태그, 상품설명을 게시하고, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 긍정적, 중립적, 부정적인 리뷰 개수를 시각화하여 표시하고, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 판매량과 검색수 추이를 시각화하여 표시할 수 있다.In addition, the server operates a website or mobile app and collects first data including sales volume, number of reviews, and number of searches of products by category on a weekly basis through a plurality of price comparison site servers, sales volume, The first data is input to a first artificial intelligence model that has been previously trained to predict an increase in sales using the number of reviews and searches as learning data to obtain a first output, and the first artificial intelligence model uses a regression analysis algorithm. Based on regression analysis, second data including text is collected every other day through a plurality of social media servers, and a second artificial intelligence model is pre-trained to output products with an increased number of positive mentions. 2 Input data to obtain a second output, wherein the second artificial intelligence model is one of TF-IDF, Word2Vec, and LDA algorithms for text mining, Deep Learning algorithm for sentiment analysis, ARIMA for time series analysis, or Based on the Prophet algorithm, the keywords for text mining are the category major, medium, and small categories of the price comparison site with the highest total sales, and the server is the lowest ranking through the price comparison site server with the highest total sales. Third-party data, including reviews, hashtags, product descriptions, sales volume, and number of searches, of the top-selling product by category are collected on a weekly basis, and the reviews of the top-selling product are generated by a pre-trained third artificial intelligence model. Sentiment analysis is performed, and the sentiment analysis result outputs the number of positive, neutral, and negative reviews. The third artificial intelligence model uses one of the KoBERT, KoELECTRA, and HanBERT models, and the user account is registered through the website or mobile app. When logged in, a preset interest category is obtained through the user terminal from the user account information, and products are ordered in order of the highest growth rate of expected sales volume for each category based on the first output to the user terminal through the website or mobile app. are sorted and displayed so that the preset interest category is exposed first, and when one of the products posted in the order of the largest increase in expected sales is selected through the user terminal, the sales volume and number of reviews of the selected product are displayed. , visualizing and displaying the search count trend and expected sales volume in a graph, and sorting products with an increase in the number of positive mentions by category in order of the number of mentions based on the second output to the user terminal through the website or mobile app. Displayed so that the preset interest category is exposed first, and when one of the products posted in the order of the number of mentions is selected through the user terminal, the trend of the number of mentions of the selected product is visualized and displayed in a graph. , When the lowest category is selected on the website or mobile app through the user terminal, based on the third data and the output of the third artificial intelligence model, the sales volume of the lowest category selected on the website or mobile app Post the hashtag and product description of the No. 1 product, visualize and display the number of positive, neutral, and negative reviews of the No. 1 product in sales in the selected lowest category, and display the sales volume and number of searches for the No. 1 product in sales in the selected lowest category. Trends can be visualized and displayed.

[수학식 1][Equation 1]

상기 score는 문제 제품과 추천 제품을 추출하기 위한 점수이고, m는 마진율이며, p는 판매율이고, s는 취소율이며, t는 교환율이고, q는 반품율이고, 상기 score가 카테고리별 score 분포에서 상위 15% 이상이면 문제 제품, 하위 15% 이하이면 추천 제품으로 추출할 수 있다.The score is a score for extracting problem products and recommended products, m is the margin rate, p is the sales rate, s is the cancellation rate, t is the exchange rate, q is the return rate, and the score is from the score distribution for each category. If it is in the top 15% or more, it can be extracted as a problem product, and if it is in the bottom 15% or less, it can be extracted as a recommended product.

일실시예에 따르면, 인공지능 기반 트렌드에 기초한 판매제품을 추천 시스템을을 제공함으로써, 판매량, 후기수, 검색수, 소셜미디어 데이터를 바탕으로 판매량의 증가를 예측한 제품을 카테고리별로 추천할 수 있는 효과가 있다.According to one embodiment, by providing a recommendation system for sales products based on artificial intelligence-based trends, products that predict an increase in sales volume based on sales volume, number of reviews, number of searches, and social media data can be recommended by category. It works.

또한, 사용자 단말에 판매량, 후기수, 검색수, 긍정적 언급수 추이 및 예상 판매량을 시각화하여 표시할 수 있는 효과가 있다.In addition, there is an effect of being able to visualize and display sales volume, number of reviews, number of searches, trend of positive mentions, and expected sales volume on the user terminal.

또한, 사용자 단말에 최하위 카테고리별 판매량 1위 상품의 데이터를 시각화하여 제공할 수 있는 효과가 있다.In addition, there is an effect of being able to visualize and provide data on the product with the highest sales volume for each lowest category to the user terminal.

또한, B2B 공급자가 공급하는 제품의 데이터와 B2C 판매자가 판매하는 제품의 데이터를 바탕으로 문제 제품과 추천 제품을 추출하여 사용자 단말에 제공할 수 있는 효과가 있다.In addition, there is an effect in that problem products and recommended products can be extracted and provided to the user terminal based on data on products supplied by B2B suppliers and data on products sold by B2C sellers.

한편, 실시예들에 따른 효과들은 이상에서 언급한 것으로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 해당 기술 분야의 통상의 지식을 가진 자에게 명확히 이해될 수 있을 것이다.Meanwhile, the effects according to the embodiments are not limited to those mentioned above, and other effects not mentioned can be clearly understood by those skilled in the art from the description below.

도 1은 일실시예에 따른 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템의 구성을 개략적으로 나타낸 개념도이다.
도 2는 일실시예에 따른 데이터를 수집하고 분석하는 과정을 설명하기 위한 순서도이다.
도 3은 일실시예에 따른 사용자 단말에 분석 데이터를 제공하는 과정을 설명하기 위한 순서도이다.
도 4는 일실시예에 따른 사용자 단말에 문제 상품과 추천 상품을 제공하는 과정을 설명하기 위한 순서도이다.Figure 1 is a conceptual diagram schematically showing the configuration of a sales product recommendation system based on artificial intelligence-based trends according to an embodiment.
Figure 2 is a flow chart to explain the process of collecting and analyzing data according to an embodiment.
Figure 3 is a flowchart for explaining a process of providing analysis data to a user terminal according to an embodiment.
Figure 4 is a flowchart for explaining the process of providing problem products and recommended products to a user terminal according to an embodiment.

이하에서, 첨부된 도면을 참조하여 실시예들을 상세하게 설명한다. 그러나, 실시예들에는 다양한 변경이 가해질 수 있어서 특허출원의 권리 범위가 이러한 실시예들에 의해 제한되거나 한정되는 것은 아니다. 실시예들에 대한 모든 변경, 균등물 내지 대체물이 권리 범위에 포함되는 것으로 이해되어야 한다.Hereinafter, embodiments will be described in detail with reference to the attached drawings. However, various changes can be made to the embodiments, so the scope of the patent application is not limited or limited by these embodiments. It should be understood that all changes, equivalents, or substitutes for the embodiments are included in the scope of rights.

실시예들에 대한 특정한 구조적 또는 기능적 설명들은 단지 예시를 위한 목적으로 개시된 것으로서, 다양한 형태로 변경되어 실시될 수 있다. 따라서, 실시예들은 특정한 개시형태로 한정되는 것이 아니며, 본 명세서의 범위는 기술적 사상에 포함되는 변경, 균등물, 또는 대체물을 포함한다.Specific structural or functional descriptions of the embodiments are disclosed for illustrative purposes only and may be modified and implemented in various forms. Accordingly, the embodiments are not limited to the specific disclosed form, and the scope of the present specification includes changes, equivalents, or substitutes included in the technical spirit.

제1 또는 제2 등의 용어를 다양한 구성요소들을 설명하는데 사용될 수 있지만, 이런 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 해석되어야 한다. 예를 들어, 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소는 제1 구성요소로도 명명될 수 있다.Terms such as first or second may be used to describe various components, but these terms should be interpreted only for the purpose of distinguishing one component from another component. For example, a first component may be named a second component, and similarly, the second component may also be named a first component.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다.When a component is referred to as being “connected” to another component, it should be understood that it may be directly connected or connected to the other component, but that other components may exist in between.

실시예에서 사용한 용어는 단지 설명을 목적으로 사용된 것으로, 한정하려는 의도로 해석되어서는 안된다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "포함하다" 또는 "가지다" 등의 용어는 명세서 상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terms used in the examples are for descriptive purposes only and should not be construed as limiting. Singular expressions include plural expressions unless the context clearly dictates otherwise. In this specification, terms such as “comprise” or “have” are intended to designate the presence of features, numbers, steps, operations, components, parts, or combinations thereof described in the specification, but are not intended to indicate the presence of one or more other features. It should be understood that this does not exclude in advance the possibility of the existence or addition of elements, numbers, steps, operations, components, parts, or combinations thereof.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 실시예가 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as generally understood by a person of ordinary skill in the technical field to which the embodiments belong. Terms defined in commonly used dictionaries should be interpreted as having a meaning consistent with the meaning in the context of the related technology, and unless explicitly defined in the present application, should not be interpreted in an ideal or excessively formal sense. No.

또한, 첨부 도면을 참조하여 설명함에 있어, 도면 부호에 관계없이 동일한 구성 요소는 동일한 참조부호를 부여하고 이에 대한 중복되는 설명은 생략하기로 한다. 실시예를 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 실시예의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.In addition, when describing with reference to the accompanying drawings, identical components will be assigned the same reference numerals regardless of the reference numerals, and overlapping descriptions thereof will be omitted. In describing the embodiments, if it is determined that detailed descriptions of related known technologies may unnecessarily obscure the gist of the embodiments, the detailed descriptions are omitted.

실시예들은 퍼스널 컴퓨터, 랩톱 컴퓨터, 태블릿 컴퓨터, 스마트 폰, 텔레비전, 스마트 가전 기기, 지능형 자동차, 키오스크, 웨어러블 장치 등 다양한 형태의 제품으로 구현될 수 있다.Embodiments may be implemented in various types of products such as personal computers, laptop computers, tablet computers, smart phones, televisions, smart home appliances, intelligent vehicles, kiosks, and wearable devices.

도 1은 일실시예에 따른 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템의 구성을 개략적으로 나타낸 개념도이고, 도 2는 일실시예에 따른 데이터를 수집하고 분석하는 과정을 설명하기 위한 순서도이며, 도 3은 일실시예에 따른 사용자 단말에 분석 데이터를 제공하는 과정을 설명하기 위한 순서도이고, 도 4는 일실시예에 따른 사용자 단말에 문제 상품과 추천 상품을 제공하는 과정을 설명하기 위한 순서도이다.Figure 1 is a conceptual diagram schematically showing the configuration of a sales product recommendation system based on artificial intelligence-based trends according to an embodiment, and Figure 2 is a flow chart to explain the process of collecting and analyzing data according to an embodiment. 3 is a flowchart for explaining a process for providing analysis data to a user terminal according to an embodiment, and Figure 4 is a flowchart for explaining a process for providing problem products and recommended products to a user terminal according to an embodiment.

도 1을 참조하면, 일실시예로서 인공지능 기반 트렌드에 기초한 판매제품 추천 시스템은, 서버(100)를 포함할 수 있다. 상기 서버는, 복수의 가격비교 사이트의 서버와, 복수의 소셜미디어 서버로부터 데이터를 수집하고 분석한 결과를 사용자 단말에 제공할 수 있다.Referring to FIG. 1 , as an example, a system for recommending products for sale based on artificial intelligence-based trends may include a server 100. The server may collect data from a plurality of price comparison site servers and a plurality of social media servers and provide analysis results to the user terminal.

서버(100)는, 복수의 가격비교 사이트의 서버, 복수의 소셜미디어 서버, 및 사용자 단말이 데이터를 송수신하기 위하여 네트워크를 이용할 수 있다. 네트워크는 Wi-Fi, Bluetooth, NFC 등의 무선 네트워크 기술, 이더넷, 광섬유, DSL 등의 유선 네트워크 기술 중 그 어느 기술도 이용될 수 있다.The server 100 may use a network to enable servers of a plurality of price comparison sites, a plurality of social media servers, and user terminals to transmit and receive data. The network can use any of wireless network technologies such as Wi-Fi, Bluetooth, and NFC, and wired network technologies such as Ethernet, optical fiber, and DSL.

일실시예로서 서버는, 복수의 서버들을 통해 수집한 데이터들을 분석하는 복수의 인공지능 모델을 포함할 수 있다.In one embodiment, the server may include a plurality of artificial intelligence models that analyze data collected through a plurality of servers.

도 2를 참조하면, 상기 서버(100)는, 웹사이트 또는 모바일앱을 운용할 수 있다. 사용자 단말에 맞는 적절한 UI/UX를 고려하여 적절한 크기와 형태로 결과 데이터를 제공할 수 있다. 상기 웹사이트 또는 모바일앱을 통해 사용자 정보를 획득하고 관리할 수도 있다. 사용자 단말을 통해 작성된 사용자 정보를 바탕으로 맞춤형 판매제품 추천 서비스를 제공할 수도 있다.Referring to FIG. 2, the server 100 can operate a website or mobile app. Resulting data can be provided in an appropriate size and form considering the appropriate UI/UX for the user terminal. User information can also be obtained and managed through the above website or mobile app. A customized sales product recommendation service can also be provided based on user information created through the user terminal.

사용자가 관심 카테고리로 지정한 카테고리의 데이터만 상기 웹페이지나 모바일앱에 표시할 수 있고, 사용자가 이전에 확인한 카테고리 기록을 저장하여 우측에 검색 기록 목록을 제공할 수도 있고, 사용자 단말을 통해 선택된 카테고리와 관련이 높은 카테고리를 추천하거나, 대분류, 중분류, 소분류 카테고리를 눈으로 쉽게 확인하고 선택할 수 있도록 카테고리 목록을 드롭다운 리스트 형태로 제공할 수 있다.Only data from categories designated by the user as a category of interest can be displayed on the web page or mobile app. Category records previously checked by the user can be saved to provide a search history list on the right, and categories and categories selected through the user terminal can be displayed on the mobile app. You can recommend highly related categories, or provide a list of categories in the form of a drop-down list so that you can easily check and select major, medium, and small categories.

일실시예로서 서버(100)는, S201 단계에서 복수의 가격비교 사이트 서버를 통해, 카테고리별 제품의 판매량, 후기수 및 검색수를 포함하는 제1 데이터를 주 단위로 수집할 수 있다.As an example, the server 100 may collect first data including the sales volume, number of reviews, and number of searches of products by category on a weekly basis through a plurality of price comparison site servers in step S201.

데이터 수집을 위하여 복수의 가격비교 사이트에서 제공하는 API를 이용할 수 있다.To collect data, you can use APIs provided by multiple price comparison sites.

또한, S202 단계에서 판매량, 후기수 및 검색수를 학습데이터로 판매량의 증가를 예측하도록 기학습된 제1 인공지능 모델에, 상기 제1 데이터를 입력하여 제1 출력을 획득할 수 있다.Additionally, in step S202, the first data may be input to a first artificial intelligence model that has been pre-trained to predict an increase in sales using the sales volume, number of reviews, and number of searches as learning data to obtain a first output.

상기 제1 인공지능 모델은, 회귀 분석 알고리즘(regression analysis)을 기반으로 할 수 있다.The first artificial intelligence model may be based on a regression analysis algorithm.

회귀 분석 알고리즘은, 독립 변수와 종속 변수 간의 관계를 분석하는 통계학적 기법 중 하나로서, 독립 변수와 종속 변수는 수학적으로 연관성이 있는데, 회귀 분석은 이러한 연관성을 통계적으로 분석하고 모델링할 수 있다. 이 모델링된 회귀식을 이용하여 독립 변수 값이 주어졌을 때 종속 변수 값을 예측할 수 있다.The regression analysis algorithm is one of the statistical techniques that analyzes the relationship between independent variables and dependent variables. The independent variables and dependent variables are mathematically related, and regression analysis can statistically analyze and model this relationship. Using this modeled regression equation, the dependent variable value can be predicted given the independent variable value.

판매량을 예측하기 위해 판매량, 후기수, 검색수를 독립 변수로 하여 회귀 분석 알고리즘을 적용하면, 각 독립 변수가 종속 변수인 판매량과 어떤 관계를 가지는지 분석할 수 있다.If you apply a regression analysis algorithm with sales volume, number of reviews, and number of searches as independent variables to predict sales volume, you can analyze the relationship between each independent variable and sales volume, which is the dependent variable.

S203 단계에서 서버(100)는, 복수의 소셜미디어 서버를 통해, 텍스트를 포함하는 제2 데이터를 격일 수집할 수 있다.In step S203, the server 100 may collect second data including text every other day through a plurality of social media servers.

복수의 소셜미디어 서버를 통해 데이터를 수집하기 위하여, 상기 복수의 소셜미디어 서버에서 제공하는 API를 이용할 수 있다.To collect data through multiple social media servers, you can use APIs provided by the multiple social media servers.

복수의 소셜미디어에 올라오는 텍스트 데이터는 실시간 업데이트되고 변화가 빠르며 여러 사용자들의 관심사와 트렌드를 반영하고 있다고 보는 것이 타당할 수 있다. 소셜미디어 데이터의 빠른 변화를 고려하여 데이터는 격일 수집할 수 있다.It may be reasonable to say that text data posted on multiple social media is updated in real time, changes quickly, and reflects the interests and trends of various users. Considering the rapid changes in social media data, data can be collected every other day.

다음으로 S204 단계에서 서버(100)는, 긍정적인 언급수가 증가한 제품을 출력하도록 기학습된 제2 인공지능 모델에, 상기 제2 데이터를 입력하여 제2 출력을 획득할 수 있다.Next, in step S204, the server 100 may obtain a second output by inputting the second data into a second artificial intelligence model that has been previously trained to output products with an increased number of positive mentions.

상기 제2 인공지능 모델은, 텍스트 마이닝을 위한 TF-IDF, Word2Vec, LDA 중 하나의 알고리즘, 감성 분석을 위한 Deep Learning 알고리즘, 시계열 분석을 위한 ARIMA 또는 Prophet 알고리즘을 기반으로 할 수 있다.The second artificial intelligence model may be based on one of TF-IDF, Word2Vec, and LDA algorithms for text mining, Deep Learning algorithm for sentiment analysis, and ARIMA or Prophet algorithm for time series analysis.

상기 텍스트 마이닝을 위한 키워드는, 총 판매량이 가장 많은 가격비교 사이트의 카테고리 대분류, 중분류, 소분류 명칭일 수 있다.Keywords for text mining may be the names of major categories, medium categories, and small categories of the price comparison site with the highest total sales volume.

예를 들어, 대분류는 여성패션, 남성패션, 화장품/미용, 가구/인테리어, 식품, 출산/유아동, 반려동물용품, 생활/주방용품, 가전, 디지털, 컴퓨터, 스포츠/레저, 건강/의료용품, 자동차/공구, 취미/문구/악기, 여행, 도서/음반/DVD, 헬스/건강, 문구/오피스 등일 수 있다. 또한, 더욱 세부한 카테고리를 포함할 수도 있다. 예를 들어, 여성패션 카테고리는 여성의류, 여성신발/지갑, 주얼리, 언더웨어/잠옷, 시계, 패션잡화/소품, 원피스 등의 카테고리를 포함할 수 있다.For example, major categories include women's fashion, men's fashion, cosmetics/beauty, furniture/interior, food, maternity/baby care, pet supplies, household/kitchen supplies, home appliances, digital, computers, sports/leisure, health/medical supplies, It could be cars/tools, hobbies/stationery/musical instruments, travel, books/records/DVDs, fitness/health, stationery/office, etc. Additionally, more detailed categories may be included. For example, the women's fashion category may include categories such as women's clothing, women's shoes/purse, jewelry, underwear/pajamas, watches, fashion accessories/accessories, and dresses.

TF-IDF(Term Frequency-Inverse Document Frequency)는, 텍스트 내에서 단어의 빈도와 전체 문서에서의 빈도를 고려하여 단어의 가중치를 계산할 수 있다. 카테고리 대분류, 중분류, 소분류 명칭과 같은 특정 단어가 소셜미디어 데이터에 많이 등장하면 상기 특정 단어는 해당 문서와 관련이 깊다고 판단할 수 있다.TF-IDF (Term Frequency-Inverse Document Frequency) can calculate the weight of a word by considering its frequency in the text and its frequency in the entire document. If a specific word, such as a major category, medium category, or subcategory name, appears frequently in social media data, it can be determined that the specific word is closely related to the document.

Word2Vec는, 텍스트에 있는 단어들을 벡터 공간에 매핑하는 방식으로 단어들의 유사도를 계산할 수 있는 알고리즘으로서, 각 단어를 고정된 길이의 벡터로 표현하여 문서 내에서의 단어 간 유사도를 계산할 수 있다.Word2Vec is an algorithm that can calculate the similarity of words by mapping words in a text to a vector space. It can calculate the similarity between words in a document by expressing each word as a vector of fixed length.

LDA(Latent Dirichlet Allocation)는, 주어진 문서들이 생성된 과정을 가정하고, 문서에서 단어들이 어떤 토픽(topic)에 속해있을지 예측하는 알고리즘으로서, 각 토픽은 여러 개의 단어로 구성되어 있고, 문서는 다수의 토픽으로 구성될 수 있다.LDA (Latent Dirichlet Allocation) is an algorithm that assumes the process by which given documents were created and predicts which topic words in the document will belong to. Each topic consists of several words, and the document contains multiple words. It can be composed of topics.

감성 분석을 위한 Deep Learning 알고리즘은 딥러닝을 이용하여 감정을 분석하는 방식으로서, 더 상세하게는 LSTM(Long Short-Term Memory)을 이용하여 감성 분류 모델을 구성할 수 있다.The Deep Learning algorithm for emotional analysis is a method of analyzing emotions using deep learning. More specifically, an emotional classification model can be constructed using LSTM (Long Short-Term Memory).

LSTM(Long Short-Term Memory)은 시계열 데이터나 순차적으로 입력되는 데이터를 처리할 수 있으며, 이전 시점의 출력값이 현재 시점의 입력값으로 다시 들어가는 재귀적 구조를 가지고 있기 때문에, 시점이 지날수록 이전 입력값이 현재 출력값에 미치는 영향이 감소하는 문제가 발생하는 일반적인 RNN(Recurrent Neural Network) 구조에 반해, LSTM은 이전 정보를 보존하기 위한 기억 셀(memory cell)과 이를 제어하는 게이트(gate)를 도입하여 이전 입력값의 영향을 더 오랫동안 보존할 수 있다. 따라서 LSTM은 장기 의존성을 가진 데이터를 처리하는 데 뛰어난 성능을 보인다. LSTM을 이용하여 감성 분류 모델을 구성하면, 이전 입력값의 감성 정보를 잘 보존하며, 다음 입력값을 처리할 때 해당 정보를 반영할 수 있어 정확한 감성 분류를 수행할 수 있다.LSTM (Long Short-Term Memory) can process time series data or sequentially input data, and has a recursive structure in which the output value at a previous point in time goes back into the input value at the current point in time, so as time passes, the previous input In contrast to the typical RNN (Recurrent Neural Network) structure, which has the problem of reducing the influence of values on the current output value, LSTM introduces a memory cell to preserve previous information and a gate to control it. The influence of previous inputs can be preserved for a longer period of time. Therefore, LSTM shows excellent performance in processing data with long-term dependencies. When a sentiment classification model is constructed using LSTM, the sentiment information of the previous input value is well preserved and the information can be reflected when processing the next input value, allowing accurate sentiment classification.

시계열 분석을 위한 ARIMA(AutoRegressive Integrated Moving Average)는 시계열 데이터에서 자기 상관성, 추세성, 계절성 등을 고려하여 분석할 수 있다. Prophet은 시계열 예측 라이브러리로, 휴일 효과, 계절성 등을 고려하여 시계열 데이터를 예측할 수 있다. 따라서 휴일이나 특정 계절에 따른 언급도를 예측할 수도 있다.ARIMA (AutoRegressive Integrated Moving Average) for time series analysis can be analyzed by considering autocorrelation, trend, seasonality, etc. in time series data. Prophet is a time series prediction library that can predict time series data by considering holiday effects, seasonality, etc. Therefore, you can also predict mentions based on holidays or specific seasons.

일실시예로서 S205 단계에서 서버(100)는, 총 판매량이 가장 많은 가격비교 사이트 서버를 통해, 최하위 카테고리별 판매량 1위 상품의 리뷰, 해시태그, 상품설명, 판매량 및 검색수를 포함하는 제3 데이터를 주 단위로 수집할 수 있다.As an example, in step S205, the server 100, through the price comparison site server with the highest total sales, provides a third list including reviews, hashtags, product descriptions, sales volume, and number of searches of the product with the highest sales volume for each lowest category. Data can be collected on a weekly basis.

그리고 S206 단계에서 서버는, 기학습된 제3 인공지능 모델에, 상기 판매량 1위 상품의 리뷰를 감정 분석하고, 감정 분석 결과가 긍정적 중립적, 부정적인 리뷰의 개수를 출력하되, 상기 제3 인공지능 모델은, KoBERT, KoELECTRA, HanBERT 모델 중 하나를 이용할 수 있다.And in step S206, the server analyzes the reviews of the product with the highest sales volume using a pre-trained third artificial intelligence model, and outputs the number of positive, neutral, and negative reviews as a result of the sentiment analysis, and the third artificial intelligence model You can use one of the KoBERT, KoELECTRA, and HanBERT models.

KoBERT 모델은, 한글 자연어 처리 라이브러리인 KoNLPy와 BERT 모델을 결합한 모델로서, 다양한 감정 분류 데이터셋을 학습하여, 한글 텍스트에서 긍정, 부정, 중립 등의 감정을 분석할 수 있다. KoBERT는 특히 상품 리뷰와 같은 텍스트 데이터에 대해서 감정 분석 성능이 높아 유용할 수 있다.The KoBERT model is a model that combines KoNLPy, a Korean natural language processing library, and the BERT model. By learning various emotion classification datasets, it can analyze emotions such as positive, negative, and neutral in Korean text. KoBERT can be particularly useful for text data such as product reviews due to its high sentiment analysis performance.

KoELECTRA 모델은 한국어 ELECTRA 모델로서, 대용량의 한글 텍스트 데이터를 학습하여 다양한 자연어 처리 태스크에 적용할 수 있다. 상품 리뷰와 같은 텍스트 데이터에 대해서도 감정 분석 성능이 높아서 유용하게 사용할 수 있다.The KoELECTRA model is a Korean ELECTRA model that can be applied to various natural language processing tasks by learning large amounts of Korean text data. It can also be useful for text data such as product reviews due to its high emotional analysis performance.

HanBERT 모델은 한글 BERT 모델로서, 네이버 영화 리뷰 데이터를 학습하였고, 한글 텍스트에서 긍정, 부정, 중립 등의 감정을 분석할 수 있다. 상품 리뷰와 같은 텍스트 데이터에 대해서도 감정 분석 성능이 높아 많이 사용된다.The HanBERT model is a Korean BERT model that learned Naver movie review data and can analyze emotions such as positive, negative, and neutral in Korean text. It is also widely used for text data such as product reviews due to its high emotional analysis performance.

도 3을 참조하면, 일실시예로서 S301 단계에서 서버(100)는, 상기 웹사이트 또는 모바일앱을 통해 사용자 계정이 로그인되면, 사용자 계정 정보에서 사용자 단말을 통해 기설정된 관심 카테고리를 획득할 수 있다.Referring to FIG. 3, as an example, in step S301, when a user account is logged in through the website or mobile app, the server 100 may obtain a preset interest category from the user account information through the user terminal. .

상기 기설정된 관심 카테고리는 사용자 단말을 통해, 회원가입시 선택한 것일 수 있다. 또한, 상기 사용자 단말을 통해 복수의 관심 카테고리가 선택될 수 있다. 필요한 관심 카테고리만 남기고 나머지 카테고리는 숨김 처리를 하여 화면 구성을 명료하게 표시할 수도 있다.The preset interest category may be selected when registering as a member through the user terminal. Additionally, a plurality of interest categories may be selected through the user terminal. You can display the screen composition clearly by leaving only the necessary categories of interest and hiding the remaining categories.

S302 단계에서 서버(100)는, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제1 출력을 바탕으로 카테고리별 예상 판매량의 증가율이 높은 순서로 제품을 정렬하여 표시할 수 있다.In step S302, the server 100 may sort and display products in order of the highest growth rate of expected sales volume for each category based on the first output on the user terminal through the website or mobile app.

상기 카테고리는 총 판매량이 가장 많은 가격비교 사이트의 카테고리 대분류, 중분류, 소분류 명칭으로 지정할 수 있다. 또한, 카테고리별 예상 판매량의 증가율이 높은 순서로 제품을 정렬하여 표시할 때, 상위 10개까지 목록만 제공할 수도 있다. 그리고 목록 아래에 확장하기 버튼을 제공하여, 10개 이상의 전체 목록을 제공할 수 있다.The above categories can be designated as major, medium, or small categories of the price comparison site with the highest total sales volume. Additionally, when displaying products by sorting them in order of the highest expected sales volume growth by category, only the top 10 may be provided. And by providing an expand button below the list, you can provide a full list of 10 or more items.

또한, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시할 수 있다. 복수의 관심 카테고리가 선택된 경우에는, 정렬된 순서에 따라 가장 첫번째 순서의 관심 카테고리가 사용자 단말에서 가장 먼저 보이도록 표시할 수 있다. 또는, 판매 분야에 해당하지 않은 카테고리는 숨김처리하여 선택된 복수의 관심 카테고리만 표시되도록 사용자 단말을 통해 설정될 수도 있다.Additionally, the preset interest category can be posted so that it is exposed first. When a plurality of interest categories are selected, the interest category in the first order may be displayed to be visible first on the user terminal according to the sorted order. Alternatively, categories that do not fall into the sales field may be hidden and set through the user terminal to display only a plurality of selected categories of interest.

S303 단계에서 서버(100)는, 사용자 단말을 통해, 상기 예상 판매량의 증가가 큰 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 판매량, 후기수, 검색수 추이 및 예상 판매량을 그래프로 시각화하여 표시할 수 있다.In step S303, when one of the products posted in order of increasing expected sales volume is selected through the user terminal, the server 100 graphs the sales volume, number of reviews, number of searches, and expected sales volume of the selected product. It can be visualized and displayed.

상기 그래프는 시간의 흐름에 따라 달라지는 판매량, 후기수, 검색수 추이를 한 눈에 알아볼 수 있도록 꺾은 선 그래프나 막대 그래프로 시각화하는 것이 바람직할 것이다.It would be desirable to visualize the graph as a line graph or bar graph so that trends in sales, reviews, and searches that change over time can be seen at a glance.

데이터 시각화를 위하여 Plotly, Chart.js, Google Charts, Highcharts, D3.js 중 어느 하나를 이용할 수 있다.For data visualization, you can use any of Plotly, Chart.js, Google Charts, Highcharts, or D3.js.

또한, 판매량, 후기수, 검색수와 함께 예상 판매량도 함께 표시하여 앞으로 예상되는 판매량의 변화를 시각화 자료로 확인할 수 있다.In addition, expected sales volume is displayed along with sales volume, number of reviews, and number of searches, allowing you to check the change in expected sales volume through visualization.

S304 단계에서 서버(100)는, 상기 웹사이트 또는 모바일앱을 통해 상기 사용자 단말에, 상기 제2 출력을 바탕으로 카테고리별 긍정적 언급수 증가 제품을 언급수가 많은 순서로 정렬하여 표시할 수 있다.In step S304, the server 100 may display products with an increase in the number of positive mentions for each category in order of the number of mentions, based on the second output, on the user terminal through the website or mobile app.

긍정적 언급수 증가 제품은, 급격하게 증가되는 카테고리별 제품을 가장 상위에 표시할 수 있다.For products with an increase in the number of positive mentions, products in each category that are rapidly increasing can be displayed at the top.

또한, 상기 기설정된 관심 카테고리가 가장 먼저 노출되도록 게시할 수 있다.Additionally, the preset interest category can be posted so that it is exposed first.

S305 단계에서 서버(100)는, 사용자 단말을 통해, 상기 언급수가 많은 순서로 게시된 제품 중 하나가 선택되면, 상기 선택된 제품의 언급수 추이를 그래프로 시각화하여 표시할 수 있다.In step S305, when one of the products posted in the order of the highest number of mentions is selected through the user terminal, the server 100 may visualize and display the trend of the number of mentions of the selected product in a graph.

시간의 변화에 따른 언급수 변화를 알아보기 용이하도록 꺾인 선 그래프나 막내 그래프로 표시하고, x축에는 시간, y축에는 언급수를 나타내는 것이 바람직할 수 있다.To make it easier to recognize changes in the number of mentions over time, it may be desirable to display it as a broken line graph or a youngest graph, with time on the x-axis and the number of mentions on the y-axis.

일실시예로서, S306 단계에서 서버는, 상기 사용자 단말을 통해, 상기 웹사이트 또는 모바일앱에서 최하위 카테고리가 선택되면, 상기 제3 데이터와 상기 제3 인공지능 모델의 출력을 바탕으로, 상기 웹사이트 또는 모바일앱에 상기 선택된 최하위 카테고리의 판매량 1위 상품의 해시태그, 상품설명을 게시할 수 있고, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 긍정적, 중립적, 부정적인 리뷰 개수를 시각화하여 표시할 수 있으며, 상기 선택된 최하위 카테고리의 판매량 1위 상품의 판매량과 검색수 추이를 시각화하여 표시할 수 있다.As an example, in step S306, when the lowest category is selected on the website or mobile app through the user terminal, the server, based on the third data and the output of the third artificial intelligence model, Alternatively, you can post the hashtag and product description of the top-selling product in the selected lowest category on the mobile app, and visualize and display the number of positive, neutral, and negative reviews of the top-selling product in the selected lowest category, The sales volume and search count trends of the product with the highest sales volume in the selected lowest category can be visualized and displayed.

긍정적, 중립적, 부정적인 리뷰 개수는 원 그래프로 비율을 한눈에 볼 수 있게 시각화하는 것이 바람직할 것이며, 판매량과 검색수의 추이는 꺾은 선 그래프로 시각화하는 것이 바람직할 것이다.It would be desirable to visualize the number of positive, neutral, and negative reviews in a circle graph so that the ratio can be seen at a glance, and it would be desirable to visualize the trends in sales volume and number of searches as a line graph.

도 4을 참조하면, 일실시예로서, S401 단계에서 본 발명에 따른 서버(100)는, 사용자 단말을 통해 생성된, B2B 공급자가 공급하는 제품의 판매, 취소, 교환, 반품 데이터를 포함하는 제4 데이터를 수집할 수 있다.Referring to FIG. 4, as an example, in step S401, the server 100 according to the present invention generates a product including sales, cancellation, exchange, and return data of products supplied by a B2B supplier, generated through a user terminal. 4 Data can be collected.

사용자 단말을 통해 생성된 B2B 공급자가 공급하는 제품의 제4 데이터는, 사용자 단말을 통해 직접 입력된 데이터 또는 가격비교 사이트 서버에서 획득한 데이터일 수 있다.The fourth data of products supplied by B2B suppliers generated through the user terminal may be data directly input through the user terminal or data obtained from a price comparison site server.

다음으로, S402 단계에서 서버(100)는, 상기 B2B 공급자의 제품의 마진율, 취소율, 교환율, 반품율을 산출할 수 있다.Next, in step S402, the server 100 may calculate the margin rate, cancellation rate, exchange rate, and return rate of the B2B supplier's product.

또한, S403 단계에서 서버(100)는, 사용자 단말을 통해 생성된, B2C 판매자가 판매하는 제품의 판매, 취소, 교환, 반품 데이터를 포함하는 제5 데이터를 수집할 수 있다.Additionally, in step S403, the server 100 may collect fifth data generated through the user terminal, including sales, cancellation, exchange, and return data of products sold by B2C sellers.

상기 제5 데이터는 사용자 단말을 통해 입력된 데이터이거나 복수의 가격비교 사이트 서버에서 획득한 것일 수도 있다. 상기 제4 데이터와 제5 데이터는 사용자 또는 업체 서버를 통해 획득한 것일 수도 있다.The fifth data may be data input through a user terminal or may be obtained from a plurality of price comparison site servers. The fourth data and fifth data may be obtained through a user or company server.

상기 B2B 공급자와 B2C 판매자는 어떤 제품의 판매 과정 흐름의 구성요소일 수 있다. 예를 들어 Business to Business 거래에 의해 상품 A가 거래되었고, Business to Consumer 거래에 의해 상품 A가 판매된 경우, 이 상품 A의 B2B 거래와 B2C 거래 각각에서 발생하는 판매, 취소, 교환 반품 등의 데이터를 이용하여 상품 A의 전체 거래 흐름을 데이터 분석에 활용할 수 있다.The B2B supplier and B2C seller may be components of the sales process flow of a certain product. For example, if product A is traded through a business to business transaction and product A is sold through a business to consumer transaction, data such as sales, cancellations, exchanges and returns that occur in each B2B transaction and B2C transaction of this product A The entire transaction flow of product A can be used for data analysis.

일실시예로서 상기 제4 데이터와 제5 데이터는 기지정된 주기마다 수집할 수 있다.In one embodiment, the fourth data and fifth data may be collected at predetermined intervals.

S404 단계에서 서버(100)는, 상기 B2C 판매자의 제품의 마진율, 취소율, 교환율, 반품율을 산출할 수 있다.In step S404, the server 100 may calculate the margin rate, cancellation rate, exchange rate, and return rate of the B2C seller's product.

그리고 S405 단계에서 서버(100)는, 상기 제4 데이터와 상기 제5 데이터를 바탕으로, 기지정된 기준에 따라 문제 제품과 추천 제품을 추출할 수 있다.And in step S405, the server 100 may extract problem products and recommended products according to predetermined criteria based on the fourth data and the fifth data.

S406 단계에서 서버(100)는, 상기 웹사이트 또는 모바일앱에 상기 추출한 문제 제품과 추천 제품을 게시할 수 있다. 상기 문제 제품과 추천 제품을 복수 개 일 수 있고, 문제 제품과 추천 제품을 추출하기 위한 기준 값 대비 차이가 큰 순서대로 정렬할 수 있다.In step S406, the server 100 may post the extracted problem products and recommended products on the website or mobile app. There may be a plurality of problem products and recommended products, and they can be sorted in the order of the largest difference compared to the reference value for extracting the problem products and recommended products.

일실시예로서, 본 발명에 따른 서버(100)는, 카테고리별 문제 제품과 추천 제품을 추출하기 위한 점수를, [수학식 1]을 기반으로 산출할 수 있고,As an example, the server 100 according to the present invention can calculate scores for extracting problem products and recommended products for each category based on [Equation 1],

[수학식 1][Equation 1]

상기 상위 15% 이상, 하위 15% 이하로 지정된 기준치는 사용자 단말을 통해 변경될 수도 있다. 사용자 단말을 통해 상기 웹사이트나 모바일앱의 설정 기능이 선택되면, 상기 기준치를 사용자 편의에 의해 변경할 수 있다.The reference values designated as the top 15% or more and the bottom 15% or less may be changed through the user terminal. When the setting function of the website or mobile app is selected through the user terminal, the reference value can be changed according to the user's convenience.

상기 마진율(m), 판매율(p), 취소율(s), 교환율(t), 반품율(q)은 사용자 단말을 통해 가중치가 정해질 수 있다. 따라서 사용자가 중요하게 생각하는 데이터에 가중치를 많이 설정하여 상기 score 값을 산출하는 데 이용할 수 있다. 예를 들어, 취소율, 교환율, 반품율이 높아도 마진율과 판매율이 높은 제품을 추천받고 싶다면, 취소율, 교환율, 반품율의 가중치를 적게 지정하고, 마진율과 판매율의 가중치를 크게 지정하여 score 값을 산출할 수 있다.The weights of the margin rate (m), sales rate (p), cancellation rate (s), exchange rate (t), and return rate (q) can be determined through the user terminal. Therefore, it can be used to calculate the score value by setting a lot of weight to the data that the user considers important. For example, if you want to be recommended a product with a high margin and sales rate even if the cancellation rate, exchange rate, and return rate are high, specify a low weight for the cancellation rate, exchange rate, and return rate, and specify a large weight for the margin rate and sales rate to increase the score value. It can be calculated.

일실시예로서, 상기 마진율은 [수학식 2]를 기반으로 산출될 수 있고,As an example, the margin rate may be calculated based on [Equation 2],

[수학식 2][Equation 2]

상기 m은 마진율이고, x_c는 제품의 원가이며, x_sp는 제품의 공급가 또는 판매가일 수 있다.Where m is the margin rate, x_c is the cost of the product, and x_sp may be the supply price or sales price of the product.

판매가(x_sp)가 고정값일 때, 원가(x_c)가 증가하면 마진율(m)은 감소하고, 원가(x_c)가 감소하면 마진율(m)은 증가할 수 있다. 제품의 원가(x_c)가 고정값일 때, 공급가 또는 판매가(x_sp)가 증가하면 마진율(m)은 증가할 수 있고, 공급가 또는 판매가(x_sp)가 감소하면 마진율(m)은 감소할 수 있다.When the selling price (x_sp) is a fixed value, if the cost (x_c) increases, the margin rate (m) may decrease, and if the cost (x_c) decreases, the margin rate (m) may increase. When the cost of a product (x_c) is a fixed value, the margin rate (m) can increase if the supply price or sales price (x_sp) increases, and the margin rate (m) can decrease if the supply price or sales price (x_sp) decreases.

또한, 원가(x_c)와, 공급가 또는 판매가(x_sp)가 유사한 값일수록 마진율(m)이 감소할 수 있다.Additionally, the margin rate (m) may decrease as the cost (x_c) and the supply or sales price (x_sp) are similar in value.

이상에서 설명된 실시예들은 하드웨어 구성요소, 소프트웨어 구성요소, 및/또는 하드웨어 구성요소 및 소프트웨어 구성요소의 조합으로 구현될 수 있다. 예를 들어, 실시예들에서 설명된 장치, 방법 및 구성요소는, 예를 들어, 프로세서, 콘트롤러, ALU(arithmetic logic unit), 디지털 신호 프로세서(digital signal processor), 마이크로컴퓨터, FPGA(field programmable gate array), PLU(programmable logic unit), 마이크로프로세서, 또는 명령(instruction)을 실행하고 응답할 수 있는 다른 어떠한 장치와 같이, 하나 이상의 범용 컴퓨터 또는 특수 목적 컴퓨터를 이용하여 구현될 수 있다. 처리 장치는 운영 체제(OS) 및 상기 운영 체제 상에서 수행되는 하나 이상의 소프트웨어 애플리케이션을 수행할 수 있다. 또한, 처리 장치는 소프트웨어의 실행에 응답하여, 데이터를 접근, 저장, 조작, 처리 및 생성할 수도 있다. 이해의 편의를 위하여, 처리 장치는 하나가 사용되는 것으로 설명된 경우도 있지만, 해당 기술분야에서 통상의 지식을 가진 자는, 처리 장치가 복수 개의 처리 요소(processing element) 및/또는 복수 유형의 처리 요소를 포함할 수 있음을 알 수 있다. 예를 들어, 처리 장치는 복수 개의 프로세서 또는 하나의 프로세서 및 하나의 콘트롤러를 포함할 수 있다. 또한, 병렬 프로세서(parallel processor)와 같은, 다른 처리 구성(processing configuration)도 가능하다.The embodiments described above may be implemented with hardware components, software components, and/or a combination of hardware components and software components. For example, the devices, methods, and components described in the embodiments may include, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, and a field programmable gate (FPGA). It may be implemented using one or more general-purpose or special-purpose computers, such as an array, programmable logic unit (PLU), microprocessor, or any other device capable of executing and responding to instructions. A processing device may execute an operating system (OS) and one or more software applications that run on the operating system. Additionally, a processing device may access, store, manipulate, process, and generate data in response to the execution of software. For ease of understanding, a single processing device may be described as being used; however, those skilled in the art will understand that a processing device includes multiple processing elements and/or multiple types of processing elements. It can be seen that it may include. For example, a processing device may include a plurality of processors or one processor and one controller. Additionally, other processing configurations, such as parallel processors, are possible.

실시예에 따른 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 실시예를 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 상기된 하드웨어 장치는 실시예의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The method according to the embodiment may be implemented in the form of program instructions that can be executed through various computer means and recorded on a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, etc., singly or in combination. Program instructions recorded on the medium may be specially designed and configured for the embodiment or may be known and available to those skilled in the art of computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. -Includes optical media (magneto-optical media) and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, etc. Examples of program instructions include machine language code, such as that produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter, etc. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the embodiments, and vice versa.

소프트웨어는 컴퓨터 프로그램(computer program), 코드(code), 명령(instruction), 또는 이들 중 하나 이상의 조합을 포함할 수 있으며, 원하는 대로 동작하도록 처리 장치를 구성하거나 독립적으로 또는 결합적으로(collectively) 처리 장치를 명령할 수 있다. 소프트웨어 및/또는 데이터는, 처리 장치에 의하여 해석되거나 처리 장치에 명령 또는 데이터를 제공하기 위하여, 어떤 유형의 기계, 구성요소(component), 물리적 장치, 가상 장치(virtual equipment), 컴퓨터 저장 매체 또는 장치, 또는 전송되는 신호 파(signal wave)에 영구적으로, 또는 일시적으로 구체화(embody)될 수 있다. 소프트웨어는 네트워크로 연결된 컴퓨터 시스템 상에 분산되어서, 분산된 방법으로 저장되거나 실행될 수도 있다. 소프트웨어 및 데이터는 하나 이상의 컴퓨터 판독 가능 기록 매체에 저장될 수 있다.Software may include a computer program, code, instructions, or a combination of one or more of these, which may configure a processing unit to operate as desired, or may be processed independently or collectively. You can command the device. Software and/or data may be used on any type of machine, component, physical device, virtual equipment, computer storage medium or device to be interpreted by or to provide instructions or data to a processing device. , or may be permanently or temporarily embodied in a transmitted signal wave. Software may be distributed over networked computer systems and stored or executed in a distributed manner. Software and data may be stored on one or more computer-readable recording media.

이상과 같이 실시예들이 비록 한정된 도면에 의해 설명되었으나, 해당 기술분야에서 통상의 지식을 가진 자라면 상기를 기초로 다양한 기술적 수정 및 변형을 적용할 수 있다. 예를 들어, 설명된 기술들이 설명된 방법과 다른 순서로 수행되거나, 및/또는 설명된 시스템, 구조, 장치, 회로 등의 구성요소들이 설명된 방법과 다른 형태로 결합 또는 조합되거나, 다른 구성요소 또는 균등물에 의하여 대치되거나 치환되더라도 적절한 결과가 달성될 수 있다.Although the embodiments have been described with limited drawings as described above, those skilled in the art can apply various technical modifications and variations based on the above. For example, the described techniques are performed in a different order than the described method, and/or components of the described system, structure, device, circuit, etc. are combined or combined in a different form than the described method, or other components are used. Alternatively, appropriate results may be achieved even if substituted or substituted by an equivalent.

그러므로, 다른 구현들, 다른 실시예들 및 특허청구범위와 균등한 것들도 후술하는 청구범위의 범위에 속한다.Therefore, other implementations, other embodiments, and equivalents of the claims also fall within the scope of the following claims.

100: 서버100: Server

Claims

In a sales product recommendation system based on artificial intelligence-based trends,
Includes a server that recommends products for sale based on artificial intelligence-based trends,
The server is,
Operate a website or mobile app,
Through multiple price comparison site servers, primary data including sales volume, number of reviews, and number of searches for products by category are collected on a weekly basis,
The first data is input to a first artificial intelligence model that has been previously trained to predict an increase in sales using the sales volume, number of reviews, and number of searches as learning data, and a first output is obtained, wherein the first artificial intelligence model has a regression pattern. Based on an analysis algorithm (regression analysis),
Collecting secondary data including text every other day through a plurality of social media servers,
The second data is input to a second artificial intelligence model that has been previously trained to output products with an increased number of positive mentions, and a second output is obtained. The second artificial intelligence model is TF-IDF, Word2Vec for text mining. , based on one of the algorithms LDA, Deep Learning algorithm for sentiment analysis, ARIMA or Prophet algorithm for time series analysis,
Keywords for text mining are:
These are the names of the major, medium, and small categories of the price comparison site with the highest total sales volume.
The server is,
Through the price comparison site server with the highest total sales, third-party data including reviews, hashtags, product descriptions, sales volume, and number of searches for products with the highest sales volume in each lowest category are collected on a weekly basis.
The pre-trained third artificial intelligence model analyzes the reviews of the top-selling product, and outputs the number of positive, neutral, and negative reviews as a result of the sentiment analysis. The third artificial intelligence model is KoBERT, KoELECTRA, and HanBERT. Use one of the models,
When the user account is logged in through the website or mobile app, a preset interest category is obtained from the user account information through the user terminal,
Products are sorted and displayed on the user terminal through the website or mobile app in the order of the highest growth rate of expected sales volume for each category based on the first output, and the preset interest categories are displayed first,
When one of the products posted in order of increasing expected sales volume is selected through the user terminal, the sales volume, number of reviews, number of searches, and expected sales volume of the selected product are visualized and displayed in a graph,
On the user terminal through the website or mobile app, based on the second output, products with an increased number of positive mentions for each category are displayed in order of the highest number of mentions, with the preset interest category displayed first. ,
When one of the products posted in order of the number of mentions is selected through the user terminal, the trend of the number of mentions of the selected product is visualized and displayed in a graph,
When the lowest category is selected on the website or mobile app through the user terminal, based on the third data and the output of the third artificial intelligence model, the sales volume of the lowest category selected on the website or mobile app is 1. Post the hashtag and product description of the product above, visualize and display the number of positive, neutral, and negative reviews of the product with the highest sales volume in the selected lowest category, and trend the sales volume and number of searches for the product with the highest sales volume in the lowest category selected above. Visualize and display,
The server is,
Collects fourth data generated through the user terminal, including sales, cancellation, exchange, and return data of products supplied by B2B suppliers, and calculates the margin rate, cancellation rate, exchange rate, and return rate of the B2B supplier's products; ,
Collect the fifth data generated through the user terminal, including sales, cancellation, exchange, and return data of products sold by B2C sellers, and calculate the margin rate, cancellation rate, exchange rate, and return rate of the B2C seller's products, ,
Based on the fourth data and the fifth data, problem products and recommended products are extracted according to predetermined criteria,
Post the extracted problem products and recommended products on the website or mobile app,
The scores for extracting the problem products and recommended products are calculated based on [Equation 1],
[Equation 1]

The score is a score for extracting problem products and recommended products, m is the margin rate, p is the sales rate, s is the cancellation rate, t is the exchange rate, and q is the return rate,
If the score is above the top 15% of the score distribution for each category, it is extracted as a problem product, and if it is below the bottom 15%, it is extracted as a recommended product,
The margin rate can be calculated based on [Equation 2],
[Equation 2]

where m is the margin rate, and is the cost of the product, and is the supply price or selling price of the product, system.

delete