TW202232375A - Information processing device, inference method, and inference program - Google Patents

Information processing device, inference method, and inference program Download PDF

Info

Publication number
TW202232375A
TW202232375A TW110127304A TW110127304A TW202232375A TW 202232375 A TW202232375 A TW 202232375A TW 110127304 A TW110127304 A TW 110127304A TW 110127304 A TW110127304 A TW 110127304A TW 202232375 A TW202232375 A TW 202232375A
Authority
TW
Taiwan
Prior art keywords
user
information
skeleton
images
information processing
Prior art date
Application number
TW110127304A
Other languages
Chinese (zh)
Inventor
井手美優
Original Assignee
日商三菱電機股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日商三菱電機股份有限公司 filed Critical 日商三菱電機股份有限公司
Publication of TW202232375A publication Critical patent/TW202232375A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

An information items processing device (100) has: an acquisition unit (120) that acquires a plurality of images including a user and having different imaging times from each other; an extraction unit (130) for extracting, on the basis of the plurality of images, a plurality of skeleton information items that are information items indicating skeleton feature points of the user; a movement amount calculation unit (140) that, on the basis of the plurality of skeleton information items, calculates a movement amount of the user in a preset period; a determination unit (150) that determines whether or not the movement amount is equal to or less than a preset threshold value; and an inference unit (160) for, when the movement amount is equal to or less than the threshold value, inferring that the emotion of the user is discomfort.

Description

資訊處理裝置、估測方法、以及記錄媒體Information processing device, estimation method, and recording medium

本揭露關於資訊處理裝置、估測方法、以及記錄有估測程式的記錄媒體。The present disclosure relates to an information processing device, an estimation method, and a recording medium on which the estimation program is recorded.

近年來,期望依照人類的狀況來控制機器。舉例來說,以示意人類的狀況來說,有人類的行動或是心情。舉例來說,專利文獻1當中,有記載可以從透過時間序列追蹤臉部以及手的位置得到的該等一舉一動當中,來推測心情、發話的意圖等。 [先前技術文獻] [專利文獻] In recent years, it has been desired to control machines in accordance with human conditions. For example, to indicate the human condition, there are human actions or moods. For example, Patent Document 1 describes that it is possible to estimate the mood, the intention of speaking, and the like from the actions obtained by tracking the position of the face and the hand through time series. [Prior Art Literature] [Patent Literature]

[專利文獻1] 日本專利 特開2007-058615號公報[Patent Document 1] Japanese Patent Laid-Open No. 2007-058615

[發明所欲解決的課題][Problems to be solved by the invention]

如上所述,若大幅擺動手、腳等身體的部位時,可以估測心情。另一方面,若動作較小時,則無法用上述的想法來進行估測。As mentioned above, if you swing your body parts such as your hands and feet a lot, you can estimate your mood. On the other hand, if the motion is small, the above-mentioned thinking cannot be used for estimation.

本揭露的目的,可以估測基於小動作的心情。 [用以解決課題的手段] For the purpose of this disclosure, it is possible to estimate the mood based on small actions. [means to solve the problem]

提供關於本揭露的一態樣的資訊處理裝置。資訊處理裝置包含:取得部,取得複數個影像;該複數個影像包含使用者,且拍攝時刻各自相異;取出部,基於該複數個影像,取出複數個骨骼資訊;該複數個骨骼資訊示意該使用者的骨骼特徵點;動作量算出部,基於該複數個骨骼資訊,算出事先設定的期間內的該使用者的動作量;判定部,判定該動作量是否在設定的臨界值以下;以及估測部,當該動作量在該臨界值以下時,估測該使用者的心情為不悅。 [發明的效果] An information processing apparatus related to an aspect of the present disclosure is provided. The information processing device includes: an acquisition unit that acquires a plurality of images; the plurality of images includes a user, and the shooting times are different; an extraction unit, based on the plurality of images, retrieves a plurality of skeleton information; the plurality of skeleton information indicates the The skeletal feature points of the user; the motion amount calculation unit, based on the plurality of skeleton information, calculates the user's motion amount within a preset period; the determination unit determines whether the motion amount is below a set threshold value; The measuring unit estimates that the user's mood is unhappy when the movement amount is below the threshold value. [Effect of invention]

根據本揭露,可以估測基於小動作的心情。According to the present disclosure, it is possible to estimate a mood based on small actions.

以下,一邊參照圖式一邊說明實施形態。以下的實施形態僅為範例,在本揭露的範圍內可以進行各式各樣的變更。Hereinafter, embodiments will be described with reference to the drawings. The following embodiments are merely examples, and various modifications can be made within the scope of the present disclosure.

實施形態. 第1圖為一示意圖,示意實施形態的控制系統。控制系統包含:資訊處理裝置100、攝影裝置200、以及控制對象機器300。資訊處理裝置100、攝影裝置200、以及控制對象機器300透過網路進行通訊。 資訊處理裝置100為執行估測方法的裝置。資訊處理裝置100也可以稱為控制裝置或心情估測裝置。 Implementation form. Fig. 1 is a schematic diagram showing the control system of the embodiment. The control system includes an information processing device 100 , a photographing device 200 , and a control object device 300 . The information processing device 100 , the photographing device 200 , and the control target device 300 communicate through a network. The information processing apparatus 100 is an apparatus for executing the estimation method. The information processing device 100 may also be referred to as a control device or a mood estimation device.

首先,說明資訊處理裝置100包含的硬體。資訊處理裝置100包含:處理器101、揮發性記憶裝置102、非揮發性記憶裝置103、以及通訊介面104。First, the hardware included in the information processing apparatus 100 will be described. The information processing device 100 includes a processor 101 , a volatile memory device 102 , a non-volatile memory device 103 , and a communication interface 104 .

處理器101控制資訊處理裝置100整體。舉例來說,處理器101是中央處理器(Central Processing Unit,CPU),場域可程式化邏輯閘陣列(Field Programmable Gate Array,FPGA)等。處理器101也可以是多處理器。另外,資訊處理裝置100也可以包含處理電路。處理電路也可以是單一電路或是複合電路。The processor 101 controls the entire information processing device 100 . For example, the processor 101 is a central processing unit (Central Processing Unit, CPU), a field programmable gate array (Field Programmable Gate Array, FPGA) and the like. The processor 101 may also be a multiprocessor. In addition, the information processing apparatus 100 may also include a processing circuit. The processing circuit may also be a single circuit or a composite circuit.

揮發性記憶裝置102為資訊處理裝置100的主記憶裝置。舉例來說,揮發性記憶裝置102為隨機存取記憶體(Random Access Memory,RAM)。非揮發性記憶裝置103為資訊處理裝置100的輔助記憶裝置。舉例來說,非揮發性記憶裝置103為硬碟(Hard Disk Drive,HDD)或是固態硬碟(Solid State Drive,SSD)。 通訊介面104與攝影裝置200以及控制對象機器300進行通訊。 The volatile memory device 102 is the main memory device of the information processing device 100 . For example, the volatile memory device 102 is a random access memory (Random Access Memory, RAM). The non-volatile memory device 103 is an auxiliary memory device of the information processing device 100 . For example, the non-volatile memory device 103 is a Hard Disk Drive (HDD) or a Solid State Drive (SSD). The communication interface 104 communicates with the photographing device 200 and the control target device 300 .

舉例來說,攝影裝置200為監視攝影機、駕駛監控系統(Driver Monitor System,DMS)或是紅外線攝影機。攝影裝置200拍攝使用者。 資訊處理裝置100從攝影裝置200取得複數個影像(換言之,映像)。資訊處理裝置100基於複數個影像,估測使用者的心情。資訊處理裝置100可以基於估測的心情的資訊,對控制對象機器300進行控制。此處,示意控制對象機器300的控制的一例。 For example, the photographing device 200 is a surveillance camera, a driver monitor system (DMS) or an infrared camera. The photographing device 200 photographs the user. The information processing device 100 acquires a plurality of images (in other words, images) from the imaging device 200 . The information processing apparatus 100 estimates the user's mood based on the plurality of images. The information processing device 100 can control the control target device 300 based on the estimated mood information. Here, an example of the control of the control target apparatus 300 is shown.

第2圖為一示意圖,示意實施形態的控制對象機器的控制的具體例。第2圖示意電機的升降機301,作為控制對象機器300之例。另外,第2圖當中,示意進行小動作的使用者的上視圖。 攝影裝置200設置於天花板。攝影裝置200拍攝使用者。資訊處理裝置100從攝影裝置200取得複數個影像。資訊處理裝置100基於複數個影像,估測使用者的心情。資訊處理裝置100基於估測的心情的資訊,控制升降機301。 FIG. 2 is a schematic diagram showing a specific example of the control of the apparatus to be controlled according to the embodiment. FIG. 2 shows an elevator 301 of a motor as an example of the apparatus 300 to be controlled. In addition, in FIG. 2, the top view which shows the user who performs small movements is shown. The photographing device 200 is installed on the ceiling. The photographing device 200 photographs the user. The information processing device 100 acquires a plurality of images from the photographing device 200 . The information processing apparatus 100 estimates the user's mood based on the plurality of images. The information processing device 100 controls the elevator 301 based on the estimated mood information.

接著,說明資訊處理裝置100具有的機能。 第3圖為一方塊圖,示意實施形態的資訊處理裝置的機能。資訊處理裝置100包含:記憶部110、取得部120、取出部130、動作量算出部140、判定部150、以及估測部160。 Next, the functions of the information processing apparatus 100 will be described. FIG. 3 is a block diagram showing the function of the information processing apparatus according to the embodiment. The information processing apparatus 100 includes a storage unit 110 , an acquisition unit 120 , a retrieval unit 130 , an operation amount calculation unit 140 , a determination unit 150 , and an estimation unit 160 .

記憶部110也可以實現作為揮發性記憶裝置102或非揮發性記憶裝103當中保留的記憶區域。 取得部120、取出部130、動作量算出部140、判定部150、以及估測部160的一部分或全部,也可以由處理電路來實現。另外,取得部120、取出部130、動作量算出部140、判定部150、以及估測部160的一部分或全部,也可以作為處理器101執行的程式的模組來實現。舉例來說,處理器101執行的程式也稱為估測程式。舉例來說,估測程式記錄於記錄媒體。 The memory unit 110 can also be implemented as a memory area reserved in the volatile memory device 102 or the non-volatile memory device 103 . Some or all of the acquisition unit 120 , the extraction unit 130 , the motion amount calculation unit 140 , the determination unit 150 , and the estimation unit 160 may be implemented by a processing circuit. In addition, some or all of the acquisition unit 120 , the extraction unit 130 , the motion amount calculation unit 140 , the determination unit 150 , and the estimation unit 160 may be implemented as modules of programs executed by the processor 101 . For example, the program executed by the processor 101 is also referred to as an estimation program. For example, the estimation program is recorded on a recording medium.

取得部120從攝影裝置200取得拍攝時刻各自相異的複數個影像。取得部120也可以從外部裝置(例如,雲端伺服器)取得複數個影像。複數個影像的各個,是在相異時刻拍攝使用者所得到的影像。因此,複數個影像的各個當中,包含有使用者。The acquisition unit 120 acquires, from the imaging device 200 , a plurality of images whose shooting times are different from each other. The acquisition unit 120 may acquire a plurality of images from an external device (eg, a cloud server). Each of the plurality of images is an image captured by the user at different times. Therefore, the user is included in each of the plurality of images.

取出部130基於複數個影像,取出複數個骨骼資訊。複數個骨骼資訊的各個,是示意使用者的骨骼特徵點的資訊。舉例來說,示意使用者的骨骼特徵點的資訊為座標。以下,假設示意使用者的骨骼特徵點的資訊為座標。The extraction unit 130 extracts a plurality of skeleton information based on a plurality of images. Each of the plurality of skeleton information is information indicating a skeleton feature point of the user. For example, the information indicating the skeleton feature points of the user is the coordinates. Hereinafter, it is assumed that the information indicating the skeletal feature points of the user is the coordinates.

首先,說明基於1個影像取出1個骨骼資訊的情況。此處,示意骨骼資訊之例。First, the case of extracting one piece of skeleton information based on one image will be described. Here, an example of bone information is shown.

第4圖為一示意圖,示意實施形態的骨骼資訊之例。第4圖示意影像400,也就是複數個影像之中的1個影像。取出部130使用習知技術取出骨骼資訊500。習知技術舉例來說,為OpenPose或是OpenNet。骨骼資訊500是示意使用者的骨骼特徵點的座標。舉例來說,第4圖的座標501示意使用者的耳朵的骨骼特徵點。另外,該座標以X座標與Y座標來表示。FIG. 4 is a schematic diagram showing an example of bone information in an embodiment. FIG. 4 illustrates an image 400, that is, one image among a plurality of images. The extracting part 130 extracts the bone information 500 using conventional techniques. Examples of conventional technologies are OpenPose or OpenNet. The skeleton information 500 is the coordinates indicating the skeleton feature points of the user. For example, the coordinate 501 in FIG. 4 represents the skeletal feature point of the user's ear. In addition, the coordinates are represented by an X coordinate and a Y coordinate.

第4圖示意影像400當中包含1位使用者的情況。若影像當中包含有複數個使用者時,取出部130取出複數個使用者的各個的骨骼資訊。FIG. 4 illustrates a case where one user is included in the image 400 . If the image includes a plurality of users, the extracting unit 130 extracts the skeleton information of each of the plurality of users.

像這樣,取出部130基於1個影像,取出1個骨骼資訊。同樣地,取出部130針對複數個影像的各個,進行骨骼資訊的取出處理。藉此,可以得到複數個骨骼資訊。In this way, the extraction unit 130 extracts one piece of bone information based on one video. Similarly, the extraction unit 130 performs extraction processing of skeleton information for each of the plurality of images. In this way, a plurality of bone information can be obtained.

此處,控制系統當中也可以包含複數個攝影裝置。複數個攝影裝置的各個,設置於相異的位置。複數個攝影裝置的各個,從相異的方向拍攝使用者。若控制系統當中包含複數個攝影裝置時,取得部120取得複數個攝影裝置於同一時刻拍攝使用者所得到的複數個影像。舉例來說,取得部120從複數個攝影裝置當中,取得該複數個影像。另外,取得部120取得複數個攝影裝置的各自的位置資訊。舉例來說,取得部120從記憶部110取得複數個攝影裝置的各自的位置資訊。取出部130基於該複數個影像、以及複數個攝影裝置的各自的位置資訊,取出1個骨骼資訊。此處,舉例來說,當某個攝影裝置從某個方向拍攝使用者時,有可能拍攝不到使用者的一部分。然後,藉由其他的攝影裝置從相異的方向拍攝使用者,來拍攝該一部分。藉此,取出部130基於複數個攝影裝置的各個從相異的方向拍攝使用者所得到的複數個影像,取出骨骼資訊。藉此,資訊處理裝置100可以得到高精度的骨骼資訊。Here, a plurality of imaging devices may be included in the control system. Each of the plurality of photographing devices is installed at a different position. Each of the plurality of photographing devices photograph the user from different directions. If the control system includes a plurality of photographing devices, the obtaining unit 120 obtains a plurality of images obtained by the plurality of photographing devices photographing the user at the same time. For example, the obtaining unit 120 obtains the plurality of images from the plurality of photographing devices. In addition, the acquisition unit 120 acquires the position information of each of the plurality of imaging devices. For example, the obtaining unit 120 obtains the respective position information of the plurality of photographing devices from the memory unit 110 . The extraction unit 130 extracts one piece of bone information based on the plurality of images and the respective position information of the plurality of imaging devices. Here, for example, when a certain photographing device photographs the user from a certain direction, it is possible that a part of the user cannot be photographed. Then, the part is photographed by photographing the user from a different direction by another photographing device. Thereby, the extracting unit 130 extracts the skeleton information based on the plurality of images obtained by each of the plurality of imaging devices photographing the user from different directions. In this way, the information processing apparatus 100 can obtain high-precision bone information.

動作量算出部140基於複數個骨骼資訊,算出事先設定的期間T當中,使用者的動作量FP。動作量FP是透過在期間T當中,骨骼資訊示意的座標(換言之,骨骼特徵點)的移動量的平均所算出的。假設時刻t(0<t<T)時第i(1<i<N)號座標為(Xi(t)、Yi(t))時,動作量FP可以用算式(1)表示。The motion amount calculation unit 140 calculates the user's motion amount FP during a predetermined period T based on the plurality of pieces of skeleton information. The movement amount FP is calculated by averaging the movement amounts of the coordinates indicated by the skeleton information (in other words, the skeleton feature points) during the period T. Assuming that the ith (1<i<N) coordinate at time t (0<t<T) is (Xi(t), Yi(t)), the action amount FP can be expressed by the formula (1).

[算式1]

Figure 02_image001
[Equation 1]
Figure 02_image001

判定部150判定動作量FP是否在事先設定的臨界值以下。若動作量FP在事先設定的臨界值以下時,可以說使用者的動作較小。 當動作量FP在該臨界值以下時,估測部160估測使用者的心情為不悅。 The determination unit 150 determines whether or not the motion amount FP is equal to or less than a predetermined threshold value. When the motion amount FP is equal to or less than a predetermined threshold value, it can be said that the user's motion is small. When the motion amount FP is below the threshold value, the estimation unit 160 estimates that the user's mood is unpleasant.

此處,舉例來說,使用者在待機時產生不悅。使用者的不悅感增加時,會進行抖腳、翹腳、雙臂交叉等動作。然後,估測部160為了估測不悅的動作的種類,也可以進行以下的處理。估測部160使用複數個骨骼資訊,算出三角函數“Asin(2πft+ϕ)”。估測部160近似三角函數“Asin(2πft+ϕ)”。藉此得到“Asin(2πft+ϕ)+B”。此處,以示意“Asin(2πft+ϕ)+B”的圖表進行例示。Here, for example, the user is displeased while on standby. When the user's unpleasantness increases, he/she will perform actions such as shaking his feet, tilting his feet, and crossing his arms. Then, the estimation unit 160 may perform the following processing in order to estimate the type of unpleasant action. The estimation unit 160 calculates the trigonometric function "Asin(2πft+ϕ)" using the plurality of skeleton information. The estimation unit 160 approximates the trigonometric function "Asin(2πft+ϕ)". This results in "Asin(2πft+ϕ)+B". Here, a graph showing "Asin(2πft+ϕ)+B" is used as an example.

第5圖為一示意圖,示意實施形態的圖表之例。縱軸示意骨骼特徵點的位置。舉例來說,縱軸示意腳的骨骼特徵點。骨骼特徵點的位置可以考慮為示意骨骼特徵點的座標的X座標,也可以考慮為該座標的Y座標。橫軸示意時刻t。 如圖表所示,估測部160可以基於複數個骨骼資訊,算出“Asin(2πft+ϕ)+B”。 Fig. 5 is a schematic diagram showing an example of a graph of the embodiment. The vertical axis indicates the position of the bone feature points. For example, the vertical axis represents the bone feature points of the foot. The position of the skeleton feature point can be considered as the X coordinate indicating the coordinates of the skeleton feature point, and can also be considered as the Y coordinate of the coordinate. The horizontal axis indicates time t. As shown in the graph, the estimation unit 160 can calculate “Asin(2πft+ϕ)+B” based on a plurality of skeleton information.

估測部160基於“Asin(2πft+φ)+B”算出頻率。若振幅A在事先設定的臨界值以上時,估測部160估測不悅的動作為翹腳或是雙臂交叉。若頻率在事先設定的臨界值以上時,估測部160估測不悅的動作為抖腳。 藉此,估測部160基於複數個骨骼資訊,估測不悅的動作的種類。藉此,資訊處理裝置100可以估測不悅的動作的種類。 The estimation unit 160 calculates the frequency based on "Asin(2πft+φ)+B". If the amplitude A is greater than or equal to a preset threshold value, the estimation unit 160 estimates that the unpleasant action is lifting the feet or crossing the arms. If the frequency is above a preset threshold, the estimating unit 160 estimates that the unpleasant action is foot shaking. Thereby, the estimation part 160 estimates the kind of unpleasant motion based on a plurality of skeleton information. Thereby, the information processing apparatus 100 can estimate the type of unpleasant action.

估測部160也可以基於使用者停留的時間、以及估測為不悅的時間,算出不悅程度。舉例來說,估測部160基於複數個影像,算出使用者於同一位置停留的時間。估測部160基於複數個影像之中的複數個影像,算出估測為不悅的時間。估測部160基於使用者停留的時間、以及估測為不悅的時間,算出不悅程度。藉此,資訊處理裝置100可以檢測出使用者的不悅程度。資訊處理裝置100也可以對控制對象機器300,進行用以減低不悅程度的控制。The estimation unit 160 may calculate the degree of discomfort based on the time that the user stayed and the time when the user was estimated to be unhappy. For example, the estimation unit 160 calculates the time that the user stays at the same location based on the plurality of images. The estimation part 160 calculates the time estimated to be unpleasant based on the plurality of videos among the plurality of videos. The estimation unit 160 calculates the degree of discomfort based on the time spent by the user and the time estimated to be unpleasant. Thereby, the information processing apparatus 100 can detect the user's dissatisfaction level. The information processing device 100 may also perform control for reducing the degree of discomfort to the control target device 300 .

接著,使用流程圖說明資訊處理裝置100執行的處理。 第6圖為一流程圖,示意關於實施形態的資訊處理裝置執行的處理之例。 (步驟S11)取得部120從攝影裝置200取得拍攝時刻各自相異的複數個影像。 (步驟S12)取出部130基於複數個影像,取出複數個骨骼資訊。 (步驟S13)動作量算出部140基於複數個骨骼資訊,算出使用者的動作量FP。 Next, the processing performed by the information processing apparatus 100 will be described using a flowchart. FIG. 6 is a flowchart showing an example of processing executed by the information processing apparatus according to the embodiment. (Step S11 ) The acquisition unit 120 acquires, from the imaging device 200 , a plurality of video images whose shooting times are different from each other. (Step S12 ) The extraction unit 130 extracts a plurality of skeleton information based on a plurality of images. (Step S13 ) The motion amount calculation unit 140 calculates the motion amount FP of the user based on the plurality of pieces of skeleton information.

(步驟S14)判定部150判定動作量FP是否在事先設定的臨界值以下。若動作量FP在該臨界值以下時,處理進入步驟S15。若動作量FP大於該臨界值時,則結束處理。 (步驟S15)估測部160估測使用者的心情為不悅。 (Step S14 ) The determination unit 150 determines whether or not the motion amount FP is equal to or less than a predetermined threshold value. When the motion amount FP is equal to or less than the threshold value, the process proceeds to step S15. When the motion amount FP is larger than the threshold value, the process ends. (Step S15 ) The estimation unit 160 estimates that the user's mood is unpleasant.

如以上所述,根據實施形態,資訊處理裝置100可以估測基於小動作的心情。As described above, according to the embodiment, the information processing apparatus 100 can estimate the mood based on small actions.

100:資訊處理裝置 101:處理器 102:揮發性記憶裝置 103:非揮發性記憶裝置 104:通訊介面 110:記憶部 120:取得部 130:取出部 140:動作量算出部 150:判定部 160:估測部 200:攝影裝置 300:控制對象機器 301:升降機 400:影像 500:骨骼資訊 501:座標 S11~S15:步驟 100: Information processing device 101: Processor 102: Volatile memory device 103: Non-volatile memory device 104: Communication interface 110: Memory Department 120: Acquire Department 130: Take out part 140: Motion amount calculation section 150: Judgment Department 160: Estimation Department 200: Photographic Installation 300: Control the target machine 301: Lift 400: Video 500: Bone Information 501: Coordinates S11~S15: Steps

第1圖為一示意圖,示意實施形態的控制系統。 第2圖為一示意圖,示意實施形態的控制對象機器的控制的具體例。 第3圖為一方塊圖,示意實施形態的資訊處理裝置的機能。 第4圖為一示意圖,示意實施形態的骨骼資訊。 第5圖為一示意圖,示意實施形態的圖表。 第6圖為一流程圖,示意實施形態的資訊處理裝置執行的處理之例。 Fig. 1 is a schematic diagram showing the control system of the embodiment. FIG. 2 is a schematic diagram showing a specific example of the control of the apparatus to be controlled according to the embodiment. FIG. 3 is a block diagram showing the function of the information processing apparatus according to the embodiment. FIG. 4 is a schematic diagram illustrating bone information of an embodiment. Fig. 5 is a schematic diagram showing a diagram of an embodiment. FIG. 6 is a flowchart showing an example of processing executed by the information processing apparatus of the embodiment.

100:資訊處理裝置 100: Information processing device

110:記憶部 110: Memory Department

120:取得部 120: Acquire Department

130:取出部 130: Take out part

140:動作量算出部 140: Motion amount calculation section

150:判定部 150: Judgment Department

160:估測部 160: Estimation Department

Claims (6)

一種資訊處理裝置,包含: 取得部,取得複數個影像;該複數個影像包含使用者,且拍攝時刻各自相異; 取出部,基於該複數個影像,取出複數個骨骼資訊;該複數個骨骼資訊示意該使用者的骨骼特徵點; 動作量算出部,基於該複數個骨骼資訊,算出事先設定的期間內的該使用者的動作量; 判定部,判定該動作量是否在設定的臨界值以下;以及 估測部,當該動作量在該臨界值以下時,估測該使用者的心情為不悅。 An information processing device, comprising: the obtaining part obtains a plurality of images; the plurality of images include the user, and the shooting times are different from each other; The extraction part, based on the plurality of images, extracts a plurality of skeleton information; the plurality of skeleton information indicates the skeleton feature points of the user; an amount of motion calculation unit for calculating the amount of motion of the user within a preset period based on the plurality of skeleton information; a determination unit that determines whether the motion amount is below a set threshold; and The estimation unit estimates that the user's mood is unhappy when the movement amount is below the threshold value. 如請求項1之資訊處理裝置, 其中,該取得部取得拍攝該使用者的複數個攝影裝置於同一時刻從各自相異的方向拍攝該使用者所得到的複數個影像、以及該複數個攝影裝置各自的位置資訊; 其中,該取出部基於該複數個攝影裝置於同一時刻拍攝該使用者所得到的複數個影像、以及該複數個攝影裝置各自的位置資訊,取出該複數個骨骼資訊之中的1個骨骼資訊。 If the information processing device of claim 1, Wherein, the obtaining unit obtains a plurality of images obtained by photographing the user from different directions at the same time by a plurality of photographing devices of the user, and the respective position information of the plurality of photographing devices; Wherein, the extracting part extracts one piece of bone information from the plurality of bone information based on the plurality of images obtained by the plurality of photographing devices at the same time shooting the user and the respective position information of the plurality of photographing devices. 如請求項1或2之資訊處理裝置, 其中,該估測部基於該骨骼的骨骼資訊,估測該不悅的動作的種類。 If the information processing device of claim 1 or 2, The estimating unit estimates the type of the unpleasant action based on the skeleton information of the skeleton. 如請求項1或2之資訊處理裝置, 其中,該估測部基於該使用者停留的時間、以及估測為該不悅的時間,算出不悅程度。 If the information processing device of claim 1 or 2, The estimation unit calculates the degree of discomfort based on the time the user stayed and the time estimated to be the discomfort. 一種估測方法,由資訊處理裝置執行: 取得複數個影像;該複數個影像包含使用者,且拍攝時刻各自相異; 基於該複數個影像,取出複數個骨骼資訊;該複數個骨骼資訊示意該使用者的骨骼特徵點; 基於該複數個骨骼資訊,算出事先設定的期間內的該使用者的動作量; 判定該動作量是否在設定的臨界值以下;以及 當該動作量在該臨界值以下時,估測該使用者的心情為不悅。 An estimation method, executed by an information processing device: Acquiring a plurality of images; the plurality of images include users, and the shooting times are different from each other; Based on the plurality of images, extracting a plurality of skeleton information; the plurality of skeleton information indicates the skeleton feature points of the user; based on the plurality of skeleton information, calculate the motion amount of the user within a preset period; determine whether the movement amount is below a set threshold; and When the motion amount is below the threshold value, it is estimated that the user's mood is unhappy. 一種記錄媒體,記錄有估測程式,該估測程式使資訊處理裝置執行以下處理: 取得複數個影像;該複數個影像包含使用者,且拍攝時刻各自相異; 基於該複數個影像,取出複數個骨骼資訊;該複數個骨骼資訊示意該使用者的骨骼特徵點; 基於該複數個骨骼資訊,算出事先設定的期間內的該使用者的動作量; 判定該動作量是否在設定的臨界值以下;以及 當該動作量在該臨界值以下時,估測該使用者的心情為不悅。 A recording medium recording an estimation program, the estimation program causes an information processing device to perform the following processing: Acquiring a plurality of images; the plurality of images include users, and the shooting times are different from each other; Based on the plurality of images, extracting a plurality of skeleton information; the plurality of skeleton information indicates the skeleton feature points of the user; based on the plurality of skeleton information, calculate the motion amount of the user within a preset period; determine whether the movement amount is below a set threshold; and When the motion amount is below the threshold value, it is estimated that the user's mood is unhappy.
TW110127304A 2021-02-12 2021-07-26 Information processing device, inference method, and inference program TW202232375A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
PCT/JP2021/005141 WO2022172384A1 (en) 2021-02-12 2021-02-12 Information processing device, inference method, and inference program
WOPCT/JP2021/005141 2021-02-12

Publications (1)

Publication Number Publication Date
TW202232375A true TW202232375A (en) 2022-08-16

Family

ID=82838524

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110127304A TW202232375A (en) 2021-02-12 2021-07-26 Information processing device, inference method, and inference program

Country Status (3)

Country Link
JP (1) JP7224569B2 (en)
TW (1) TW202232375A (en)
WO (1) WO2022172384A1 (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4922095B2 (en) * 2007-08-01 2012-04-25 日本放送協会 Emotion expression extraction processing apparatus and program
JP6244643B2 (en) * 2013-04-15 2017-12-13 オムロン株式会社 Facial expression estimation apparatus, control method, control program, and recording medium
JP2019125061A (en) * 2018-01-12 2019-07-25 ナブテスコ株式会社 System for estimating condition of person

Also Published As

Publication number Publication date
WO2022172384A1 (en) 2022-08-18
JP7224569B2 (en) 2023-02-17
JPWO2022172384A1 (en) 2022-08-18

Similar Documents

Publication Publication Date Title
JP4756660B2 (en) Image processing apparatus and image processing method
JP6561830B2 (en) Information processing system, information processing method, and program
WO2019228196A1 (en) Method for tracking target in panoramic video, and panoramic camera
JP6024658B2 (en) Object detection apparatus, object detection method, and program
WO2019071613A1 (en) Image processing method and device
US20150146006A1 (en) Display control apparatus and display control method
JP6914007B2 (en) Information processing device and information processing method
JP6217635B2 (en) Fall detection device, fall detection method, fall detection camera, and computer program
JP2021534600A (en) Video stabilization methods and equipment, as well as non-temporary computer-readable media
JP5087037B2 (en) Image processing apparatus, method, and program
JP6547563B2 (en) Detection program, detection method and detection apparatus
JP2007088644A5 (en)
JP2018081402A (en) Image processing system, image processing method, and program
TW202232375A (en) Information processing device, inference method, and inference program
JP6178646B2 (en) Imaging apparatus and image shake correction processing method
KR102126370B1 (en) Apparatus and method for analyzing motion
JPWO2018179119A1 (en) Video analysis device, video analysis method, and program
US20220262031A1 (en) Information processing apparatus, information processing method, and storage medium
JP2019180017A5 (en)
JP2018124786A (en) Image processing device, image processing method, and image processing program
JP2019192154A (en) Image processing apparatus, imaging device, image processing method, and program
KR102598861B1 (en) Method and system for analysis of obesity in pets
TW202036593A (en) Information processing device, determination output method, and determination output program
JP7462869B2 (en) Information processing method, information processing device, and program
JP7370840B2 (en) Image processing device, image processing method, computer program and storage medium