WO2020078119A1 - Method, device and system for simulating user wearing clothing and accessories - Google Patents

Method, device and system for simulating user wearing clothing and accessories Download PDF

Info

Publication number
WO2020078119A1
WO2020078119A1 PCT/CN2019/103681 CN2019103681W WO2020078119A1 WO 2020078119 A1 WO2020078119 A1 WO 2020078119A1 CN 2019103681 W CN2019103681 W CN 2019103681W WO 2020078119 A1 WO2020078119 A1 WO 2020078119A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
image
gesture
posture
identity
Prior art date
Application number
PCT/CN2019/103681
Other languages
French (fr)
Chinese (zh)
Inventor
潘宗涛
王德鑫
Original Assignee
京东数字科技控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 京东数字科技控股有限公司 filed Critical 京东数字科技控股有限公司
Publication of WO2020078119A1 publication Critical patent/WO2020078119A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0641Shopping interfaces
    • G06Q30/0643Graphical representation of items or shoppers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107Static hand or arm

Definitions

  • the present invention relates to the field of computer technology, and in particular, to a method, device, and system for simulating a user to wear clothing accessories.
  • the system uses existing face recognition, gender recognition, age recognition, and gesture recognition technologies to create an intelligent matching system that combines gesture recognition with real-time switching to allow users to experience different styles of matching clothes.
  • the embodiments of the present invention provide a method, device, and system for simulating a user wearing a clothing ornament, which can reduce the amount of calculation, improve the calculation speed, and recognition accuracy.
  • a method for simulating a user wearing a clothing accessory including: acquiring a user's full-body image, and detecting the whole-body image based on an evaluation model to evaluate the user's posture;
  • the evaluation model is an openpose network structure based on the MobileNet algorithm; identify the user's identity and recommend costume accessories based on the user's identity; the recognition algorithm based on geometric moments and edge detection recognizes the user's gesture and obtains a gesture command; based on the user's gesture And the gesture command simulates the user wearing a costume ornament.
  • an apparatus for simulating a user's wearing of clothing accessories including: an evaluation module for acquiring a user's whole-body image, and detecting the whole-body image based on an evaluation model To evaluate the user's posture; wherein, the evaluation model is an openpose network structure based on the MobileNet algorithm; the first recognition module is used to recognize the user's identity, and the clothing accessories are recommended according to the user's identity; the second recognition module is used to based on the geometry
  • the recognition algorithm of moment and edge detection recognizes the user's gesture to obtain a gesture command; a simulation module is used to simulate the user's wearing of clothing accessories based on the user's gesture and the gesture command.
  • a system for simulating a user's clothing accessories including an image acquisition terminal, an identification terminal, a recommendation server, a display terminal, and a data storage terminal, where:
  • the image acquisition terminal is used to obtain the user's full-body image, facial image and gesture image;
  • the recognition terminal is used to evaluate the user's posture, user identity and obtain gesture commands;
  • the recommendation server is used to recommend clothing accessories based on the user identity;
  • the display end is used to display simulated user wearing clothing accessories;
  • the data storage end is used to store data of the image acquisition end, the recognition end, and the recommendation service end.
  • an electronic device simulating a user's clothing accessories including: one or more processors; a storage device, used to store one or more programs, when The one or more programs are executed by the one or more processors, so that the one or more processors implement a method for simulating a user to wear clothing accessories according to an embodiment of the present invention.
  • a computer-readable storage medium is provided on which a computer program is stored, and when the program is executed by a processor, a simulated user of the embodiment of the present invention Ways to wear clothing accessories.
  • An embodiment of the above invention has the following advantages or beneficial effects: because the user's full-body image is acquired, the whole-body image is detected based on the evaluation model to evaluate the user's posture; the user's identity is identified, and clothing accessories are recommended based on the user's identity; based on the geometric moment And edge detection recognition algorithms recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, the evaluation model is an Openpose network structure based on MobileNet algorithm, using MobileNet as a feature extraction layer to replace openpose network The ordinary convolutional layer in the structure can reduce the number of parameters and the amount of calculation.
  • the recognition algorithm based on geometric moments and edge detection can improve the accuracy of gesture recognition, so it overcomes the huge amount of calculation, time-consuming, and high cost; accurate gesture recognition Low degree, the technical problem that requires users to wave their hands many times, and then to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
  • FIG. 1 is a schematic diagram of main steps of a method for simulating a user to wear clothing accessories according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of the main flow of a method for simulating a user to wear a clothing accessory according to a reference embodiment of the present invention
  • FIG. 3 is a schematic diagram of gesture recognition for simulating a method of wearing a costume ornament by a user according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of main modules of a device for simulating a user to wear a clothing accessory according to an embodiment of the present invention
  • FIG. 5 is a schematic diagram of a system for simulating a user to wear clothing accessories according to an embodiment of the present invention
  • FIG. 6 is an exemplary system architecture diagram to which embodiments of the present invention can be applied;
  • FIG. 7 is a schematic structural diagram of a computer system suitable for implementing a terminal device or a server according to an embodiment of the present invention.
  • FIG. 1 is a schematic diagram of main steps of a method for simulating a user to wear clothing accessories according to an embodiment of the present invention. As shown in FIG. 1, the method for simulating a user to wear a clothing accessory according to an embodiment of the present invention mainly includes the following steps:
  • Step S101 Acquire the user's whole-body image, and detect the whole-body image based on the evaluation model to evaluate the user's posture.
  • a full-body scan can be performed on the user to obtain a full-body image, and the full-body image can be detected based on the MobileNet algorithm and the openpose network structure to identify the user's user posture.
  • the evaluation model of the embodiment of the present invention is an Openpose network structure based on the MobileNet algorithm, wherein the evaluation model selects MobileNet as the feature extraction layer to replace the ordinary convolutional layer in the openpose network structure.
  • Using the depth-separable convolutions of the MobileNet algorithm to extract features in the whole-body image can reduce the number of parameters and the amount of calculation.
  • this separation structure can core decompose the amount of compression parameters, which can increase the calculation speed for the CPU of most mobile terminals.
  • MobileNets is based on a streamlined architecture, which uses deep separable convolutions to build lightweight deep neural networks.
  • the openpose network structure is a human pose estimation algorithm. The principle of openpose is: input an image, extract the features through the convolution network, get a set of feature maps, and then divide it into two parts, using ordinary convolutional layers (such as convolution) Neural network CNN) extracts partial confidence maps (Part Confidence Maps) and partial affinity fields (Part Affinity Fields); after obtaining these two pieces of information, uses even matching (Bipartite Matching) to find partial associations (Part Associations), which will be the same The joint points of the individuals are connected.
  • each stage has two Branch detection to generate keypoint heatmap and vectmap respectively.
  • heatmap and vectmap you can know all the key points in the picture.
  • the evaluation model can be trained using the MobileNet algorithm, and the training process is similar to the evaluation process.
  • step S101 can be implemented in the following ways: using the MobileNet algorithm to extract bone features from the whole-body image; using the openpose network structure to extract bone line segments and key point vectors of the bone line segments from the bone features; using cosine similarity, clips The weights of angular cosine and skeletal line segments are calculated on key point vectors to evaluate user poses.
  • Bone features are the characteristics of the user's limbs.
  • Bone line segments are the line segments on the limbs in the image. Generally, each limb in the image can be extracted into two bone line segments. The bone line segments represent the user's limbs, which is more in line with the human physiological structure and can be intuitively reflected The user's posture (that is, the movement of the human arm and leg).
  • the cosine similarity can be calculated according to the following formula:
  • the angle cosine can be calculated according to the following formula:
  • ⁇ 1 - ⁇ 2 represents the angle between two adjacent bone segments
  • ⁇ 1 and ⁇ 2 are the included angle between two adjacent bone segments and the horizontal line, or ⁇ 1 and ⁇ 2 are respectively The angle between two adjacent bone segments and the vertical line;
  • bone segment weights can be assigned corresponding weight values for bone segments corresponding to limbs, or bone segments corresponding to each part of a limb Assign corresponding weight values, etc.
  • the bone segment weights of the bone segments corresponding to the thigh and the lower leg may be the same or different.
  • Step S102 Identify the user's identity, and recommend costume accessories based on the user's identity.
  • Recognizing the user's identity can more accurately recommend suitable clothing accessories to the user. You can recommend clothing accessories to the user according to the recommendation strategy or user preferences set by the merchant, or you can recommend clothing accessories with high sales or high evaluation to the user.
  • the recommendation of the clothing accessory according to the identity of the user may also be to obtain the clothing accessory selected by the user after determining the user name of the user, that is, to recommend the clothing accessory to himself.
  • the user identity may include a user name, user gender, and user age.
  • Step S102 can be implemented by: acquiring the user's facial image when the user's posture matches the preset posture; adjusting the size of the facial image to generate facial images of different sizes to construct an image pyramid; using multi-layer neural network structure detection
  • the image pyramid obtains the face frame, and recognizes the user name based on the face frame; the face image is detected using a classification model to determine the user's gender and user's age.
  • the multi-layer neural network structure includes an input layer, a hidden layer, and an output layer, where the number of hidden layers depends on needs.
  • the embodiment of the present invention uses a multi-layer neural network structure to detect the facial image to obtain the user's face frame.
  • the face frame is composed of face feature points.
  • the face feature points are the contours of the eyes, eyebrows, nose, mouth, and outer contour of the face. Location, you can identify the user name through the face frame.
  • the number of layers to build the image pyramid is determined by two factors, the first is the minimum face size, the second is the scaling factor, the minimum face size (minsize) can be expressed by min (w, h), w is the face The width of the image and h is the height of the facial image. Since the human face is almost indistinguishable when the image size is less than 12, the adjusted facial image size "minL" cannot be less than 12.
  • the number of layers of the image pyramid can be determined according to the following formula:
  • minL 12
  • org_L is the size of the face image
  • minsize is the minimum face size
  • factor is the scaling factor
  • n is the number of layers of the image pyramid.
  • the minisize is artificially set according to the application scenario. In the case where minL is greater than 12, all n constitute the pyramid layer. Therefore, the smaller the value of minsize, the larger the range of n, and the amount of calculation will increase accordingly.
  • the smaller the face that can be detected you can continuously adjust minSize to determine the range of n to ensure a suitable position interval Of users can be detected, neither too close nor too far away, which can improve the user experience while optimizing computing.
  • the classification model can include three convolutional layers and two fully connected layers to avoid overfitting.
  • Each convolutional layer in a convolutional neural network is composed of several convolutional units, and the parameters of each convolutional unit are optimized by a back-propagation algorithm.
  • the purpose of the convolution operation is to extract different features of the input.
  • the first convolutional layer may only extract some low-level features such as edges, lines, and corners. More layers of the network can iteratively extract more complex features from the low-level features. Characteristics.
  • Each node of the fully connected layer is connected to all nodes of the previous layer, and is used to synthesize the extracted features. Since both age and gender can be divided into a limited number of categories, the detection of user gender and user age is a classification problem.
  • a classification model can be used to detect facial images to determine the user gender and user age corresponding to the facial image.
  • Step S103 A recognition algorithm based on geometric moments and edge detection recognizes user gestures and obtains gesture commands.
  • the user can implement operations such as replacing simulated clothing accessories or collecting simulated clothing accessories through gestures.
  • step S101 may be implemented by: positioning and tracking the user's hand; acquiring the user's gesture image, and binarizing the gesture image; using a geometric moment and edge detection recognition algorithm Calculate the seven geometric moment feature components of the gesture image, and select four geometric moment feature components from the seven geometric moment feature components as the geometric moment feature vector; generate the gray image of the gesture image, detect the edges of the gray image, and obtain the gesture image
  • the boundary direction feature vector of based on the geometric moment feature vector and the boundary direction feature vector, calculate the distance between the gesture image and any gesture in the gesture library to obtain the gesture command.
  • various gesture commands are stored in the gesture library.
  • the gestures closest to the user's gesture image in the gesture library are directly searched to obtain the user's gesture commands.
  • the binarization process is to set the gray value of the pixels on the image to 0 or 255, that is, the process of displaying the entire image with a clear black and white effect.
  • the black and white image can make the calculation of the image more accurate.
  • Geometric moment is a gesture recognition method based on statistical analysis. The translation, rotation, and scale transformation of the seven moment group images are kept unchanged.
  • the moment group is the geometric moment feature component; edge detection is image processing and computer
  • the basic problem in vision, the purpose of edge detection is to identify the obvious changes in brightness in digital images.
  • the distance between images is the geometric feature distance between the input gesture image and any gesture image in the gesture library.
  • the gesture recognition process based on statistics and probability is easier to control and can recognize more complex and delicate gestures.
  • weights can be set for the geometric moment feature vector and the boundary direction feature vector according to actual needs.
  • Step S104 Simulate the user's wearing of clothing accessories based on the user's posture and gesture commands.
  • the whole body image of the user is acquired and the whole body image is detected based on the evaluation model to evaluate the user's posture; the user identity is identified and the clothing accessory is recommended according to the user identity; Recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, so overcoming the huge amount of calculation, time-consuming, high cost; gesture recognition accuracy is low ,
  • the technical problem that requires users to wave their hands many times to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
  • FIG. 2 is a schematic diagram of a main flow of a method for simulating a user to wear a clothing accessory according to a reference embodiment of the present invention. As shown in FIG. 2, the method for simulating a user wearing a clothing accessory according to an embodiment of the present invention may be implemented using the following process:
  • Step S201 gesture recognition: the camera can be used for continuous scanning, and only the actions matching the fixed gesture can be recognized, so as to enter the next step;
  • Step S202 Face recognition: the camera is also used to scan the face, and then the face is detected to identify the user, and it can also determine whether a member has been registered, etc .;
  • Step S203 gender identification
  • Step S204 age recognition
  • Step S205 Recommended strategy:
  • the result after recognition enters the recommendation system, intelligently recommends according to gender, age and recommendation strategy, and can also set member users to browse more preferential matching products, etc .;
  • Step S206 Gesture recognition: the user's gesture is recognized, so that different matching schemes are switched for display.
  • the relevant data can be stored to the data storage side, and the relevant data can also be obtained from the data storage side.
  • FIG. 3 is a schematic diagram of gesture recognition for simulating a method of wearing a costume ornament by a user according to an embodiment of the present invention. As shown in Figure 3, gesture recognition mainly includes the following steps:
  • Step S301 hand positioning and tracking
  • Step S302 Hand feature extraction and preprocessing:
  • Gesture images are segmented, and binarization is performed on the gesture images
  • Step S303 hand feature vector parameters: the geometric moment and edge detection recognition algorithm is used to calculate the seven geometric moment feature components of the gesture image, and four geometric moment feature components are selected from the seven geometric moment feature components as the geometric moment feature vector; And generating a grayscale image of the gesture image, detecting the edges of the grayscale image, and obtaining the boundary direction feature vector of the gesture image;
  • Step S304 Gesture recognition: based on the geometric moment feature vector and the boundary direction feature vector, calculate the distance between the gesture image and any gesture in the gesture library;
  • Step S305 Recognition result: obtain a gesture command according to the distance between the gesture image and any gesture in the gesture library.
  • the first layer the convolution layer: 96 convolution kernels can be used, and the number of parameters of each convolution kernel is 3 * 7 * 7, which is equivalent to three 7 * 7 convolution kernels in each channel convolution.
  • the activation function uses a linear rectification function (ReLU), the pooling uses the maximum overlapping pooling, the size of the pooling is 3 * 3, and the strides is 2; Among them, ReLU is also called a modified linear unit, which is a commonly used activation in artificial neural networks.
  • Function (activation) usually refers to the nonlinear function represented by the ramp function and its variants;
  • the second layer the convolution layer: the input of the second layer is a 96 * 28 * 28 single-channel image, because the three channels have been combined for convolution in the previous step;
  • the third layer the convolution layer: the number of filters can be 384, and the size of the convolution kernel can be 3 * 3;
  • the fourth layer fully connected layer: the first fully connected layer, the number of neurons can be selected 512;
  • the image processing can be directly processed with 3-channel color images, and the resized facial images can be uniformly scaled to 256 * 256, and then cropped to 227 * 227, and the training process can be random Cropping, randomly cropping multiple pictures to train helps to make the network recognition rate higher.
  • the four corners of the rectangle + the center are cropped, which means that the input of the network is a 227 * 227 3-channel color image, and the model is continuously optimized by using a small learning rate and using parameter regularization (dropout). Make it more accurate.
  • FIG. 4 is a schematic diagram of main modules of a device for simulating a user to wear a clothing accessory according to an embodiment of the present invention.
  • the apparatus 400 for simulating a user's wearing of clothing accessories according to an embodiment of the present invention includes: an evaluation module 401, a first recognition module 402, a second recognition module 403, and a simulation module 404. among them,
  • the evaluation module 401 is used to obtain a user's whole-body image and detect the whole-body image based on an evaluation model to evaluate the user's posture; wherein the evaluation model is an openpose network structure based on the MobileNet algorithm;
  • the first identification module 402 is used to identify the user's identity and recommend costume accessories according to the user's identity;
  • the second recognition module 403 is used to recognize the user's gesture based on the recognition algorithm based on geometric moment and edge detection and obtain the gesture command;
  • the simulation module 404 is used for simulating a user to wear a costume ornament based on the user posture and the gesture command.
  • the evaluation module 401 is further configured to: use the MobileNet algorithm to extract bone features from the whole-body image; use the openpose network structure to extract bone line segments and the bone line segments from the bone features The key point vector of; calculates the key point vector using the cosine similarity, the angle cosine and the weight of the bone line segment to evaluate the user's posture.
  • the user identity includes a user name, a user's gender, and a user's age
  • the first recognition module 402 is further used to: obtain a user's facial image when the user's posture matches a preset posture; adjust the face The size of the image, generating facial images of different sizes to construct an image pyramid; wherein, the number of layers of the image pyramid is determined according to the following formula: minL > 12, org_L is the size of the face image, minsize is the minimum face size, factor is the scaling factor, and n is the number of layers of the image pyramid; the multi-layer neural network structure is used to detect the image pyramid to obtain the face Frame, identifying the user name based on the face frame; detecting the facial image using a classification model to determine the user's gender and the user's age; wherein, the classification model includes three convolutional layers And two fully connected layers.
  • the second recognition module 403 is further used to: locate and track the user's hand; acquire the user's gesture image, and binarize the gesture image; use geometric moment and The edge detection recognition algorithm calculates seven geometric moment feature components of the gesture image, and selects four geometric moment feature components from the seven geometric moment feature components as geometric moment feature vectors; generates the gray of the gesture image Degree map, detecting the edge of the grayscale image to obtain the boundary direction feature vector of the gesture image; based on the geometric moment feature vector and the boundary direction feature vector, calculating any of the gesture image and the gesture library Gesture distance to get gesture commands.
  • the device for simulating the wearing of clothing accessories according to the embodiment of the present invention can be seen because the whole body image of the user is acquired and the whole body image is detected based on the evaluation model , To evaluate the user's posture; identify the user's identity, and recommend clothing accessories based on the user's identity; recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on the user's gestures and gesture commands to simulate the technical means of wearing clothing accessories, so It overcomes the technical problem of huge calculation amount, long time consumption and high cost; low accuracy of gesture recognition, which requires users to wave their hands many times, thereby achieving the technical effects of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
  • FIG. 5 is a schematic diagram of a system for simulating a user to wear clothing accessories according to an embodiment of the present invention.
  • an embodiment of the present invention also provides a system 500 for simulating a user's clothing accessories.
  • the system 500 for simulating a user's clothing accessories includes an image acquisition terminal 501, an identification terminal 502, a recommendation server 503, and a display terminal 504 ⁇ ⁇ ⁇ ⁇ 505 ⁇ 504 and data storage terminal 505. among them,
  • the image acquisition terminal 501 is used to acquire the user's full-body image, facial image and gesture image;
  • the recognition terminal 502 is used to evaluate the user's posture, user identity and obtain gesture commands;
  • the recommendation server 503 is used to recommend clothing accessories based on the user's identity
  • the display end 504 is used to display simulated user wearing clothing accessories
  • the data storage terminal 505 is used to store data of the image acquisition terminal, the recognition terminal, and the recommendation server.
  • the whole body image of the user is acquired and the whole body image is detected based on the evaluation model to evaluate the user's posture; the user identity is identified and the clothing accessory is recommended according to the user identity; Recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, so overcoming the huge amount of calculation, time-consuming, high cost; gesture recognition accuracy is low ,
  • the technical problem that requires users to wave their hands many times to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
  • FIG. 6 shows an exemplary system architecture 600 to which the method for simulating the wearing of clothing accessories or the device for simulating the wearing of clothing accessories of the embodiments of the present invention can be applied.
  • the system architecture 600 may include terminal devices 601, 602, and 603, a network 604, and a server 605.
  • the network 604 is used as a medium for providing communication links between the terminal devices 601, 602, and 603 and the server 605.
  • the network 604 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
  • the user can use the terminal devices 601, 602, 603 to interact with the server 605 via the network 604 to receive or send messages, etc.
  • Various communication client applications such as shopping applications, web browser applications, search applications, instant communication tools, email clients, and social platform software, can be installed on the terminal devices 601, 602, and 603.
  • the terminal devices 601, 602, and 603 may be various electronic devices that have a display screen and support web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and so on.
  • the server 605 may be a server that provides various services, for example, a background management server that provides support for shopping websites browsed by users using terminal devices 601, 602, and 603.
  • the background management server may perform analysis and other processing on the received product information query request and other data, and feed back the processing results (such as target push information and product information) to the terminal device.
  • the method for simulating the wearing of clothing accessories provided by the embodiment of the present invention is generally executed by the server 605, and accordingly, the device for simulating the wearing of clothing accessories by the user is generally provided in the server 605.
  • terminal devices, networks, and servers in FIG. 6 are only schematic. According to the implementation needs, there can be any number of terminal devices, networks and servers.
  • the computer system 700 includes a central processing unit (CPU) 701 that can be loaded into a random access memory (RAM) 703 from a program stored in a read-only memory (ROM) 702 or from a storage section 708 Instead, perform various appropriate actions and processing.
  • RAM random access memory
  • ROM read-only memory
  • the CPU 701, ROM 702, and RAM 703 are connected to each other through a bus 704.
  • An input / output (I / O) interface 705 is also connected to the bus 704.
  • the following components are connected to the I / O interface 705: an input section 706 including a keyboard, a mouse, etc .; an output section 707 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc .; a storage section 708 including a hard disk, etc. ; And a communication section 709 including a network interface card such as a LAN card, a modem, etc. The communication section 709 performs communication processing via a network such as the Internet.
  • the drive 710 is also connected to the I / O interface 705 as needed.
  • a removable medium 711 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like, is installed on the drive 710 as necessary, so that the computer program read out therefrom is installed into the storage portion 708 as needed.
  • the process described above with reference to the flowchart may be implemented as a computer software program.
  • the disclosed embodiments of the present invention include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network through the communication section 709, and / or installed from the removable medium 711.
  • the central processing unit (CPU) 701 the above-described functions defined in the system of the present invention are executed.
  • the computer-readable medium shown in the present invention may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer diskettes, hard drives, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
  • the computer-readable storage medium may be any tangible medium containing or storing a program, which may be used by or in combination with an instruction execution system, apparatus, or device.
  • the computer-readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • the computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, and the computer-readable medium may send, propagate, or transmit a program for use by or in combination with an instruction execution system, apparatus, or device. .
  • the program code contained on the computer-readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, optical cable, RF, etc., or any suitable combination of the foregoing.
  • each block in the flowchart or block diagram may represent a module, a program segment, or a part of code, and the above-mentioned module, program segment, or part of code contains one or more for implementing a specified logical function Executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession can actually be executed in parallel, and sometimes they can also be executed in reverse order, depending on the functions involved.
  • each block in the block diagram or flowchart, and a combination of blocks in the block diagram or flowchart can be implemented with a dedicated hardware-based system that performs the specified function or operation, or can be used It is realized by a combination of dedicated hardware and computer instructions.
  • the modules described in the embodiments of the present invention may be implemented in software or hardware.
  • the described module may also be provided in the processor.
  • a processor includes an evaluation module, a first identification module, a second identification module, and an analog module.
  • the names of these modules do not constitute a limitation on the module itself.
  • the simulation module may also be described as "a module for simulating a user to wear a costume ornament based on the user gesture and the gesture command".
  • the present invention also provides a computer-readable medium.
  • the computer-readable medium may be included in the device described in the foregoing embodiments; or it may exist alone without being assembled into the device.
  • the computer-readable medium carries one or more programs.
  • the device includes: Step S101: Acquire a user's whole-body image and detect the whole-body image based on an evaluation model, To evaluate the user's posture; Step S102: Recognize the user's identity and recommend clothing accessories according to the user's identity; Step S103: Recognize the user's gesture based on the recognition algorithm based on geometric moments and edge detection to obtain the gesture command; Step S104: Simulate the user based on the user's gesture and gesture command Wear clothing accessories.
  • the whole-body image of the user is acquired and the whole-body image is detected based on the evaluation model to evaluate the user's posture; the user's identity is recognized, and clothing accessories are recommended according to the user's identity; recognition based on geometric moments and edge detection
  • the algorithm recognizes user gestures and obtains gesture commands; based on the user's gestures and gesture commands, the technical means of simulating the user's wearing clothing accessories, so it overcomes the huge calculation, time-consuming, and high cost; the gesture recognition accuracy is low, requiring the user to wave many times.
  • the problem is to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.

Abstract

The present invention relates to the technical field of computers, and disclosed thereby are a method, device and system for simulating a user wearing clothing and accessories. A specific embodiment of the method comprises: acquiring a full body image of a user, and performing detection on the full body image on the basis of an evaluation model so as to evaluate the pose of the user; identifying the identity of the user, and recommending clothing and accessories according to the identity of the user; identifying a gesture of the user on the basis of an identification algorithm of a geometric moment and edge detection to obtain a gesture command; and simulating the user wearing the clothing and accessories according to the pose of the user and the gesture command. The embodiment may reduce the amount of computation, and increase computation speed and identification accuracy.

Description

模拟用户穿戴服装饰品的方法、装置和系统Method, device and system for simulating user wearing clothing accessories 技术领域Technical field
本发明涉及计算机技术领域,尤其涉及一种模拟用户穿戴服装饰品的方法、装置和系统。The present invention relates to the field of computer technology, and in particular, to a method, device, and system for simulating a user to wear clothing accessories.
背景技术Background technique
目前,随着数字化、智能化的发展,各种新兴的技术与应用不断涌现,尤其是在识别领域,识别行业已经从稳步发展的初级阶段成功过渡到高速发展的成熟期。例如:人脸支付,刷脸签到、安防保卫、人机交互,跳舞机等。At present, with the development of digitalization and intelligence, various emerging technologies and applications continue to emerge, especially in the field of identification, the identification industry has successfully transitioned from the initial stage of steady development to the mature stage of high-speed development. For example: face payment, face check-in, security protection, human-computer interaction, dancing machine, etc.
通常,用户在购买服装时,本系统通过现有人脸识别,性别识别,年龄识别,姿态识别技术打造一种智能搭配系统,结合手势识别实时切换,让用户体验不一样的搭配穿衣风格。Generally, when users buy clothing, the system uses existing face recognition, gender recognition, age recognition, and gesture recognition technologies to create an intelligent matching system that combines gesture recognition with real-time switching to allow users to experience different styles of matching clothes.
在实现本发明过程中,发明人发现现有技术中至少存在如下问题:In the process of implementing the present invention, the inventor found that there are at least the following problems in the prior art:
1.计算量巨大,耗时长,成本高1. The calculation is huge, time-consuming and costly
2.手势识别准确度低,需要用户多次挥手。2. The accuracy of gesture recognition is low, requiring users to wave their hands many times.
发明内容Summary of the invention
有鉴于此,本发明实施例提供一种模拟用户穿戴服装饰品的方法、装置和系统,能够减少计算量,提高计算速度以及识别准确度。In view of this, the embodiments of the present invention provide a method, device, and system for simulating a user wearing a clothing ornament, which can reduce the amount of calculation, improve the calculation speed, and recognition accuracy.
为实现上述目的,根据本发明实施例的一个方面,提供了一种模拟用户穿戴服装饰品的方法,包括:获取用户的全身图像,基于评估模型对所述全身图像进行检测,以评估用户姿态;其中,所述评估模型为基于MobileNet算法的openpose网络结构;识别用户身份,根据所述用户身份推荐服装饰品;基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品。To achieve the above object, according to an aspect of an embodiment of the present invention, a method for simulating a user wearing a clothing accessory is provided, including: acquiring a user's full-body image, and detecting the whole-body image based on an evaluation model to evaluate the user's posture; Wherein, the evaluation model is an openpose network structure based on the MobileNet algorithm; identify the user's identity and recommend costume accessories based on the user's identity; the recognition algorithm based on geometric moments and edge detection recognizes the user's gesture and obtains a gesture command; based on the user's gesture And the gesture command simulates the user wearing a costume ornament.
为实现上述目的,根据本发明实施例的另一方面,提供了一种模 拟用户穿戴服装饰品的装置,包括:评估模块,用于获取用户的全身图像,基于评估模型对所述全身图像进行检测,以评估用户姿态;其中,所述评估模型为基于MobileNet算法的openpose网络结构;第一识别模块,用于识别用户身份,根据所述用户身份推荐服装饰品;第二识别模块,用于基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;模拟模块,用于基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品。To achieve the above object, according to another aspect of the embodiments of the present invention, there is provided an apparatus for simulating a user's wearing of clothing accessories, including: an evaluation module for acquiring a user's whole-body image, and detecting the whole-body image based on an evaluation model To evaluate the user's posture; wherein, the evaluation model is an openpose network structure based on the MobileNet algorithm; the first recognition module is used to recognize the user's identity, and the clothing accessories are recommended according to the user's identity; the second recognition module is used to based on the geometry The recognition algorithm of moment and edge detection recognizes the user's gesture to obtain a gesture command; a simulation module is used to simulate the user's wearing of clothing accessories based on the user's gesture and the gesture command.
为实现上述目的,根据本发明实施例的又一方面,提供了一种模拟用户穿戴服装饰品的系统,包括图像采集端、识别端、推荐服务端、显示端和数据存储端,其中:所述图像采集端用于获取用户的全身图像、面部图像和手势图像;所述识别端用于评估用户姿态、用户身份和获得手势命令;所述推荐服务端用于根据所述用户身份推荐服装饰品;所述显示端用于展示模拟用户穿戴服装饰品;所述数据存储端用于存储所述图像采集端、所述识别端和所述推荐服务端的数据。To achieve the above object, according to still another aspect of the embodiments of the present invention, a system for simulating a user's clothing accessories is provided, including an image acquisition terminal, an identification terminal, a recommendation server, a display terminal, and a data storage terminal, where: The image acquisition terminal is used to obtain the user's full-body image, facial image and gesture image; the recognition terminal is used to evaluate the user's posture, user identity and obtain gesture commands; the recommendation server is used to recommend clothing accessories based on the user identity; The display end is used to display simulated user wearing clothing accessories; the data storage end is used to store data of the image acquisition end, the recognition end, and the recommendation service end.
为实现上述目的,根据本发明实施例的再一方面,提供了一种模拟用户穿戴服装饰品的电子设备,包括:一个或多个处理器;存储装置,用于存储一个或多个程序,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本发明实施例的一种模拟用户穿戴服装饰品的方法。To achieve the above object, according to still another aspect of the embodiments of the present invention, there is provided an electronic device simulating a user's clothing accessories, including: one or more processors; a storage device, used to store one or more programs, when The one or more programs are executed by the one or more processors, so that the one or more processors implement a method for simulating a user to wear clothing accessories according to an embodiment of the present invention.
为实现上述目的,根据本发明实施例的再一方面,提供了一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现本发明实施例的一种模拟用户穿戴服装饰品的方法。To achieve the above object, according to still another aspect of the embodiments of the present invention, a computer-readable storage medium is provided on which a computer program is stored, and when the program is executed by a processor, a simulated user of the embodiment of the present invention Ways to wear clothing accessories.
上述发明中的一个实施例具有如下优点或有益效果:因为采用获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;识别用户身份,根据用户身份推荐服装饰品;基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于用户姿态和手势命令模拟用户穿戴服装饰品的技术手段,该评估模型为基于MobileNet算法的openpose网络结构,利用MobileNet作为特征提取层替换openpose网络结构中的普通卷积层,可以降低参数数量和计算量,同时,基于几何矩和边缘检测的识别算法可提高手势识别准确度,所 以克服了计算量巨大,耗时长,成本高;手势识别准确度低,需要用户多次挥手的技术问题,进而达到减少计算量,提高计算速度以及识别准确度的技术效果。An embodiment of the above invention has the following advantages or beneficial effects: because the user's full-body image is acquired, the whole-body image is detected based on the evaluation model to evaluate the user's posture; the user's identity is identified, and clothing accessories are recommended based on the user's identity; based on the geometric moment And edge detection recognition algorithms recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, the evaluation model is an Openpose network structure based on MobileNet algorithm, using MobileNet as a feature extraction layer to replace openpose network The ordinary convolutional layer in the structure can reduce the number of parameters and the amount of calculation. At the same time, the recognition algorithm based on geometric moments and edge detection can improve the accuracy of gesture recognition, so it overcomes the huge amount of calculation, time-consuming, and high cost; accurate gesture recognition Low degree, the technical problem that requires users to wave their hands many times, and then to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
上述的非惯用的可选方式所具有的进一步效果将在下文中结合具体实施方式加以说明。The further effects provided by the above-mentioned non-conventional alternatives will be described in conjunction with specific implementations below.
附图说明BRIEF DESCRIPTION
附图用于更好地理解本发明,不构成对本发明的不当限定。其中:The drawings are used to better understand the present invention and do not constitute an undue limitation on the present invention. among them:
图1是根据本发明实施例的模拟用户穿戴服装饰品的方法的主要步骤的示意图;FIG. 1 is a schematic diagram of main steps of a method for simulating a user to wear clothing accessories according to an embodiment of the present invention;
图2是根据本发明一个可参考实施例的模拟用户穿戴服装饰品的方法的主要流程的示意图;2 is a schematic diagram of the main flow of a method for simulating a user to wear a clothing accessory according to a reference embodiment of the present invention;
图3是根据本发明实施例的模拟用户穿戴服装饰品的方法的手势识别的示意图;FIG. 3 is a schematic diagram of gesture recognition for simulating a method of wearing a costume ornament by a user according to an embodiment of the present invention;
图4是根据本发明实施例的模拟用户穿戴服装饰品的装置的主要模块的示意图;4 is a schematic diagram of main modules of a device for simulating a user to wear a clothing accessory according to an embodiment of the present invention;
图5是根据本发明实施例的模拟用户穿戴服装饰品的系统的示意图;FIG. 5 is a schematic diagram of a system for simulating a user to wear clothing accessories according to an embodiment of the present invention;
图6是本发明实施例可以应用于其中的示例性系统架构图;6 is an exemplary system architecture diagram to which embodiments of the present invention can be applied;
图7是适于用来实现本发明实施例的终端设备或服务器的计算机系统的结构示意图。7 is a schematic structural diagram of a computer system suitable for implementing a terminal device or a server according to an embodiment of the present invention.
具体实施方式detailed description
以下结合附图对本发明的示范性实施例做出说明,其中包括本发明实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本发明的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。The following describes exemplary embodiments of the present invention with reference to the accompanying drawings, which includes various details of the embodiments of the present invention to facilitate understanding, and they should be considered as merely exemplary. Therefore, those of ordinary skill in the art should recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present invention. Also, for clarity and conciseness, descriptions of well-known functions and structures are omitted in the following description.
需要指出的是,在不冲突的情况下,本发明的实施例以及实施例中的技术特征可以相互结合。It should be noted that the embodiments of the present invention and the technical features in the embodiments can be combined with each other without conflict.
图1是根据本发明实施例的模拟用户穿戴服装饰品的方法的主要步骤的示意图。如图1所示,本发明实施例的模拟用户穿戴服装饰品的方法主要包括以下步骤:FIG. 1 is a schematic diagram of main steps of a method for simulating a user to wear clothing accessories according to an embodiment of the present invention. As shown in FIG. 1, the method for simulating a user to wear a clothing accessory according to an embodiment of the present invention mainly includes the following steps:
步骤S101:获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态。Step S101: Acquire the user's whole-body image, and detect the whole-body image based on the evaluation model to evaluate the user's posture.
在模拟用户穿戴服装饰品时,可以对用户进行全身扫描,以获取全身图像,基于MobileNet算法和openpose网络结构对全身图像检测,可以识别出该用户的用户姿态。本发明实施例的评估模型为基于MobileNet算法的openpose网络结构,其中,评估模型选用MobileNet作为特征提取层替换openpose网络结构中的普通卷积层。利用MobileNet算法的深度可分离卷积层(depthwise separable convolutions)提取全身图像中的特征,可以降低参数数量和计算量。同时,这种分离结构能够核分解压缩参数量,对于绝大多数移动终端的CPU能够提高运算速度。When simulating a user wearing a costume adornment, a full-body scan can be performed on the user to obtain a full-body image, and the full-body image can be detected based on the MobileNet algorithm and the openpose network structure to identify the user's user posture. The evaluation model of the embodiment of the present invention is an Openpose network structure based on the MobileNet algorithm, wherein the evaluation model selects MobileNet as the feature extraction layer to replace the ordinary convolutional layer in the openpose network structure. Using the depth-separable convolutions of the MobileNet algorithm to extract features in the whole-body image can reduce the number of parameters and the amount of calculation. At the same time, this separation structure can core decompose the amount of compression parameters, which can increase the calculation speed for the CPU of most mobile terminals.
MobileNets是基于一个流线型的架构,它使用深度可分离的卷积来构建轻量级的深层神经网络。openpose网络结构是一种人体姿态估计算法,openpose的原理是:输入一幅图像,经过卷积网络提取特征,得到一组特征图,然后分为两部分,分别使用普通卷积层(例如卷积神经网络CNN)提取部分置信图(Part Confidence Maps)和部分亲和字段(Part Affinity Fields);得到这两个信息后,使用偶匹配(Bipartite Matching)求出部分联想(Part Association),从而将同一个人的关节点连接起来,由于部分亲和字段自身的矢量性,使得生成的偶匹配很正确,最终合并为一个人的整体骨架,再经过六个阶段检测(stage),每个stage有二个分支检测,分别生成关键点热力图(heatmap)和向量图(vectmap)。有了heatmap和vectmap就可以知道图片中所有的关键点。此外,该评估模型可以利用MobileNet算法对其进行训练,训练过程与评估过程类似。MobileNets is based on a streamlined architecture, which uses deep separable convolutions to build lightweight deep neural networks. The openpose network structure is a human pose estimation algorithm. The principle of openpose is: input an image, extract the features through the convolution network, get a set of feature maps, and then divide it into two parts, using ordinary convolutional layers (such as convolution) Neural network CNN) extracts partial confidence maps (Part Confidence Maps) and partial affinity fields (Part Affinity Fields); after obtaining these two pieces of information, uses even matching (Bipartite Matching) to find partial associations (Part Associations), which will be the same The joint points of the individuals are connected. Due to the vector nature of the partial affinity field, the generated even match is very correct, and finally merged into a person's overall skeleton, and then through six stages of detection (stage), each stage has two Branch detection to generate keypoint heatmap and vectmap respectively. With heatmap and vectmap, you can know all the key points in the picture. In addition, the evaluation model can be trained using the MobileNet algorithm, and the training process is similar to the evaluation process.
在本发明实施例中,步骤S101可以通过以下方式实现:利用MobileNet算法从全身图像中提取骨骼特征;利用openpose网络结构从 骨骼特征提取骨骼线段和骨骼线段的关键点向量;利用余弦相似度、夹角余弦和骨骼线段的权重对关键点向量进行计算,以评估用户姿态。In the embodiment of the present invention, step S101 can be implemented in the following ways: using the MobileNet algorithm to extract bone features from the whole-body image; using the openpose network structure to extract bone line segments and key point vectors of the bone line segments from the bone features; using cosine similarity, clips The weights of angular cosine and skeletal line segments are calculated on key point vectors to evaluate user poses.
骨骼特征即用户的肢体特征,骨骼线段即图像中肢体上的线段,通常图像中的每个肢体能够提取到两个骨骼线段,通过骨骼线段表示用户肢体,更符合人体生理结构,且能够直观体现出用户姿态(即人体手臂和腿部的动作)。Bone features are the characteristics of the user's limbs. Bone line segments are the line segments on the limbs in the image. Generally, each limb in the image can be extracted into two bone line segments. The bone line segments represent the user's limbs, which is more in line with the human physiological structure and can be intuitively reflected The user's posture (that is, the movement of the human arm and leg).
其中,余弦相似度可以根据以下公式计算:Among them, the cosine similarity can be calculated according to the following formula:
Figure PCTCN2019103681-appb-000001
x和y代表骨骼线段的两个向量,i=1,2,3,...,n;
Figure PCTCN2019103681-appb-000001
x and y represent two vectors of skeletal line segments, i = 1, 2, 3, ..., n;
夹角余弦可以根据以下公式计算:The angle cosine can be calculated according to the following formula:
cos(θ 12),θ 12表示两相临骨骼线段的夹角,θ 1和θ 2分别是两相临骨骼线段与水平线的夹角,或θ 1和θ 2分别是两相临骨骼线段与竖直线的夹角; cos (θ 12 ), θ 12 represents the angle between two adjacent bone segments, θ 1 and θ 2 are the included angle between two adjacent bone segments and the horizontal line, or θ 1 and θ 2 are respectively The angle between two adjacent bone segments and the vertical line;
对于特殊场景下人的行为,不同的骨骼线段应当赋予不同的权重,骨骼线段权重可以根据历史评估经验,为肢体对应的骨骼线段分配相应的权重值,或为肢体的每个部位对应的骨骼线段分配相应的权重值等,例如大腿和小腿对应的骨骼线段的骨骼线段权重可以相同或不同。For human behavior in special scenes, different bone segments should be given different weights. Based on historical evaluation experience, bone segment weights can be assigned corresponding weight values for bone segments corresponding to limbs, or bone segments corresponding to each part of a limb Assign corresponding weight values, etc. For example, the bone segment weights of the bone segments corresponding to the thigh and the lower leg may be the same or different.
步骤S102:识别用户身份,根据用户身份推荐服装饰品。Step S102: Identify the user's identity, and recommend costume accessories based on the user's identity.
识别用户身份能够更精准地向用户推荐适合的服装饰品,可以根据商家设置的推荐策略或用户喜好等向用户推荐服装饰品,也可以采用向用户推荐销量或评价高的服装饰品等方式。Recognizing the user's identity can more accurately recommend suitable clothing accessories to the user. You can recommend clothing accessories to the user according to the recommendation strategy or user preferences set by the merchant, or you can recommend clothing accessories with high sales or high evaluation to the user.
需要说明的是,根据用户身份推荐服装饰品还可以是确定用户的用户名后,获取用户自己选择的服装饰品,即自己向自己推荐服装饰品。It should be noted that the recommendation of the clothing accessory according to the identity of the user may also be to obtain the clothing accessory selected by the user after determining the user name of the user, that is, to recommend the clothing accessory to himself.
在本发明实施例中,用户身份可以包括用户名、用户性别和用户年龄。In the embodiments of the present invention, the user identity may include a user name, user gender, and user age.
则步骤S102可以通过以下方式实现:在用户姿态与预设姿态匹配时,获取用户的面部图像;调整面部图像的尺寸,生成不同尺寸的面部图像,以构建图像金字塔;利用多层神经网络结构检测图像金字塔, 得到人脸框,基于人脸框识别出用户名;利用分类模型对面部图像进行检测,以确定用户性别和用户年龄。Step S102 can be implemented by: acquiring the user's facial image when the user's posture matches the preset posture; adjusting the size of the facial image to generate facial images of different sizes to construct an image pyramid; using multi-layer neural network structure detection The image pyramid obtains the face frame, and recognizes the user name based on the face frame; the face image is detected using a classification model to determine the user's gender and user's age.
神经网络最早由心理学家和神经生物学家提出,由于神经网络在解决复杂问题时能够提供一种相对简单的方法,常用图像分析和处理。各式各样的神经网络模型从不同的角度对生物神经系统进行不同层次的描述和模拟,可实现函数逼近、数据聚类、模式分类、优化计算等功能。多层神经网络结构包括输入层、隐藏层和输出层,其中隐藏层的层数根据需要而定。本发明实施例利用多层神经网络结构对面部图像进行检测得到用户的人脸框,人脸框由人脸特征点组成,人脸特征点是眼睛、眉毛、鼻子、嘴巴、脸部外轮廓的位置,通过人脸框便可以别出用户名。其中,构建图像金字塔的层数由两个因素决定,第一个是最小人脸尺寸,第二个是缩放因子,最小人脸尺寸(minsize)可以用min(w,h)表示,w是面部图像的宽、h是面部图像的高,由于当图像尺寸小于12时人脸几乎不可分辨,因此调整后的面部图像尺寸“minL”不能小于12。图像金字塔的层数可以根据以下公式确定:Neural networks were first proposed by psychologists and neurobiologists. Because neural networks can provide a relatively simple method when solving complex problems, image analysis and processing are commonly used. Various neural network models describe and simulate biological neural systems at different levels from different angles, which can realize functions such as function approximation, data clustering, pattern classification, and optimization calculation. The multi-layer neural network structure includes an input layer, a hidden layer, and an output layer, where the number of hidden layers depends on needs. The embodiment of the present invention uses a multi-layer neural network structure to detect the facial image to obtain the user's face frame. The face frame is composed of face feature points. The face feature points are the contours of the eyes, eyebrows, nose, mouth, and outer contour of the face. Location, you can identify the user name through the face frame. Among them, the number of layers to build the image pyramid is determined by two factors, the first is the minimum face size, the second is the scaling factor, the minimum face size (minsize) can be expressed by min (w, h), w is the face The width of the image and h is the height of the facial image. Since the human face is almost indistinguishable when the image size is less than 12, the adjusted facial image size "minL" cannot be less than 12. The number of layers of the image pyramid can be determined according to the following formula:
Figure PCTCN2019103681-appb-000002
minL﹥12,org_L是面部图像的尺寸,minsize是最小人脸尺寸,factor是缩放因子,n是图像金字塔的层数。minisize是人为根据应用场景设定,在保证minL大于12的情况下,所有的n就构成金字塔的层。所以minsize的值越小,n的取值范围就越大,计算量就相应地增加,能够检测到的人脸越小,可以通过不断调整minSize,从而决定n的取值范围,保证合适位置区间的用户能被检测到,不会太近也不会太远,在优化计算的同时可以提升用户体验。
Figure PCTCN2019103681-appb-000002
minL ﹥ 12, org_L is the size of the face image, minsize is the minimum face size, factor is the scaling factor, and n is the number of layers of the image pyramid. The minisize is artificially set according to the application scenario. In the case where minL is greater than 12, all n constitute the pyramid layer. Therefore, the smaller the value of minsize, the larger the range of n, and the amount of calculation will increase accordingly. The smaller the face that can be detected, you can continuously adjust minSize to determine the range of n to ensure a suitable position interval Of users can be detected, neither too close nor too far away, which can improve the user experience while optimizing computing.
此外,分类模型可以包括三个卷积层和二个全连接层,从而避免过拟合。卷积神经网络中每层卷积层由若干卷积单元组成,每个卷积单元的参数都是通过反向传播算法最佳化得到的。卷积运算的目的是提取输入的不同特征,第一层卷积层可能只能提取一些低级的特征如边缘、线条和角等层级,更多层的网路能从低级特征中迭代提取更复杂的特征。全连接层的每一个结点都与上一层的所有结点相连,用来 把前边提取到的特征综合起来。由于年龄和性别都可以划分为有限个类别,因此用户性别和用户年龄检测,属于分类问题,可以用分类模型对面部图像进行检测,从而确定面部图像对应的用户性别和用户年龄。In addition, the classification model can include three convolutional layers and two fully connected layers to avoid overfitting. Each convolutional layer in a convolutional neural network is composed of several convolutional units, and the parameters of each convolutional unit are optimized by a back-propagation algorithm. The purpose of the convolution operation is to extract different features of the input. The first convolutional layer may only extract some low-level features such as edges, lines, and corners. More layers of the network can iteratively extract more complex features from the low-level features. Characteristics. Each node of the fully connected layer is connected to all nodes of the previous layer, and is used to synthesize the extracted features. Since both age and gender can be divided into a limited number of categories, the detection of user gender and user age is a classification problem. A classification model can be used to detect facial images to determine the user gender and user age corresponding to the facial image.
步骤S103:基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令。Step S103: A recognition algorithm based on geometric moments and edge detection recognizes user gestures and obtains gesture commands.
为满足用户模拟穿戴不同服装饰品的需求,提高用户体验,使用户可以仅通过手势就能实现更换模拟穿戴的服装饰品,或收藏模拟穿戴的服装饰品等操作。In order to meet the needs of users to simulate wearing different clothing accessories and improve the user experience, the user can implement operations such as replacing simulated clothing accessories or collecting simulated clothing accessories through gestures.
在本发明实施例中,步骤S101可以通过以下方式实现:对用户的手部进行定位与追踪;获取用户的手势图像,并对手势图像进行二值化处理;采用几何矩和边缘检测的识别算法计算手势图像的七个几何矩特征分量,从七个几何矩特征分量中选择四个几何矩特征分量作为几何矩特征向量;生成手势图像的灰度图,检测灰度图的边缘,得到手势图像的边界方向特征向量;基于几何矩特征向量和边界方向特征向量,计算手势图像与手势库中的任一手势的距离,以获得手势命令。In the embodiment of the present invention, step S101 may be implemented by: positioning and tracking the user's hand; acquiring the user's gesture image, and binarizing the gesture image; using a geometric moment and edge detection recognition algorithm Calculate the seven geometric moment feature components of the gesture image, and select four geometric moment feature components from the seven geometric moment feature components as the geometric moment feature vector; generate the gray image of the gesture image, detect the edges of the gray image, and obtain the gesture image The boundary direction feature vector of; based on the geometric moment feature vector and the boundary direction feature vector, calculate the distance between the gesture image and any gesture in the gesture library to obtain the gesture command.
本发明实施例通过手势库存储各种手势命令,在检测手势命令时,直接查找手势库中与用户的手势图像最接近的手势,从而获得用户的手势命令。其中,二值化处理是将图像上的像素点的灰度值设置为0或255,也就是将整个图像呈现出明显的黑白效果的过程,黑白图像可使对图像的计算更加准确。几何矩是一种基于统计分析的手势识别方法,其中的七个矩组队图像的平移,旋转,尺度变换均保持不变,该矩组队即几何矩特征分量;边缘检测是图像处理和计算机视觉中的基本问题,边缘检测的目的是标识数字图像中亮度变化明显的点。图像间的距离是输入的手势图像和手势库中任一个手势图像之间的几何特征距离。In the embodiment of the present invention, various gesture commands are stored in the gesture library. When detecting the gesture commands, the gestures closest to the user's gesture image in the gesture library are directly searched to obtain the user's gesture commands. Among them, the binarization process is to set the gray value of the pixels on the image to 0 or 255, that is, the process of displaying the entire image with a clear black and white effect. The black and white image can make the calculation of the image more accurate. Geometric moment is a gesture recognition method based on statistical analysis. The translation, rotation, and scale transformation of the seven moment group images are kept unchanged. The moment group is the geometric moment feature component; edge detection is image processing and computer The basic problem in vision, the purpose of edge detection is to identify the obvious changes in brightness in digital images. The distance between images is the geometric feature distance between the input gesture image and any gesture image in the gesture library.
基于统计和概率的手势识别过程更容易控制,能够识别更复杂精细的手势。此外,基于几何矩特征向量和边界方向特征向量,计算手势图像与手势库中的任一手势的距离时,可以根据实际需要为几何矩特征向量和边界方向特征向量设置权重。The gesture recognition process based on statistics and probability is easier to control and can recognize more complex and delicate gestures. In addition, based on the geometric moment feature vector and the boundary direction feature vector, when calculating the distance between the gesture image and any gesture in the gesture library, weights can be set for the geometric moment feature vector and the boundary direction feature vector according to actual needs.
步骤S104:基于用户姿态和手势命令模拟用户穿戴服装饰品。Step S104: Simulate the user's wearing of clothing accessories based on the user's posture and gesture commands.
得到骨骼线段表达的用户姿态后,可以基于用户的命令(即手势命令)模拟用户穿戴推荐的服装饰品。After obtaining the user's posture expressed by the skeleton line segment, it is possible to simulate the user wearing the recommended costume ornament based on the user's command (that is, the gesture command).
根据本发明实施例的模拟用户穿戴服装饰品的方法可以看出,因为采用获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;识别用户身份,根据用户身份推荐服装饰品;基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于用户姿态和手势命令模拟用户穿戴服装饰品的技术手段,所以克服了计算量巨大,耗时长,成本高;手势识别准确度低,需要用户多次挥手的技术问题,进而达到减少计算量,提高计算速度以及识别准确度的技术效果。According to the method of simulating a user wearing a clothing accessory according to an embodiment of the present invention, it can be seen that the whole body image of the user is acquired and the whole body image is detected based on the evaluation model to evaluate the user's posture; the user identity is identified and the clothing accessory is recommended according to the user identity; Recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, so overcoming the huge amount of calculation, time-consuming, high cost; gesture recognition accuracy is low , The technical problem that requires users to wave their hands many times to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
图2是根据本发明一个可参考实施例的模拟用户穿戴服装饰品的方法的主要流程的示意图。如图2所示,本发明实施例的模拟用户穿戴服装饰品的方法可采用以下流程实施:FIG. 2 is a schematic diagram of a main flow of a method for simulating a user to wear a clothing accessory according to a reference embodiment of the present invention. As shown in FIG. 2, the method for simulating a user wearing a clothing accessory according to an embodiment of the present invention may be implemented using the following process:
步骤S201:姿态识别:可以利用摄像头不断扫描,只有匹配固定姿态的动作才能被识别,从而进入下一步;Step S201: gesture recognition: the camera can be used for continuous scanning, and only the actions matching the fixed gesture can be recognized, so as to enter the next step;
步骤S202:人脸识别:同样利用摄像头扫描到人脸,然后对人脸进行检测以识别用户,还可以判断是否已注册会员等;Step S202: Face recognition: the camera is also used to scan the face, and then the face is detected to identify the user, and it can also determine whether a member has been registered, etc .;
步骤S203:性别识别;Step S203: gender identification;
步骤S204:年龄识别;Step S204: age recognition;
步骤S205:推荐策略:Step S205: Recommended strategy:
识别后的结果进入推荐系统,按照性别、年龄和推荐策略进行智能推荐,还可以设置会员用户可以浏览更多优惠的搭配商品等;The result after recognition enters the recommendation system, intelligently recommends according to gender, age and recommendation strategy, and can also set member users to browse more preferential matching products, etc .;
步骤S206:手势识别:识别到用户的手势,从而切换不同的搭配方案进行展示。Step S206: Gesture recognition: the user's gesture is recognized, so that different matching schemes are switched for display.
在实施步骤S201-步骤S206的过程中,可以将相关数据存储到数据存储端,还可以从数据存储端获取相关数据。During the implementation of steps S201-S206, the relevant data can be stored to the data storage side, and the relevant data can also be obtained from the data storage side.
图3是根据本发明实施例的模拟用户穿戴服装饰品的方法的手势识别的示意图。如图3所示,手势识别主要包括以下步骤:FIG. 3 is a schematic diagram of gesture recognition for simulating a method of wearing a costume ornament by a user according to an embodiment of the present invention. As shown in Figure 3, gesture recognition mainly includes the following steps:
步骤S301:手部定位与追踪;Step S301: hand positioning and tracking;
步骤S302:手部特征提取和预处理:Step S302: Hand feature extraction and preprocessing:
分割出手势图像,并对手势图像进行二值化处理;Gesture images are segmented, and binarization is performed on the gesture images;
步骤S303:手部特征向量参数:采用几何矩和边缘检测的识别算法计算手势图像的七个几何矩特征分量,从七个几何矩特征分量中选择四个几何矩特征分量作为几何矩特征向量;以及生成手势图像的灰度图,检测灰度图的边缘,得到手势图像的边界方向特征向量;Step S303: hand feature vector parameters: the geometric moment and edge detection recognition algorithm is used to calculate the seven geometric moment feature components of the gesture image, and four geometric moment feature components are selected from the seven geometric moment feature components as the geometric moment feature vector; And generating a grayscale image of the gesture image, detecting the edges of the grayscale image, and obtaining the boundary direction feature vector of the gesture image;
步骤S304:手势识别:基于几何矩特征向量和边界方向特征向量,计算手势图像与手势库中的任一手势的距离;Step S304: Gesture recognition: based on the geometric moment feature vector and the boundary direction feature vector, calculate the distance between the gesture image and any gesture in the gesture library;
步骤S305:识别结果:根据手势图像与手势库中的任一手势的距离获得手势命令。Step S305: Recognition result: obtain a gesture command according to the distance between the gesture image and any gesture in the gesture library.
为了进一步阐述本发明的技术思想,现结合具体的应用场景,对本发明实施例的分类模型进行说明。In order to further illustrate the technical idea of the present invention, the classification model of the embodiment of the present invention will now be described in conjunction with specific application scenarios.
本发明实施例的分类模型包括:The classification model of the embodiment of the present invention includes:
第一层:卷积层:可以采用96个卷积核,每个卷积核参数个数为3*7*7,这个就相当于3个7*7大小的卷积核在每个通道进行卷积。激活函数采用线性整流函数(ReLU),池化采用最大重叠池化,池化的size选择3*3,strides选择2;其中,ReLU又称修正线性单元,是一种人工神经网络中常用的激活函数(activation function),通常指代以斜坡函数及其变种为代表的非线性函数;The first layer: the convolution layer: 96 convolution kernels can be used, and the number of parameters of each convolution kernel is 3 * 7 * 7, which is equivalent to three 7 * 7 convolution kernels in each channel convolution. The activation function uses a linear rectification function (ReLU), the pooling uses the maximum overlapping pooling, the size of the pooling is 3 * 3, and the strides is 2; Among them, ReLU is also called a modified linear unit, which is a commonly used activation in artificial neural networks. Function (activation), usually refers to the nonlinear function represented by the ramp function and its variants;
第二层:卷积层:第二层的输入也就是96*28*28的单通道图片,因为上一步已经把三通道合在一起进行卷积了;The second layer: the convolution layer: the input of the second layer is a 96 * 28 * 28 single-channel image, because the three channels have been combined for convolution in the previous step;
第三层:卷积层:滤波器个数可以选择384,卷积核大小可以为3*3;The third layer: the convolution layer: the number of filters can be 384, and the size of the convolution kernel can be 3 * 3;
第四层:全连接层:第一个全连接层,神经元个数可以选择512;The fourth layer: fully connected layer: the first fully connected layer, the number of neurons can be selected 512;
第五层:全连接层:第二个全连接层,神经元个数也可以选择512。The fifth layer: fully connected layer: the second fully connected layer, the number of neurons can also choose 512.
对于本发明实施例的分类模型的训练,图像处理可以直接采用3通道彩色图像进行处理,调整尺寸后的面部图像可以统一缩放到256*256,然后再裁剪为227*227,且训练过程可以随机裁剪,随机剪裁多张图片来训练有助于使网络识别率更高。验证测试过程则通过矩形的四个角+中心裁剪,也就是说网络的输入是227*227的3通道彩色 图像,通过使用较小的学习率和使用参数正则化方法(dropout)不断优化模型,使之更加准确。For the training of the classification model according to the embodiment of the present invention, the image processing can be directly processed with 3-channel color images, and the resized facial images can be uniformly scaled to 256 * 256, and then cropped to 227 * 227, and the training process can be random Cropping, randomly cropping multiple pictures to train helps to make the network recognition rate higher. In the verification test process, the four corners of the rectangle + the center are cropped, which means that the input of the network is a 227 * 227 3-channel color image, and the model is continuously optimized by using a small learning rate and using parameter regularization (dropout). Make it more accurate.
图4是根据本发明实施例的模拟用户穿戴服装饰品的装置的主要模块的示意图。如图4所示,本发明实施例的模拟用户穿戴服装饰品的装置400包括:评估模块401、第一识别模块402、第二识别模块403和模拟模块404。其中,FIG. 4 is a schematic diagram of main modules of a device for simulating a user to wear a clothing accessory according to an embodiment of the present invention. As shown in FIG. 4, the apparatus 400 for simulating a user's wearing of clothing accessories according to an embodiment of the present invention includes: an evaluation module 401, a first recognition module 402, a second recognition module 403, and a simulation module 404. among them,
评估模块401,用于获取用户的全身图像,基于评估模型对所述全身图像进行检测,以评估用户姿态;其中,所述评估模型为基于MobileNet算法的openpose网络结构;The evaluation module 401 is used to obtain a user's whole-body image and detect the whole-body image based on an evaluation model to evaluate the user's posture; wherein the evaluation model is an openpose network structure based on the MobileNet algorithm;
第一识别模块402,用于识别用户身份,根据所述用户身份推荐服装饰品;The first identification module 402 is used to identify the user's identity and recommend costume accessories according to the user's identity;
第二识别模块403,用于基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;The second recognition module 403 is used to recognize the user's gesture based on the recognition algorithm based on geometric moment and edge detection and obtain the gesture command;
模拟模块404,用于基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品。The simulation module 404 is used for simulating a user to wear a costume ornament based on the user posture and the gesture command.
在本发明实施例中,所述评估模块401还用于:利用所述MobileNet算法从所述全身图像中提取骨骼特征;利用所述openpose网络结构从所述骨骼特征提取骨骼线段和所述骨骼线段的关键点向量;利用余弦相似度、夹角余弦和所述骨骼线段的权重对所述关键点向量进行计算,以评估用户姿态。In the embodiment of the present invention, the evaluation module 401 is further configured to: use the MobileNet algorithm to extract bone features from the whole-body image; use the openpose network structure to extract bone line segments and the bone line segments from the bone features The key point vector of; calculates the key point vector using the cosine similarity, the angle cosine and the weight of the bone line segment to evaluate the user's posture.
此外,所述用户身份包括用户名、用户性别和用户年龄;以及所述第一识别模块402还用于:在所述用户姿态与预设姿态匹配时,获取用户的面部图像;调整所述面部图像的尺寸,生成不同尺寸的面部图像,以构建图像金字塔;其中,所述图像金字塔的层数根据以下公式确定:
Figure PCTCN2019103681-appb-000003
minL﹥12,org_L是所述面部图像的尺寸,minsize是最小人脸尺寸,factor是缩放因子,n是所述图像金字塔的层数;利用多层神经网络结构检测所述图像金字塔,得到人脸框,基于所述人脸框识别出所述用户名;利用分类模型对所述面部图像进行检测,以确定所述用户性别和所述用户年龄;其中, 所述分类模型包括三个卷积层和二个全连接层。
In addition, the user identity includes a user name, a user's gender, and a user's age; and the first recognition module 402 is further used to: obtain a user's facial image when the user's posture matches a preset posture; adjust the face The size of the image, generating facial images of different sizes to construct an image pyramid; wherein, the number of layers of the image pyramid is determined according to the following formula:
Figure PCTCN2019103681-appb-000003
minL ﹥ 12, org_L is the size of the face image, minsize is the minimum face size, factor is the scaling factor, and n is the number of layers of the image pyramid; the multi-layer neural network structure is used to detect the image pyramid to obtain the face Frame, identifying the user name based on the face frame; detecting the facial image using a classification model to determine the user's gender and the user's age; wherein, the classification model includes three convolutional layers And two fully connected layers.
在本发明实施例中,所述第二识别模块403还用于:对用户的手部进行定位与追踪;获取用户的手势图像,并对所述手势图像进行二值化处理;采用几何矩和边缘检测的识别算法计算所述手势图像的七个几何矩特征分量,从七个所述几何矩特征分量中选择四个所述几何矩特征分量作为几何矩特征向量;生成所述手势图像的灰度图,检测所述灰度图的边缘,得到所述手势图像的边界方向特征向量;基于所述几何矩特征向量和所述边界方向特征向量,计算所述手势图像与手势库中的任一手势的距离,以获得手势命令。In the embodiment of the present invention, the second recognition module 403 is further used to: locate and track the user's hand; acquire the user's gesture image, and binarize the gesture image; use geometric moment and The edge detection recognition algorithm calculates seven geometric moment feature components of the gesture image, and selects four geometric moment feature components from the seven geometric moment feature components as geometric moment feature vectors; generates the gray of the gesture image Degree map, detecting the edge of the grayscale image to obtain the boundary direction feature vector of the gesture image; based on the geometric moment feature vector and the boundary direction feature vector, calculating any of the gesture image and the gesture library Gesture distance to get gesture commands.
根据本发明实施例的模拟用户穿戴服装饰品的装置可以看出,根据本发明实施例的模拟用户穿戴服装饰品的装置可以看出,因为采用获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;识别用户身份,根据用户身份推荐服装饰品;基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于用户姿态和手势命令模拟用户穿戴服装饰品的技术手段,所以克服了计算量巨大,耗时长,成本高;手势识别准确度低,需要用户多次挥手的技术问题,进而达到减少计算量,提高计算速度以及识别准确度的技术效果。It can be seen from the device for simulating the wearing of clothing accessories according to the embodiment of the present invention. The device for simulating the wearing of clothing accessories according to the embodiment of the present invention can be seen because the whole body image of the user is acquired and the whole body image is detected based on the evaluation model , To evaluate the user's posture; identify the user's identity, and recommend clothing accessories based on the user's identity; recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on the user's gestures and gesture commands to simulate the technical means of wearing clothing accessories, so It overcomes the technical problem of huge calculation amount, long time consumption and high cost; low accuracy of gesture recognition, which requires users to wave their hands many times, thereby achieving the technical effects of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
图5是根据本发明实施例的模拟用户穿戴服装饰品的系统的示意图。如图5所示,本发明实施例还提供了一种模拟用户穿戴服装饰品的系统500,该模拟用户穿戴服装饰品的系统500包括图像采集端501、识别端502、推荐服务端503、显示端504和数据存储端505。其中,FIG. 5 is a schematic diagram of a system for simulating a user to wear clothing accessories according to an embodiment of the present invention. As shown in FIG. 5, an embodiment of the present invention also provides a system 500 for simulating a user's clothing accessories. The system 500 for simulating a user's clothing accessories includes an image acquisition terminal 501, an identification terminal 502, a recommendation server 503, and a display terminal 504 和 数据 存 端 505。 504 and data storage terminal 505. among them,
图像采集端501用于获取用户的全身图像、面部图像和手势图像;The image acquisition terminal 501 is used to acquire the user's full-body image, facial image and gesture image;
识别端502用于评估用户姿态、用户身份和获得手势命令;The recognition terminal 502 is used to evaluate the user's posture, user identity and obtain gesture commands;
推荐服务端503用于根据用户身份推荐服装饰品;The recommendation server 503 is used to recommend clothing accessories based on the user's identity;
显示端504用于展示模拟用户穿戴服装饰品;The display end 504 is used to display simulated user wearing clothing accessories;
数据存储端505用于存储图像采集端、识别端和推荐服务端的数据。The data storage terminal 505 is used to store data of the image acquisition terminal, the recognition terminal, and the recommendation server.
根据本发明实施例的模拟用户穿戴服装饰品的系统可以看出,因为采用获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;识别用户身份,根据用户身份推荐服装饰品;基于几 何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于用户姿态和手势命令模拟用户穿戴服装饰品的技术手段,所以克服了计算量巨大,耗时长,成本高;手势识别准确度低,需要用户多次挥手的技术问题,进而达到减少计算量,提高计算速度以及识别准确度的技术效果。According to the system for simulating a user wearing a clothing accessory according to an embodiment of the present invention, it can be seen that the whole body image of the user is acquired and the whole body image is detected based on the evaluation model to evaluate the user's posture; the user identity is identified and the clothing accessory is recommended according to the user identity; Recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands; based on user gestures and gesture commands to simulate the technical means of wearing clothing accessories, so overcoming the huge amount of calculation, time-consuming, high cost; gesture recognition accuracy is low , The technical problem that requires users to wave their hands many times to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
图6示出了可以应用本发明实施例的模拟用户穿戴服装饰品的方法或模拟用户穿戴服装饰品的装置的示例性系统架构600。如图6所示,系统架构600可以包括终端设备601、602、603,网络604和服务器605。网络604用以在终端设备601、602、603和服务器605之间提供通信链路的介质。网络604可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。FIG. 6 shows an exemplary system architecture 600 to which the method for simulating the wearing of clothing accessories or the device for simulating the wearing of clothing accessories of the embodiments of the present invention can be applied. As shown in FIG. 6, the system architecture 600 may include terminal devices 601, 602, and 603, a network 604, and a server 605. The network 604 is used as a medium for providing communication links between the terminal devices 601, 602, and 603 and the server 605. The network 604 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
用户可以使用终端设备601、602、603通过网络604与服务器605交互,以接收或发送消息等。终端设备601、602、603上可以安装有各种通讯客户端应用,例如购物类应用、网页浏览器应用、搜索类应用、即时通信工具、邮箱客户端、社交平台软件等。The user can use the terminal devices 601, 602, 603 to interact with the server 605 via the network 604 to receive or send messages, etc. Various communication client applications, such as shopping applications, web browser applications, search applications, instant communication tools, email clients, and social platform software, can be installed on the terminal devices 601, 602, and 603.
终端设备601、602、603可以是具有显示屏并且支持网页浏览的各种电子设备,包括但不限于智能手机、平板电脑、膝上型便携计算机和台式计算机等等。The terminal devices 601, 602, and 603 may be various electronic devices that have a display screen and support web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and so on.
服务器605可以是提供各种服务的服务器,例如对用户利用终端设备601、602、603所浏览的购物类网站提供支持的后台管理服务器。后台管理服务器可以对接收到的产品信息查询请求等数据进行分析等处理,并将处理结果(例如目标推送信息、产品信息)反馈给终端设备。The server 605 may be a server that provides various services, for example, a background management server that provides support for shopping websites browsed by users using terminal devices 601, 602, and 603. The background management server may perform analysis and other processing on the received product information query request and other data, and feed back the processing results (such as target push information and product information) to the terminal device.
需要说明的是,本发明实施例所提供的模拟用户穿戴服装饰品的方法一般由服务器605执行,相应地,模拟用户穿戴服装饰品的装置一般设置于服务器605中。It should be noted that the method for simulating the wearing of clothing accessories provided by the embodiment of the present invention is generally executed by the server 605, and accordingly, the device for simulating the wearing of clothing accessories by the user is generally provided in the server 605.
应该理解,图6中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。It should be understood that the numbers of terminal devices, networks, and servers in FIG. 6 are only schematic. According to the implementation needs, there can be any number of terminal devices, networks and servers.
下面参考图7,其示出了适于用来实现本发明实施例的终端设备的计算机系统700的结构示意图。图7示出的终端设备仅仅是一个示例, 不应对本发明实施例的功能和使用范围带来任何限制。如图7所示,计算机系统700包括中央处理单元(CPU)701,其可以根据存储在只读存储器(ROM)702中的程序或者从存储部分708加载到随机访问存储器(RAM)703中的程序而执行各种适当的动作和处理。在RAM 703中,还存储有系统700操作所需的各种程序和数据。CPU 701、ROM 702以及RAM 703通过总线704彼此相连。输入/输出(I/O)接口705也连接至总线704。7, which shows a schematic structural diagram of a computer system 700 suitable for implementing a terminal device according to an embodiment of the present invention. The terminal device shown in FIG. 7 is only an example, and should not bring any limitation to the functions and use scope of the embodiments of the present invention. As shown in FIG. 7, the computer system 700 includes a central processing unit (CPU) 701 that can be loaded into a random access memory (RAM) 703 from a program stored in a read-only memory (ROM) 702 or from a storage section 708 Instead, perform various appropriate actions and processing. In the RAM 703, various programs and data necessary for the operation of the system 700 are also stored. The CPU 701, ROM 702, and RAM 703 are connected to each other through a bus 704. An input / output (I / O) interface 705 is also connected to the bus 704.
以下部件连接至I/O接口705:包括键盘、鼠标等的输入部分706;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分707;包括硬盘等的存储部分708;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信部分709。通信部分709经由诸如因特网的网络执行通信处理。驱动器710也根据需要连接至I/O接口705。可拆卸介质711,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器710上,以便于从其上读出的计算机程序根据需要被安装入存储部分708。The following components are connected to the I / O interface 705: an input section 706 including a keyboard, a mouse, etc .; an output section 707 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc .; a storage section 708 including a hard disk, etc. ; And a communication section 709 including a network interface card such as a LAN card, a modem, etc. The communication section 709 performs communication processing via a network such as the Internet. The drive 710 is also connected to the I / O interface 705 as needed. A removable medium 711, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like, is installed on the drive 710 as necessary, so that the computer program read out therefrom is installed into the storage portion 708 as needed.
特别地,根据本发明公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本发明公开的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信部分709从网络上被下载和安装,和/或从可拆卸介质711被安装。在该计算机程序被中央处理单元(CPU)701执行时,执行本发明的系统中限定的上述功能。In particular, according to the disclosed embodiments of the present invention, the process described above with reference to the flowchart may be implemented as a computer software program. For example, the disclosed embodiments of the present invention include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication section 709, and / or installed from the removable medium 711. When the computer program is executed by the central processing unit (CPU) 701, the above-described functions defined in the system of the present invention are executed.
需要说明的是,本发明所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存 储器件、或者上述的任意合适的组合。在本发明中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本发明中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:无线、电线、光缆、RF等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the present invention may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer diskettes, hard drives, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing. In the present invention, the computer-readable storage medium may be any tangible medium containing or storing a program, which may be used by or in combination with an instruction execution system, apparatus, or device. In the present invention, the computer-readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. The computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, and the computer-readable medium may send, propagate, or transmit a program for use by or in combination with an instruction execution system, apparatus, or device. . The program code contained on the computer-readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, optical cable, RF, etc., or any suitable combination of the foregoing.
附图中的流程图和框图,图示了按照本发明各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the drawings illustrate the possible implementation architecture, functions, and operations of the system, method, and computer program product according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagram may represent a module, a program segment, or a part of code, and the above-mentioned module, program segment, or part of code contains one or more for implementing a specified logical function Executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession can actually be executed in parallel, and sometimes they can also be executed in reverse order, depending on the functions involved. It should also be noted that each block in the block diagram or flowchart, and a combination of blocks in the block diagram or flowchart, can be implemented with a dedicated hardware-based system that performs the specified function or operation, or can be used It is realized by a combination of dedicated hardware and computer instructions.
描述于本发明实施例中所涉及到的模块可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的模块也可以设置在处理器中,例如,可以描述为:一种处理器包括评估模块、第一识别模块、第二识别模块和模拟模块。其中,这些模块的名称在某种情况下并不构成对该模块本身的限定,例如,模拟模块还可以被描述为“基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品的模块”。The modules described in the embodiments of the present invention may be implemented in software or hardware. The described module may also be provided in the processor. For example, it may be described as: a processor includes an evaluation module, a first identification module, a second identification module, and an analog module. In some cases, the names of these modules do not constitute a limitation on the module itself. For example, the simulation module may also be described as "a module for simulating a user to wear a costume ornament based on the user gesture and the gesture command".
作为另一方面,本发明还提供了一种计算机可读介质,该计算机可读介质可以是上述实施例中描述的设备中所包含的;也可以是单独 存在,而未装配入该设备中。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被一个该设备执行时,使得该设备包括:步骤S101:获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;步骤S102:识别用户身份,根据用户身份推荐服装饰品;步骤S103:基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;步骤S104:基于用户姿态和手势命令模拟用户穿戴服装饰品。As another aspect, the present invention also provides a computer-readable medium. The computer-readable medium may be included in the device described in the foregoing embodiments; or it may exist alone without being assembled into the device. The computer-readable medium carries one or more programs. When the one or more programs are executed by a device, the device includes: Step S101: Acquire a user's whole-body image and detect the whole-body image based on an evaluation model, To evaluate the user's posture; Step S102: Recognize the user's identity and recommend clothing accessories according to the user's identity; Step S103: Recognize the user's gesture based on the recognition algorithm based on geometric moments and edge detection to obtain the gesture command; Step S104: Simulate the user based on the user's gesture and gesture command Wear clothing accessories.
根据本发明实施例的技术方案,因为采用获取用户的全身图像,基于评估模型对全身图像进行检测,以评估用户姿态;识别用户身份,根据用户身份推荐服装饰品;基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;基于用户姿态和手势命令模拟用户穿戴服装饰品的技术手段,所以克服了计算量巨大,耗时长,成本高;手势识别准确度低,需要用户多次挥手的技术问题,进而达到减少计算量,提高计算速度以及识别准确度的技术效果。According to the technical solution of the embodiment of the present invention, the whole-body image of the user is acquired and the whole-body image is detected based on the evaluation model to evaluate the user's posture; the user's identity is recognized, and clothing accessories are recommended according to the user's identity; recognition based on geometric moments and edge detection The algorithm recognizes user gestures and obtains gesture commands; based on the user's gestures and gesture commands, the technical means of simulating the user's wearing clothing accessories, so it overcomes the huge calculation, time-consuming, and high cost; the gesture recognition accuracy is low, requiring the user to wave many times. The problem is to achieve the technical effect of reducing the amount of calculation, increasing the calculation speed and identifying accuracy.
上述具体实施方式,并不构成对本发明保护范围的限制。本领域技术人员应该明白的是,取决于设计要求和其他因素,可以发生各种各样的修改、组合、子组合和替代。任何在本发明的精神和原则之内所作的修改、等同替换和改进等,均应包含在本发明保护范围之内。The above specific embodiments do not limit the protection scope of the present invention. Those skilled in the art should understand that various modifications, combinations, sub-combinations and substitutions can occur depending on design requirements and other factors. Any modification, equivalent replacement and improvement made within the spirit and principle of the present invention shall be included in the protection scope of the present invention.

Claims (11)

  1. 一种模拟用户穿戴服装饰品的方法,其特征在于,包括:A method for simulating a user to wear clothing accessories, which is characterized by including:
    获取用户的全身图像,基于评估模型对所述全身图像进行检测,以评估用户姿态;其中,所述评估模型为基于MobileNet算法的openpose网络结构;Obtaining the user's whole-body image and detecting the whole-body image based on the evaluation model to evaluate the user's posture; wherein the evaluation model is an openpose network structure based on the MobileNet algorithm;
    识别用户身份,根据所述用户身份推荐服装饰品;Identify the user's identity and recommend costume accessories based on the user's identity;
    基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;Recognition algorithms based on geometric moments and edge detection recognize user gestures and obtain gesture commands;
    基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品。Based on the user's posture and the gesture command, the user wears a clothing accessory.
  2. 根据权利要求1所述的方法,其特征在于,基于评估模型对所述全身图像进行检测,以评估用户姿态包括:The method according to claim 1, wherein detecting the whole-body image based on the evaluation model to evaluate the user's posture includes:
    利用所述MobileNet算法从所述全身图像中提取骨骼特征;Using the MobileNet algorithm to extract bone features from the whole-body image;
    利用所述openpose网络结构从所述骨骼特征提取骨骼线段和所述骨骼线段的关键点向量;Extracting bone line segments and key point vectors of the bone line segments from the bone features using the openpose network structure;
    利用余弦相似度、夹角余弦和所述骨骼线段的权重对所述关键点向量进行计算,以评估用户姿态。The key point vector is calculated using the cosine similarity, the angle cosine and the weight of the bone line segment to evaluate the user's posture.
  3. 根据权利要求2所述的方法,其特征在于,所述用户身份包括用户名、用户性别和用户年龄;以及The method according to claim 2, wherein the user identity includes a user name, user gender, and user age; and
    识别用户身份包括:Identifying users includes:
    在所述用户姿态与预设姿态匹配时,获取用户的面部图像;When the user's posture matches the preset posture, obtain the user's facial image;
    调整所述面部图像的尺寸,生成不同尺寸的面部图像,以构建图像金字塔;其中,所述图像金字塔的层数根据以下公式确定:Adjust the size of the facial image to generate facial images of different sizes to construct an image pyramid; wherein, the number of layers of the image pyramid is determined according to the following formula:
    Figure PCTCN2019103681-appb-100001
    minL﹥12,org_L是所述面部图像的尺寸,minsize是最小人脸尺寸,factor是缩放因子,n是所述图像金字塔的层数;
    Figure PCTCN2019103681-appb-100001
    minL ﹥ 12, org_L is the size of the face image, minsize is the minimum face size, factor is the scaling factor, and n is the number of layers of the image pyramid;
    利用多层神经网络结构检测所述图像金字塔,得到人脸框,基于所述人脸框识别出所述用户名;Detecting the image pyramid using a multi-layer neural network structure to obtain a face frame, and identifying the user name based on the face frame;
    利用分类模型对所述面部图像进行检测,以确定所述用户性别和所述用户年龄;其中,所述分类模型包括三个卷积层和二个全连接层。A classification model is used to detect the facial image to determine the user's gender and the user's age; wherein, the classification model includes three convolutional layers and two fully connected layers.
  4. 根据权利要求1所述的方法,其特征在于,基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令包括:The method according to claim 1, wherein the recognition algorithm based on geometric moments and edge detection recognizes user gestures, and obtaining gesture commands includes:
    对用户的手部进行定位与追踪;Position and track the user's hand;
    获取用户的手势图像,并对所述手势图像进行二值化处理;Acquiring a gesture image of a user, and performing binary processing on the gesture image;
    采用几何矩和边缘检测的识别算法计算所述手势图像的七个几何矩特征分量,从七个所述几何矩特征分量中选择四个所述几何矩特征分量作为几何矩特征向量;A geometric moment and edge detection recognition algorithm is used to calculate seven geometric moment feature components of the gesture image, and four geometric moment feature components are selected from the seven geometric moment feature components as geometric moment feature vectors;
    生成所述手势图像的灰度图,检测所述灰度图的边缘,得到所述手势图像的边界方向特征向量;Generating a grayscale image of the gesture image, detecting edges of the grayscale image, and obtaining a boundary direction feature vector of the gesture image;
    基于所述几何矩特征向量和所述边界方向特征向量,计算所述手势图像与手势库中的任一手势的距离,以获得手势命令。Based on the geometric moment feature vector and the boundary direction feature vector, the distance between the gesture image and any gesture in the gesture library is calculated to obtain a gesture command.
  5. 一种模拟用户穿戴服装饰品的装置,其特征在于,包括:A device for simulating a user to wear a costume ornament is characterized in that it includes:
    评估模块,用于获取用户的全身图像,基于评估模型对所述全身图像进行检测,以评估用户姿态;其中,所述评估模型为基于MobileNet算法的openpose网络结构;The evaluation module is used to obtain a user's whole-body image and detect the whole-body image based on an evaluation model to evaluate the user's posture; wherein the evaluation model is an openpose network structure based on the MobileNet algorithm;
    第一识别模块,用于识别用户身份,根据所述用户身份推荐服装饰品;The first identification module is used to identify the user's identity, and recommend costume accessories based on the user's identity;
    第二识别模块,用于基于几何矩和边缘检测的识别算法识别用户手势,获得手势命令;The second recognition module is used to recognize the user's gesture based on the recognition algorithm based on geometric moment and edge detection and obtain the gesture command;
    模拟模块,用于基于所述用户姿态和所述手势命令模拟用户穿戴服装饰品。The simulation module is used for simulating a user to wear a clothing ornament based on the user posture and the gesture command.
  6. 根据权利要求5所述的装置,其特征在于,所述评估模块还用于:The apparatus according to claim 5, wherein the evaluation module is further used to:
    利用所述MobileNet算法从所述全身图像中提取骨骼特征;Using the MobileNet algorithm to extract bone features from the whole-body image;
    利用所述openpose网络结构从所述骨骼特征提取骨骼线段和所述骨骼线段的关键点向量;Extracting bone line segments and key point vectors of the bone line segments from the bone features using the openpose network structure;
    利用余弦相似度、夹角余弦和所述骨骼线段的权重对所述关键点向量进行计算,以评估用户姿态。The key point vector is calculated using the cosine similarity, the angle cosine and the weight of the bone line segment to evaluate the user's posture.
  7. 根据权利要求6所述的装置,其特征在于,所述用户身份包括用户名、用户性别和用户年龄;以及The apparatus according to claim 6, wherein the user identity includes a user name, user gender, and user age; and
    所述第一识别模块还用于:The first identification module is also used to:
    在所述用户姿态与预设姿态匹配时,获取用户的面部图像;When the user's posture matches the preset posture, obtain the user's facial image;
    调整所述面部图像的尺寸,生成不同尺寸的面部图像,以构建图像金字塔;其中,所述图像金字塔的层数根据以下公式确定:Adjust the size of the facial image to generate facial images of different sizes to construct an image pyramid; wherein, the number of layers of the image pyramid is determined according to the following formula:
    Figure PCTCN2019103681-appb-100002
    minL﹥12,org_L是所述面部图像的尺寸,minsize是最小人脸尺寸,factor是缩放因子,n是所述图像金字塔的层数;
    Figure PCTCN2019103681-appb-100002
    minL ﹥ 12, org_L is the size of the face image, minsize is the minimum face size, factor is the scaling factor, and n is the number of layers of the image pyramid;
    利用多层神经网络结构检测所述图像金字塔,得到人脸框,基于所述人脸框识别出所述用户名;Detecting the image pyramid using a multi-layer neural network structure to obtain a face frame, and identifying the user name based on the face frame;
    利用分类模型对所述面部图像进行检测,以确定所述用户性别和所述用户年龄;其中,所述分类模型包括三个卷积层和二个全连接层。A classification model is used to detect the facial image to determine the user's gender and the user's age; wherein, the classification model includes three convolutional layers and two fully connected layers.
  8. 根据权利要求5所述的装置,其特征在于,所述第二识别模块还用于:The device according to claim 5, wherein the second identification module is further used to:
    对用户的手部进行定位与追踪;Position and track the user's hand;
    获取用户的手势图像,并对所述手势图像进行二值化处理;Acquiring a gesture image of a user, and performing binary processing on the gesture image;
    采用几何矩和边缘检测的识别算法计算所述手势图像的七个几何矩特征分量,从七个所述几何矩特征分量中选择四个所述几何矩特征分量作为几何矩特征向量;A geometric moment and edge detection recognition algorithm is used to calculate seven geometric moment feature components of the gesture image, and four geometric moment feature components are selected from the seven geometric moment feature components as geometric moment feature vectors;
    生成所述手势图像的灰度图,检测所述灰度图的边缘,得到所述手势图像的边界方向特征向量;Generating a grayscale image of the gesture image, detecting edges of the grayscale image, and obtaining a boundary direction feature vector of the gesture image;
    基于所述几何矩特征向量和所述边界方向特征向量,计算所述手势图像与手势库中的任一手势的距离,以获得手势命令。Based on the geometric moment feature vector and the boundary direction feature vector, the distance between the gesture image and any gesture in the gesture library is calculated to obtain a gesture command.
  9. 一种模拟用户穿戴服装饰品的系统,其特征在于,包括图像采集端、识别端、推荐服务端、显示端和数据存储端,其中:A system for simulating a user's clothing accessories is characterized in that it includes an image acquisition end, an identification end, a recommendation service end, a display end, and a data storage end, where:
    所述图像采集端用于获取用户的全身图像、面部图像和手势图像;The image acquisition terminal is used to acquire the user's full-body image, facial image and gesture image;
    所述识别端用于评估用户姿态、用户身份和获得手势命令;The recognition terminal is used to evaluate the user's posture, user identity and obtain gesture commands;
    所述推荐服务端用于根据所述用户身份推荐服装饰品;The recommendation server is used for recommending clothing accessories according to the user identity;
    所述显示端用于展示模拟用户穿戴服装饰品;The display terminal is used to display simulated user wearing clothing accessories;
    所述数据存储端用于存储所述图像采集端、所述识别端和所述推 荐服务端的数据。The data storage end is used to store data of the image acquisition end, the identification end, and the recommendation server end.
  10. 一种模拟用户穿戴服装饰品的电子设备,其特征在于,包括:An electronic device simulating a user's clothing accessories is characterized by including:
    一个或多个处理器;One or more processors;
    存储装置,用于存储一个或多个程序,Storage device for storing one or more programs,
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-4中任一所述的方法。When the one or more programs are executed by the one or more processors, the one or more processors implement the method according to any one of claims 1-4.
  11. 一种计算机可读介质,其上存储有计算机程序,其特征在于,所述程序被处理器执行时实现如权利要求1-4中任一所述的方法。A computer-readable medium on which a computer program is stored, characterized in that, when the program is executed by a processor, the method according to any one of claims 1-4 is implemented.
PCT/CN2019/103681 2018-10-15 2019-08-30 Method, device and system for simulating user wearing clothing and accessories WO2020078119A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811195470.2 2018-10-15
CN201811195470.2A CN109409994A (en) 2018-10-15 2018-10-15 The methods, devices and systems of analog subscriber garments worn ornaments

Publications (1)

Publication Number Publication Date
WO2020078119A1 true WO2020078119A1 (en) 2020-04-23

Family

ID=65467178

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/103681 WO2020078119A1 (en) 2018-10-15 2019-08-30 Method, device and system for simulating user wearing clothing and accessories

Country Status (2)

Country Link
CN (1) CN109409994A (en)
WO (1) WO2020078119A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111639632A (en) * 2020-07-31 2020-09-08 南京浦和数据有限公司 Subway driver action sequence identification method based on support vector machine
CN111861822A (en) * 2020-06-03 2020-10-30 四川大学华西医院 Patient model construction method, equipment and medical education system
CN112131965A (en) * 2020-08-31 2020-12-25 深圳云天励飞技术股份有限公司 Human body posture estimation method and device, electronic equipment and storage medium
CN116453221A (en) * 2023-04-19 2023-07-18 北京百度网讯科技有限公司 Target object posture determining method, training device and storage medium

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109409994A (en) * 2018-10-15 2019-03-01 北京京东金融科技控股有限公司 The methods, devices and systems of analog subscriber garments worn ornaments
CN110084138A (en) * 2019-04-04 2019-08-02 高新兴科技集团股份有限公司 A kind of more people's Attitude estimation methods of 2D
CN110096973A (en) * 2019-04-16 2019-08-06 东南大学 A kind of traffic police's gesture identification method separating convolutional network based on ORB algorithm and depth level
CN111231892A (en) * 2019-12-29 2020-06-05 的卢技术有限公司 Automatic automobile unlocking control method and system based on face and gesture recognition
CN111401260B (en) * 2020-03-18 2020-09-29 南通大学 Sit-up test counting method and system based on Quick-OpenPose model
CN111382723A (en) * 2020-03-30 2020-07-07 北京云住养科技有限公司 Method, device and system for identifying help
CN111882531B (en) * 2020-07-15 2021-08-17 中国科学技术大学 Automatic analysis method for hip joint ultrasonic image
CN112582064A (en) * 2020-11-05 2021-03-30 中国科学院深圳先进技术研究院 Action evaluation method, device, equipment and storage medium
CN113822202A (en) * 2021-09-24 2021-12-21 河南理工大学 Taijiquan attitude detection system based on OpenPose and PyQt

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103533449A (en) * 2012-12-20 2014-01-22 Tcl集团股份有限公司 Method and system for realizing three-dimensional fitting based on intelligent three-dimensional television
US20160240015A1 (en) * 2015-02-13 2016-08-18 Speed 3D Inc. Three-dimensional avatar generating system, device and method thereof
CN205507877U (en) * 2015-11-16 2016-08-24 长春理工大学 Virtual fitting device that can be used to three -dimensional real time kinematic that purchases of net
CN107240007A (en) * 2017-07-21 2017-10-10 陕西科技大学 A kind of AR three-dimensional virtual fitting systems combined with 3D manikins
CN109409994A (en) * 2018-10-15 2019-03-01 北京京东金融科技控股有限公司 The methods, devices and systems of analog subscriber garments worn ornaments

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049852B (en) * 2012-12-19 2015-12-09 武汉世纪炎龙网络科技有限公司 Virtual fitting system
CN105426850B (en) * 2015-11-23 2021-08-31 深圳市商汤科技有限公司 Associated information pushing device and method based on face recognition
CN107346486A (en) * 2016-05-04 2017-11-14 上海柚钛智能科技有限公司 Wall-mounted intelligent interaction device and its interactive approach

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103533449A (en) * 2012-12-20 2014-01-22 Tcl集团股份有限公司 Method and system for realizing three-dimensional fitting based on intelligent three-dimensional television
US20160240015A1 (en) * 2015-02-13 2016-08-18 Speed 3D Inc. Three-dimensional avatar generating system, device and method thereof
CN205507877U (en) * 2015-11-16 2016-08-24 长春理工大学 Virtual fitting device that can be used to three -dimensional real time kinematic that purchases of net
CN107240007A (en) * 2017-07-21 2017-10-10 陕西科技大学 A kind of AR three-dimensional virtual fitting systems combined with 3D manikins
CN109409994A (en) * 2018-10-15 2019-03-01 北京京东金融科技控股有限公司 The methods, devices and systems of analog subscriber garments worn ornaments

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111861822A (en) * 2020-06-03 2020-10-30 四川大学华西医院 Patient model construction method, equipment and medical education system
CN111861822B (en) * 2020-06-03 2023-11-21 四川大学华西医院 Patient model construction method, equipment and medical education system
CN111639632A (en) * 2020-07-31 2020-09-08 南京浦和数据有限公司 Subway driver action sequence identification method based on support vector machine
CN112131965A (en) * 2020-08-31 2020-12-25 深圳云天励飞技术股份有限公司 Human body posture estimation method and device, electronic equipment and storage medium
CN112131965B (en) * 2020-08-31 2023-10-13 深圳云天励飞技术股份有限公司 Human body posture estimation method and device, electronic equipment and storage medium
CN116453221A (en) * 2023-04-19 2023-07-18 北京百度网讯科技有限公司 Target object posture determining method, training device and storage medium
CN116453221B (en) * 2023-04-19 2024-03-08 北京百度网讯科技有限公司 Target object posture determining method, training device and storage medium

Also Published As

Publication number Publication date
CN109409994A (en) 2019-03-01

Similar Documents

Publication Publication Date Title
WO2020078119A1 (en) Method, device and system for simulating user wearing clothing and accessories
CN109359538B (en) Training method of convolutional neural network, gesture recognition method, device and equipment
US11069131B2 (en) Predictive personalized three-dimensional body models
WO2020182121A1 (en) Expression recognition method and related device
US20210174072A1 (en) Microexpression-based image recognition method and apparatus, and related device
US11163978B2 (en) Method and device for face image processing, storage medium, and electronic device
JP2020522285A (en) System and method for whole body measurement extraction
CN112800903B (en) Dynamic expression recognition method and system based on space-time diagram convolutional neural network
CN109635752B (en) Method for positioning key points of human face, method for processing human face image and related device
JP2014211719A (en) Apparatus and method for information processing
US11861860B2 (en) Body dimensions from two-dimensional body images
WO2024001095A1 (en) Facial expression recognition method, terminal device and storage medium
KR20210090456A (en) Image-based Posture Preservation Virtual Fitting System Supporting Multi-Poses
Kerdvibulvech A methodology for hand and finger motion analysis using adaptive probabilistic models
CN114549557A (en) Portrait segmentation network training method, device, equipment and medium
CN113902989A (en) Live scene detection method, storage medium and electronic device
Lu et al. Cost-effective real-time recognition for human emotion-age-gender using deep learning with normalized facial cropping preprocess
US20220207917A1 (en) Facial expression image processing method and apparatus, and electronic device
Li et al. A novel art gesture recognition model based on two channel region-based convolution neural network for explainable human-computer interaction understanding
US20220139113A1 (en) Method and device for detecting object in image
KR102237131B1 (en) Appratus and method for processing image including at least one object
CN114511877A (en) Behavior recognition method and device, storage medium and terminal
Liu et al. Design and implementation of hair recommendation system based on face recognition
Satybaldina et al. Application development for hand gestures recognition with using a depth camera
Xu et al. Feature fusion capsule network for cow face recognition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19872837

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19872837

Country of ref document: EP

Kind code of ref document: A1