CN113160826B - Family member communication method and system based on face recognition - Google Patents


Info

Publication number
CN113160826B
CN113160826B (application CN202110225394.0A)
Authority
CN
China
Prior art keywords
user
recognition device
face recognition
voice
communication
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110225394.0A
Other languages
Chinese (zh)
Other versions
CN113160826A (en)
Inventor
Inventor not disclosed
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Terminus Technology Group Co Ltd
Original Assignee
Terminus Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Terminus Technology Group Co Ltd
Priority claimed from application CN202110225394.0A
Publication of CN113160826A
Application granted
Publication of CN113160826B

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/22: Matching criteria, e.g. proximity measures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/08: Learning methods
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00: Scenes; Scene-specific elements
    • G06V20/35: Categorising the entire scene, e.g. birthday party or wedding scene
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00: Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10: Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16: Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168: Feature extraction; Face representation
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00: Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10: Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16: Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174: Facial expression recognition
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/10: Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/26: Devices for calling a subscriber
    • H04M1/27: Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/274: Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
    • H04M1/2745: Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, using static electronic memories, e.g. chips
    • H04M1/27467: Methods of retrieving data
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/26: Devices for calling a subscriber
    • H04M1/27: Devices whereby a plurality of signals may be stored simultaneously
    • H04M1/274: Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
    • H04M1/2745: Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, using static electronic memories, e.g. chips
    • H04M1/27467: Methods of retrieving data
    • H04M1/27475: Methods of retrieving data using interactive graphical means or pictorial representations
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223: Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Collating Specific Patterns (AREA)
  • Telephonic Communication Services (AREA)

Abstract

An embodiment of the application provides a family member communication method and system based on face recognition. The method comprises the following steps: a face recognition device and a voice recognition device are arranged in a communication terminal, and the voice recognition device prestores at least one user password template for triggering the face recognition process; after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold, it starts the face recognition device; the face recognition device captures face images by rotating, recognizes the user's expression features, and forms a user state feature set; during the rotation it also collects the user's surroundings, recognizes the environmental features around the user, and forms an environment feature set; finally, the user state feature set and the environment feature set are fed into a deep learning network, the family member the user needs to contact is predicted, and the communication process is started. The method and system improve the efficiency of family member communication through a human body recognition algorithm.

Description

Family member communication method and system based on face recognition
Technical Field
The application relates to the fields of face recognition technology and communication management, and in particular to a family member communication method and system based on face recognition.
Background
A family generally includes members who need help, such as the elderly or children, and such members often find communication inconvenient: for example, an elderly person with poor eyesight and limited mobility may be unable to operate a communication terminal quickly and accurately. However, their need to communicate is not reduced by these limitations; in emergencies such as sudden illness or accidents, they must be able to reach family members immediately. The current common workaround is voice dialing, but on one hand the accuracy of voice dialing is hard to guarantee when the family member who needs help, such as an elderly person or a child, has imperfect speech; on the other hand, in situations such as natural disasters, crimes, or sudden illness, these family members may be unable to make any sound for voice dialing at all.
Therefore, a more convenient method for contacting family members is needed to solve the above problems.
Disclosure of Invention
In view of this, the present application aims to provide a family member communication method and system based on face recognition, so as to improve the efficiency of family member communication and solve the current technical problem that some family members with special needs cannot contact their family in an emergency.
Based on the above purpose, the present application provides a family member communication method based on face recognition, which includes:
setting a face recognition device and a voice recognition device in the communication terminal, wherein the voice recognition device prestores at least one user password template for triggering the face recognition process;
starting the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold;
the face recognition device captures a face image through rotation, and recognizes the expression characteristics of the user to form a user state characteristic set;
collecting the environment around the user in the rotation process, identifying the environmental characteristics around the user and forming an environmental characteristic set;
and importing the expression feature set and the environment feature set of the user into a deep learning network, predicting family members needing communication of the user, and starting a communication process.
In some embodiments, the method further comprises:
if the human body recognition algorithm determines that an action sequence does not conform to the standard action sequence, sending an error prompt to the user and suggesting a replacement action;
and if, for the same user, the human body recognition algorithm determines that the number of non-conforming action sequences exceeds a specified threshold, sending an alarm to an administrator.
In some embodiments, the method further comprises:
and if the family member's communication device cannot be connected, the user's communication device selects the next-ranked family member from the deep learning result and communicates with that member.
In some embodiments, the speech recognition apparatus pre-stores at least one user password template for triggering a face recognition process, including:
setting a whitelist user password template in the voice recognition device, and immediately starting the face recognition process after the voice recognition device captures speech matching the whitelist user password template;
and setting a blacklist user password template in the voice recognition device; after capturing speech matching the blacklist user password template, the voice recognition device ignores it.
In some embodiments, starting the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold includes:
after the voice recognition device captures a first voice for the first time, obtaining a first comparison result by calculating the similarity between the first voice and a corresponding password in the password template;
within a specified time interval, if the voice recognition device captures a second sound, obtaining a second comparison result by calculating the similarity between the second sound and the corresponding password in the password template;
and if the first comparison result and the second comparison result both fall into a preset similarity interval, starting the face recognition device.
In some embodiments, the face recognition device performs face image capture by rotation, identifies expression features of the user, and forms a user status feature set, including:
the face recognition device adjusts the image acquisition direction of the face recognition device according to the voice direction captured by the voice recognition device, and obtains the expression characteristics of the user;
the face recognition device obtains the emotional characteristics and the body state characteristics of the user through the expression characteristics and the body state of the user to form the user state characteristic set.
In some embodiments, the collecting of the environment around the user during the rotating, the identifying of the environmental features around the user, and the forming of the environmental feature set include:
the face recognition device takes the user as a circle center, adjusts the image acquisition direction of the face recognition device and obtains the environmental information around the user;
the face recognition device obtains surrounding crowd and geographic state features of the user through the surrounding environment information of the user to form the user environment feature set.
In some embodiments, each element in the user environment feature set is calculated by the following formula:
[Formula image GDA0003621693880000031: not reproduced in the text]
where j is the discrete distance series, i is the environment type series, f_i(·) is the environmental feature value of the i-th environment type, P is the feature value of the surrounding crowd, and G is the feature value of the geographic state.
Based on the above purpose, the present application further provides a family member communication system based on face recognition, including:
the construction module is used for arranging a face recognition device and a voice recognition device in the communication terminal, wherein the voice recognition device prestores at least one user password template for triggering the face recognition process;
the triggering module is used for starting the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold;
the face recognition module is used for capturing a face image by the face recognition device through rotation, recognizing the expression characteristics of the user and forming a user state characteristic set;
the environment identification module is used for collecting the environment around the user in the rotation process, identifying the environmental characteristics around the user and forming an environmental characteristic set;
and the communication module is used for importing the expression feature set and the environment feature set of the user into a deep learning network, predicting family members needing communication of the user and starting a communication process.
In some embodiments, the system further comprises:
and the delay module is used for the user's communication device to select the next-ranked family member from the deep learning result for communication if the family member's communication device cannot be connected.
In some embodiments, the building module comprises:
the whitelist unit is used for setting a whitelist user password template in the voice recognition device and immediately starting the face recognition process after the voice recognition device captures speech matching the whitelist user password template;
and the blacklist unit is used for setting a blacklist user password template in the voice recognition device; after the voice recognition device captures speech matching the blacklist user password template, the speech is ignored.
In general, the advantages of the present application and the experience it brings to the user are as follows: the method and system improve the efficiency of family member communication through a human body recognition algorithm, solving the current technical problem that some family members with special needs cannot contact their family in an emergency.
Drawings
In the drawings, like reference numerals refer to the same or similar parts or elements throughout the several views unless otherwise specified. The figures are not necessarily to scale. It is appreciated that these drawings depict only some embodiments in accordance with the disclosure and are therefore not to be considered limiting of its scope.
Fig. 1 is a flowchart illustrating a family member communication method based on face recognition according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating a family member communication method based on face recognition according to an embodiment of the present invention.
Fig. 3 is a block diagram illustrating a family member communication system based on face recognition according to an embodiment of the present invention.
Fig. 4 is a block diagram illustrating a family member communication system based on face recognition according to an embodiment of the present invention.
Fig. 5 shows a composition diagram of a building block according to an embodiment of the present invention.
Detailed Description
The present application will be described in further detail with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that, in the present application, the embodiments and features of the embodiments may be combined with each other without conflict. The present application will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
Fig. 1 shows a flow chart of a family member communication method based on face recognition according to an embodiment of the invention. As shown in fig. 1, the family member communication method based on face recognition includes:
and step S11, setting a face recognition device and a voice recognition device in the communication terminal, wherein the voice recognition device prestores at least one user password template for triggering the face recognition process.
Specifically, the communication terminal may refer to an electronic device having a communication function, such as a mobile phone, a telephone, a computer, and the like. The face recognition device and the voice recognition device can be used as a part of the communication terminal and embedded into the communication terminal, so that the whole equipment is portable and light.
In one embodiment, the voice recognition device prestores at least one user password template for triggering a face recognition process, and the method comprises the following steps:
setting a white list user password template in the voice recognition device, and immediately starting the face recognition process after the voice recognition device captures the voice in the white list user password template;
setting a blacklist user password template in the voice recognition device; after the voice recognition device captures speech matching the blacklist user password template, the speech is ignored.
In particular, a user may trigger the face recognition process through at least one password template. For example, when the user says "make a call to dad" or "make a call to mom," the face recognition process can be triggered.
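As a minimal sketch, the whitelist/blacklist password-template check could look like the following. The template phrases, the `similarity` helper, and the 0.8 threshold are all illustrative assumptions; the patent only specifies that whitelist matches start the face recognition process immediately and blacklist matches are ignored.

```python
# Hypothetical whitelist/blacklist password-template check.
# Phrases, similarity measure, and threshold are assumptions for illustration.
from difflib import SequenceMatcher

WHITELIST = ["make a call to dad", "make a call to mom"]
BLACKLIST = ["just kidding"]

def similarity(a: str, b: str) -> float:
    """Crude text similarity as a stand-in for acoustic matching."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def handle_utterance(text: str, threshold: float = 0.8) -> str:
    """Return the action the recognizer would take for one utterance."""
    if any(similarity(text, t) >= threshold for t in BLACKLIST):
        return "ignore"                    # blacklist match: do nothing
    if any(similarity(text, t) >= threshold for t in WHITELIST):
        return "start_face_recognition"    # whitelist match: trigger step S12
    return "no_action"
```

In a real terminal the matching would run on audio features rather than text, but the whitelist-before-dial control flow is the same.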
Step S12: after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold, the face recognition device is started.
Specifically, a user may inadvertently say a password from the password template during everyday conversation, so the face recognition device is only activated after two or more utterances similar to a password in the template are heard.
In one embodiment, after the voice recognition device captures at least two sounds with similarity smaller than a specified threshold with any one of the user password templates, the face recognition device is started, and the method comprises the following steps:
after the voice recognition device captures a first voice for the first time, obtaining a first comparison result by calculating the similarity between the first voice and a corresponding password in the password template;
within a specified time interval, if the voice recognition device captures a second sound, obtaining a second comparison result by calculating the similarity between the second sound and the corresponding password in the password template;
and if the first comparison result and the second comparison result both fall into a preset similarity interval, starting the face recognition device.
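The two-capture trigger above can be sketched as a small state machine. The 0.8–1.0 similarity interval and the 5-second window are assumed values; the patent only speaks of a "preset similarity interval" and a "specified time interval".

```python
class FaceRecognitionTrigger:
    """Sketch of the two-capture trigger from the embodiment above.

    Interval bounds and window length are illustrative assumptions.
    """

    def __init__(self, low: float = 0.8, high: float = 1.0, window: float = 5.0):
        self.low, self.high, self.window = low, high, window
        self.first_hit_time = None  # timestamp of the first matching capture

    def on_capture(self, similarity: float, now: float) -> bool:
        """Return True when the face recognition device should start."""
        if not (self.low <= similarity <= self.high):
            return False  # comparison result outside the similarity interval
        if self.first_hit_time is not None and now - self.first_hit_time <= self.window:
            self.first_hit_time = None  # reset after triggering
            return True                 # second matching capture in time: trigger
        self.first_hit_time = now       # first matching capture: wait for a second
        return False
```

A capture that arrives after the window simply becomes the new "first" capture, so a stray password weeks earlier cannot combine with a fresh one.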
Step S13: the face recognition device captures face images by rotating, recognizes the user's expression features, and forms a user state feature set.
In one embodiment, the face recognition device performs face image capture by rotation, recognizes expressive features of the user, and forms a user status feature set, including:
the face recognition device adjusts the image acquisition direction of the face recognition device according to the voice direction captured by the voice recognition device, and obtains the expression characteristics of the user;
the face recognition device obtains the emotional characteristics and the body state characteristics of the user through the expression characteristics and the body state of the user to form the user state characteristic set.
In particular, the acquisition angle of the face recognition device affects the quality of the captured images, so the device needs to select the acquisition direction that achieves the best effect.
Step S14, collecting the environment around the user in the rotation process, identifying the environmental features around the user, and forming an environmental feature set.
In one embodiment, the collecting of the environment around the user during the rotation, the identifying of the environmental features around the user, and the forming of the environmental feature set comprise:
the face recognition device takes the user as a circle center, adjusts the image acquisition direction of the face recognition device and obtains the environmental information around the user;
the face recognition device obtains surrounding crowd and geographic state features of the user through the surrounding environment information of the user to form the user environment feature set.
In particular, the face recognition device needs to collect information about the user's surroundings to determine the user's environmental characteristics. For example, if a feature such as a sharp object or a criminal is found around a child, the child's environmental characteristics can be determined accordingly.
In one embodiment, each element in the user environment feature set is calculated by the following formula:
[Formula image GDA0003621693880000061: not reproduced in the text]
where j is the discrete distance series, i is the environment type series, f_i(·) is the environmental feature value of the i-th environment type, P is the feature value of the surrounding crowd, and G is the feature value of the geographic state.
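Since the formula image itself is not reproduced in the text, the following is only an illustrative aggregation consistent with the symbols named above: per-type feature functions f_i evaluated at discrete distances j, combined with the crowd value P and geographic value G. The simple sum-plus-offsets combination is an assumption, not the patent's actual formula.

```python
def environment_feature_element(f, distances, env_types, P, G):
    """Illustrative only: the patent's formula image is not reproduced.

    f: mapping from environment-type index i to a function f_i(j) giving
       the environmental feature value of type i at discrete distance j.
    distances: the discrete distance series j.
    env_types: the environment type series i.
    P: surrounding-crowd feature value; G: geographic-state feature value.
    The sum over i and j plus P and G is an assumed combination.
    """
    total = sum(f[i](j) for i in env_types for j in distances)
    return total + P + G
```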
Step S15: the user state feature set and the environment feature set are fed into a deep learning network, the family member the user needs to contact is predicted, and the communication process is started.
Specifically, starting the communication process means initiating communication with the predicted contact. For example, if the network predicts that the user wants to communicate with "dad," then "dad" is called.
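Under heavy assumptions, step S15 can be sketched as a tiny linear scorer followed by a softmax ranking over family members. The member labels, weight matrix, and scoring scheme are hypothetical; the patent only states that the two feature sets are fed into a deep learning network that predicts whom to contact (a trained network would replace the single linear layer here).

```python
import math

MEMBERS = ["dad", "mom", "daughter"]  # hypothetical member labels

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def rank_members(state_features, env_features, weights):
    """Score each member with one linear layer over the concatenated
    state and environment feature sets, then rank by probability."""
    x = list(state_features) + list(env_features)
    scores = [sum(w * v for w, v in zip(row, x)) for row in weights]
    probs = softmax(scores)
    return sorted(zip(MEMBERS, probs), key=lambda p: p[1], reverse=True)
```

The ranked output also gives step S16 its fallback order when the top prediction cannot be reached.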
Fig. 2 is a flowchart illustrating a family member communication method based on face recognition according to an embodiment of the present invention. As shown in fig. 2, the family member communication method based on face recognition further includes:
and step S16, if the communication equipment of the family member can not be connected, the communication equipment of the user selects the family member with the next rank from the deep learning result to carry out communication.
Specifically, the intended contact may sometimes be unreachable, yet in an emergency the user must reach someone immediately. Moving on to the next-ranked contact prevents the situation from being delayed simply because the first contact cannot be reached.
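Step S16 amounts to walking down the ranked list until a call connects. The `dial` callback is a stand-in for whatever telephony API the terminal uses; only the ranked-fallback behaviour comes from the patent.

```python
def call_first_reachable(ranked_members, dial):
    """Try family members in descending predicted rank.

    ranked_members: member names, best prediction first.
    dial: hypothetical callback returning True when the call connects.
    Returns the first member reached, or None if nobody answers.
    """
    for member in ranked_members:
        if dial(member):
            return member
    return None  # nobody reachable; the system could escalate from here
```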
Fig. 3 is a block diagram illustrating a family member communication system based on face recognition according to an embodiment of the present invention. As shown in fig. 3, the whole family member communication system based on face recognition may be divided into:
the building module 31 is used for setting a face recognition device and a voice recognition device in the communication terminal, wherein the voice recognition device prestores at least one user password template for triggering the face recognition process;
the triggering module 32 is configured to start the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is within a specified threshold;
a face recognition module 33, configured to capture a face image by the face recognition device through rotation, recognize an expression feature of the user, and form a user state feature set;
an environment identification module 34, configured to collect an environment around the user in the rotation process, identify an environment feature around the user, and form an environment feature set;
and the communication module 35 is configured to import the expression feature set and the environment feature set of the user into a deep learning network, predict family members of the user who need to communicate, and start a communication process.
Fig. 4 is a block diagram of a family member communication system based on face recognition according to an embodiment of the present invention. As shown in fig. 4, the whole family member communication system based on face recognition further includes:
and a delay module 36, configured for the user's communication device to select the next-ranked family member from the deep learning result for communication if the family member's communication device cannot be connected.
Fig. 5 shows a composition diagram of a building block according to an embodiment of the present invention. As shown in fig. 5, the construction module 31 of the family member communication system based on face recognition includes:
a white list unit 311, configured to set a white list user password template in the speech recognition apparatus, and immediately start the face recognition process after the speech recognition apparatus captures the speech in the white list user password template;
a blacklist unit 312, configured to set a blacklist user password template in the speech recognition apparatus; after the speech recognition apparatus captures speech matching the blacklist user password template, the speech is ignored.
The functions of the modules in the systems in the embodiments of the present application may refer to the corresponding descriptions in the above methods, and are not described herein again.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
The logic and/or steps represented in the flowcharts or otherwise described herein, such as an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable read-only memory (CDROM). Further, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the embodiments above, the various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or a combination of the following techniques known in the art may be used: discrete logic circuits having logic gates for implementing logic functions upon data signals, application-specific integrated circuits having suitably combined logic gates, a programmable gate array (PGA), a field-programmable gate array (FPGA), and the like.
It will be understood by those skilled in the art that all or part of the steps of the method embodiments above may be implemented by a program instructing the relevant hardware; the program may be stored in a computer-readable storage medium and, when executed, performs one or a combination of the steps of the method embodiments.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing module, may each exist alone physically, or two or more units may be integrated into one module. The integrated module may be implemented either in hardware or as a software functional module. If implemented as a software functional module and sold or used as a separate product, the integrated module may also be stored in a computer-readable storage medium. The storage medium may be a read-only memory, a magnetic disk, an optical disc, or the like.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A family member communication method based on face recognition, characterized by comprising the following steps:
arranging a face recognition device and a voice recognition device in a communication terminal, the voice recognition device prestoring at least one user password template for triggering a face recognition process;
starting the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is smaller than a specified threshold;
capturing, by the face recognition device through rotation, a face image, and recognizing expression features of the user to form a user state feature set;
collecting the environment around the user during the rotation, and recognizing environmental features around the user to form an environmental feature set;
and importing the user state feature set and the environmental feature set into a deep learning network, predicting the family member with whom the user needs to communicate, and starting a communication process.
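The triggering condition of claim 1 can be sketched as follows. This is an illustrative sketch only: the similarity function and every name here (`similarity`, `should_start_face_recognition`) are assumptions for demonstration, as the patent does not specify an implementation.

```python
# Illustrative sketch of claim 1's trigger: start face recognition after at
# least two captured sounds whose similarity to any prestored password
# template is below the specified threshold. All logic here is hypothetical.

def similarity(sound: str, template: str) -> float:
    """Toy similarity: fraction of position-wise matching characters."""
    matches = sum(a == b for a, b in zip(sound, template))
    return matches / max(len(sound), len(template))

def should_start_face_recognition(sounds, templates, threshold=0.5):
    """Count captures dissimilar to every template; trigger at two or more."""
    hits = 0
    for s in sounds:
        if any(similarity(s, t) < threshold for t in templates):
            hits += 1
    return hits >= 2

# Two of the three captures are dissimilar to the template, so this triggers.
triggered = should_start_face_recognition(
    ["open sesame", "hello there", "good morning"], ["open sesame"]
)
```

A real device would of course compare acoustic features rather than strings; the string comparison above only stands in for the claimed thresholded matching.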
2. The method of claim 1, further comprising:
if the communication device of the predicted family member cannot be reached, selecting, by the user's communication device, the next-ranked family member from the deep learning result for communication.
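Claim 2's fallback amounts to walking the ranked prediction list until a connection succeeds. A minimal sketch, in which `ranked_members` and `try_connect` are hypothetical stand-ins for the deep learning output and the dialing step:

```python
# Hypothetical sketch of claim 2: attempt family members in the order ranked
# by the deep learning result, falling back to the next rank on failure.

def place_call(ranked_members, try_connect):
    """Return the first reachable member, or None if every attempt fails."""
    for member in ranked_members:
        if try_connect(member):
            return member
    return None

reachable = {"daughter"}                       # pretend only she answers
result = place_call(["son", "daughter"], lambda m: m in reachable)
```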
3. The method according to claim 1, wherein prestoring, by the voice recognition device, at least one user password template for triggering the face recognition process comprises:
setting a whitelist user password template in the voice recognition device, the face recognition process being started immediately after the voice recognition device captures a voice matching the whitelist user password template;
and setting a blacklist user password template in the voice recognition device, the voice recognition device ignoring a voice matching the blacklist user password template after capturing it.
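The whitelist/blacklist dispatch of claim 3 can be sketched as a simple three-way decision. The matching by set membership and the action strings are illustrative assumptions, not the patent's implementation:

```python
# Illustrative sketch of claim 3: a blacklist match is ignored, a whitelist
# match starts face recognition immediately, anything else does nothing.

def handle_voice(voice, whitelist, blacklist):
    """Return the action taken for a captured voice password."""
    if voice in blacklist:
        return "ignore"                   # blacklist template: ignore it
    if voice in whitelist:
        return "start_face_recognition"   # whitelist template: start at once
    return "no_action"                    # neither list matched

action = handle_voice("call grandma", {"call grandma"}, {"shut up"})
```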
4. The method of claim 1, wherein starting the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is smaller than a specified threshold comprises:
obtaining a first comparison result, after the voice recognition device captures a first voice, by calculating the similarity between the first voice and the corresponding password in the password template;
obtaining a second comparison result, if the voice recognition device captures a second voice within a specified time interval, by calculating the similarity between the second voice and the corresponding password in the password template;
and starting the face recognition device if both the first comparison result and the second comparison result fall into a preset similarity interval.
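The two-capture gate of claim 4 combines a time window with a similarity band. The following sketch assumes the comparison results are precomputed similarity scores; the interval width, the band, and the function name `gate` are all illustrative assumptions:

```python
# Sketch of claim 4: two voice captures, the second within a specified time
# interval of the first; the face recognition device starts only if both
# similarity results fall into a preset interval. Thresholds are assumed.

def gate(first, second, interval_s=5.0, band=(0.0, 0.5)):
    """first/second are (timestamp_seconds, similarity) pairs."""
    t1, s1 = first
    t2, s2 = second
    lo, hi = band
    within_time = (t2 - t1) <= interval_s           # second capture in window
    both_in_band = lo <= s1 <= hi and lo <= s2 <= hi  # both results in interval
    return within_time and both_in_band

ok = gate((0.0, 0.2), (3.0, 0.4))   # both scores in band, captures 3 s apart
```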
5. The method of claim 1, wherein the face recognition device capturing a face image through rotation, recognizing expression features of the user, and forming a user state feature set comprises:
adjusting, by the face recognition device, its image acquisition direction according to the voice direction captured by the voice recognition device, and obtaining the expression features of the user;
obtaining, by the face recognition device, emotional features and body-posture features of the user from the expression features and body posture of the user, to form the user state feature set.
6. The method of claim 1, wherein collecting the environment around the user during the rotation, recognizing the environmental features around the user, and forming an environmental feature set comprises:
adjusting, by the face recognition device, its image acquisition direction with the user as the center of rotation, and collecting environmental information around the user;
obtaining, by the face recognition device, features of the crowd surrounding the user and of the geographic state from the environmental information around the user, to form the user environmental feature set.
7. The method of claim 6, wherein each element in the user environment feature set is calculated by the following formula:
(The formula appears only as an image in the original publication and is not reproduced here.)
wherein j is the discrete distance index, i is the environment type index, f_i(·) is the environment feature value of the i-th environment type, P is the feature value of the surrounding crowd, and G is the feature value of the geographic state.
8. A family member communication system based on face recognition, characterized by comprising:
a construction module, configured to arrange a face recognition device and a voice recognition device in a communication terminal, the voice recognition device prestoring at least one user password template for triggering a face recognition process;
a triggering module, configured to start the face recognition device after the voice recognition device captures, at least twice, a sound whose similarity to any one of the user password templates is smaller than a specified threshold;
a face recognition module, configured to capture, by the face recognition device through rotation, a face image, and to recognize expression features of the user to form a user state feature set;
an environment recognition module, configured to collect the environment around the user during the rotation, and to recognize environmental features around the user to form an environmental feature set;
and a communication module, configured to import the user state feature set and the environmental feature set into a deep learning network, predict the family member with whom the user needs to communicate, and start a communication process.
9. The system of claim 8, further comprising:
a delay module, configured to select, when the communication device of the family member cannot be reached, the next-ranked family member from the deep learning result for the user's communication device to communicate with.
10. The system of claim 8, wherein the construction module comprises:
a whitelist unit, configured to set a whitelist user password template in the voice recognition device, the face recognition process being started immediately after the voice recognition device captures a voice matching the whitelist user password template;
and a blacklist unit, configured to set a blacklist user password template in the voice recognition device, the voice recognition device ignoring a voice matching the blacklist user password template after capturing it.
CN202110225394.0A 2021-03-01 2021-03-01 Family member communication method and system based on face recognition Active CN113160826B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110225394.0A CN113160826B (en) 2021-03-01 2021-03-01 Family member communication method and system based on face recognition


Publications (2)

Publication Number Publication Date
CN113160826A CN113160826A (en) 2021-07-23
CN113160826B (en) 2022-09-02

Family

ID=76883754

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110225394.0A Active CN113160826B (en) 2021-03-01 2021-03-01 Family member communication method and system based on face recognition

Country Status (1)

Country Link
CN (1) CN113160826B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101715102A (en) * 2008-10-02 2010-05-26 宝利通公司 Displaying dynamic caller identity during point-to-point and multipoint audio/video conference
CN106790054A (en) * 2016-12-20 2017-05-31 四川长虹电器股份有限公司 Interactive authentication system and method based on recognition of face and Application on Voiceprint Recognition
CN107895576A (en) * 2017-11-10 2018-04-10 北京康力优蓝机器人科技有限公司 The identification of service robot starts method and device
CN111634769A (en) * 2020-05-13 2020-09-08 北京云迹科技有限公司 Method and device for controlling elevator operation

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6693111B2 (en) * 2015-12-14 2020-05-13 カシオ計算機株式会社 Interactive device, robot, interactive method and program
TW202027062A (en) * 2018-12-28 2020-07-16 塞席爾商元鼎音訊股份有限公司 Sound playback system and output sound adjusting method thereof


Also Published As

Publication number Publication date
CN113160826A (en) 2021-07-23


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant