WO2022143300A1 - Visitor talkback control method, talkback control apparatus, system, electronic device, and storage medium - Google Patents

Visitor talkback control method, talkback control apparatus, system, electronic device, and storage medium Download PDF

Info

Publication number
WO2022143300A1
WO2022143300A1 PCT/CN2021/140086 CN2021140086W WO2022143300A1 WO 2022143300 A1 WO2022143300 A1 WO 2022143300A1 CN 2021140086 W CN2021140086 W CN 2021140086W WO 2022143300 A1 WO2022143300 A1 WO 2022143300A1
Authority
WO
WIPO (PCT)
Prior art keywords
intercom
image data
preset
information
determined
Prior art date
Application number
PCT/CN2021/140086
Other languages
French (fr)
Chinese (zh)
Inventor
钟浩华
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2022143300A1 publication Critical patent/WO2022143300A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • H04N7/183Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a single remote source
    • H04N7/186Video door telephones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • the present application relates to the field of communication control, and more particularly, to a visiting intercom control method, an intercom control device, a system, an electronic device and a storage medium.
  • smart doorbells which can perform functions such as video shooting, monitoring, and intercom, and can also be linked with other home products.
  • intercom the current doorbell products still have some inconvenience in the process of intercom.
  • Embodiments of the present application provide a visiting intercom control method, an intercom control device, a system, an electronic device, and a storage medium. Improve the convenience of intercom control.
  • an embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device.
  • the method includes: determining image data collected during an intercom process or an intercom request process, and the image data includes an image of an intercom requesting end. data and/or image data of the intercom receiving end; when the image data satisfies the first preset condition, the end of the intercom process is triggered or the process of the intercom request is triggered to end.
  • an embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device.
  • the method includes: determining image data collected during the intercom process and intercom voice collected during the intercom process, and the image data includes a pair of The image data of the talk requesting end and/or the image data of the intercom receiving end; the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end; when the image data satisfies the first preset condition and the intercom voice satisfies the second preset When the condition is set, the intercom process is triggered to end.
  • an embodiment of the present application provides a visiting intercom control device, the intercom control device includes: a determining unit configured to determine image data collected during an intercom process or an intercom request process, and the image data includes: The image data of the intercom requesting end and/or the image data of the intercom receiving end; the triggering unit is configured to trigger the end of the intercom process or the end of the intercom request process when the image data satisfies the first preset condition.
  • an embodiment of the present application provides a visiting intercom control device
  • the intercom control device includes: a determining unit configured to determine image data collected during the intercom process and intercom voice collected during the intercom process , the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, and the intercom voice includes the speech of the intercom requesting end and/or the speech of the intercom receiving end; the triggering unit is configured to be used when the image data satisfies the first When a preset condition and the intercom voice satisfies the second preset condition, the end of the intercom process is triggered.
  • an embodiment of the present application provides a visiting intercom system, the system includes a doorbell device and a TV, the doorbell device is connected to the TV, and the doorbell device or the TV is configured to determine an intercom process or an intercom request
  • the image data collected in the process, the image data includes the image data of the intercom requesting end collected by the doorbell device and/or the image data of the intercom receiving end collected by the TV; when the image data meets the first preset condition, the doorbell device or TV It is configured to trigger the end of the intercom procedure or to trigger the end of the intercom request procedure.
  • embodiments of the present application provide an electronic device, including: one or more processors; a memory; and one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by a or multiple processors executing, one or more programs configured to perform a method as related to any one of the first aspect or the second aspect.
  • an embodiment of the present application provides a computer-readable storage medium, where a program code is stored in the computer-readable storage medium, and the program code can be called by a processor to execute any one related to the first aspect or the second aspect. item method.
  • FIG. 1 is a schematic flowchart of a visiting intercom control method provided by an embodiment of the present application
  • FIG. 2 is a schematic flowchart of another visiting intercom control method provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of an electronic device provided by an embodiment of the present application.
  • FIG. 4 is a schematic diagram of a computer-readable storage medium provided by an embodiment of the present application.
  • FIG. 5 is a block diagram of functional units of a visiting intercom control device provided by an embodiment of the present application.
  • FIG. 6 is a schematic diagram of the architecture of a visiting intercom system provided by an embodiment of the present application.
  • FIG. 7 is a schematic flowchart of a visiting intercom system provided by an embodiment of the present application.
  • An embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device.
  • the electronic device includes at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server; wherein the intercom requesting end device may be It is understood as an outdoor device in the visiting intercom, which is mainly used to control and initiate the intercom. It can include: at least one of a doorbell outdoor unit (which may include an image acquisition unit), a camera, and an access control outdoor unit, where the camera can be a cat-eye camera. It can be a surveillance camera, etc., which is not limited.
  • the intercom receiving end device can be understood as the indoor device in the visiting intercom, such as smart home equipment, which is mainly used to control the receiving intercom, which can include: doorbell indoor unit, access control indoor unit, TV, router, gateway device, customer front At least one of the CPE (Customer Premise Equipment), speaker, smart camera, TV box, computer, and mobile phone.
  • CPE Customer Premise Equipment
  • the intercom can be a voice intercom or a video intercom.
  • the video intercom can be a single-party video intercom, that is, only one party can display video images, or it can be two-party or multi-party video intercom, that is, multiple parties can display video images. .
  • the user may be a visited user or a visiting user (eg, a visitor).
  • a visiting intercom control method includes: step S10. Determine the image data collected during the intercom process or the intercom request process, and the image data includes a pair of Speak the image data of the requester and/or the image data of the intercom receiver;
  • the intercom process can be understood as the process in which the visiting user initiates the intercom but the interviewed user has not yet accepted the intercom.
  • the speaking requesting end or the intercom receiving end device generates a prompt such as voice or image.
  • This stage can be considered as an intercom request process, which is used to request and wait for the interviewed user of the intercom receiving end to accept the intercom; for example, step S10 includes: S101 .
  • Determine the image data collected in the intercom request process the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end.
  • step S10 includes: S102. Determine the image data collected during the intercom process, where the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end.
  • the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, which may only include the image data of the intercom requesting end, or only include the image data of the intercom receiving end, or include the image data of the intercom requesting end.
  • Image data and image data of the intercom receiver are only include the image data of the intercom requesting end, or only include the image data of the intercom receiving end, or include the image data of the intercom requesting end.
  • the image data of the intercom requesting end generally refers to the image data collected by the image acquisition unit of the intercom requesting end device
  • the image data of the intercom receiving end generally refers to the image data collected by the image acquisition unit of the intercom receiving end device. It should be noted that, it may also be image data collected by other devices connected to the intercom receiving end device or the intercom requesting end device, and there may be multiple intercom receiving end devices. Determine the image data collected during the intercom process or the intercom request process. It can be an image acquisition unit (either the requester or the receiver) to collect image data, and the intercom requester device or the intercom receiver device or the cloud.
  • the server In the case of acquisition by the server, etc., it can be acquired by the CPU or GPU of the intercom requesting end device and the intercom receiving end device.
  • the transmission can be wired or wireless.
  • the acquisition device may be different from the device that executes the above S10.
  • the image data determined in S10 may be part or all of the image data collected during the intercom process or the intercom request process.
  • the image acquisition unit of the intercom requesting end device can collect the image data of the intercom requesting end in real time, and the image data of the intercom receiving end can be the intercom receiving end device.
  • the image data collected by the image acquisition unit in real time during the intercom process in which the image data of the intercom requester is generally the image data that can be collected by the image acquisition unit of the intercom requester device, such as the range that can be captured by the camera.
  • the image data of the intercom receiving end is generally the image data that can be collected by the image acquisition unit of the intercom receiving end device, such as the image data within the range that can be captured by a TV camera.
  • the intercom requesting end device, the intercom receiving end device, the cloud server, etc. can obtain the above collected image data.
  • the device on the intercom request side is the doorbell external unit, which can be understood as a doorbell device installed outside the door.
  • the doorbell external unit After receiving the intercom request command, the doorbell external unit starts to collect the image data of the intercom requester and sends it to the doorbell internal unit.
  • the doorbell internal unit will issue a prompt after receiving the intercom request command (such as a voice prompt, or a prompt by playing the image data of the intercom request terminal, or a vibration prompt, etc., which is not limited here) to Remind the interviewed user to answer the intercom, the doorbell internal unit enters the intercom process after receiving the instruction to answer the intercom, so that the visitor and the interviewed user can intercom through the doorbell external unit and the doorbell internal unit, and the doorbell internal unit can receive the doorbell
  • the image data collected by the image acquisition unit (such as a camera) of the external machine is used for image playback.
  • the image of the intercom receiving end is generally not played to the intercom requesting end, that is, visitors generally cannot see the intercom receiving end. Image of interviewed user.
  • the doorbell internal unit (intercom receiver), cloud server, etc. can also receive the image data collected by the doorbell external unit to obtain relevant image data.
  • the image acquisition unit of the doorbell external unit collects images in real time
  • the data is then sent to the doorbell internal unit or cloud server through wired or wireless communication; the doorbell internal unit is generally used indoors, and the related functions of the doorbell internal unit can also be integrated into various terminal devices, such as TV sets , speakers, mobile phones, tablet computers, etc.
  • a picture will pop up on the TV screen (if the TV is in use, a picture-in-picture can also be displayed on the playing TV screen) to display the intercom.
  • the image of the requesting end after the TV receives the intercom instruction from the interviewed user, it establishes the intercom with the doorbell end.
  • step S10 of this embodiment can also be applied to the intercom request process, that is, in the intercom request process, the image data collected in the intercom request process is acquired, and the image data includes the image data of the intercom requesting end; the image data Can be sent to the intercom receiver or not.
  • the above-mentioned image data may be the image data of the intercom requesting end collected in real time during the intercom process or the intercom request process, or the image data of the intercom receiving end, or the image data of the intercom requesting end and the intercom. image data at the receiving end.
  • the end of the intercom request process is triggered or the end of the intercom process is triggered; it can also be that when the image data of the intercom receiving end meets the first preset condition, Triggering the end of the intercom request process or triggering the end of the intercom process, or triggering the end of the intercom request process or triggering the intercom when both the image data of the intercom requester and the image data of the intercom receiver meet the first preset condition Process ends.
  • the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end
  • triggering the end of the intercom process or triggering the end of the intercom request process including:
  • the end of the intercom process is triggered or the end of the intercom request process is triggered.
  • triggering the end of the intercom process can be understood as triggering the end of the intercom, so that both the visiting user and the interviewed user cannot continue to communicate through the intercom device.
  • the end of the process of triggering the intercom request can be understood as triggering the end of the intercom request, that is to say, the intercom request is no longer issued to wait for the user to accept it.
  • the intercom request process triggered by data analysis ends.
  • step S30 may include step S301 or step S302, therefore, step S10 and step S30 may include the following embodiments:
  • a visiting intercom control method including:
  • Step S101 Determine the image data collected in the intercom request process, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
  • Step S301 When the image data satisfies the first preset condition, trigger the intercom request process to end;
  • an access control method includes:
  • Step S102 Determine the image data collected in the intercom process, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
  • Step S302 When the image data satisfies the first preset condition, the intercom process is triggered to end.
  • controlling the intercom request according to the image data of the intercom requester can prevent the visitor from leaving, etc., because the intercom request is still meaningless for a certain period of time, resulting in power consumption problems. Or the disturbance to the surrounding or indoor interviewed users, or the inconvenience caused by the need for visitors or interviewed users to manually shut down, etc.
  • the intercom request can also be controlled according to the image data of the intercom receiver. For example, when the image of the receiving end satisfies the first preset condition, for example, does not include character feature information, etc., the intercom request is ended, thereby avoiding waiting for the visitor.
  • the intercom request can also be controlled according to the image data of the intercom requesting end and the image data of the intercom receiving end, and the intercom request process can be triggered only when both meet the first preset condition. end, so as to avoid false triggers caused by the visited user being out of the monitoring range of the receiving end device or misjudging the characteristic information of the visitor or the visited user.
  • the intercom can also be controlled only according to the image data of the intercom requester, so as to avoid the visitor from continuing to be in the intercom request state after leaving; Avoid the presence of the interviewed user, but still keep the visitor waiting; or control the intercom by combining the image data of the intercom requester and the intercom receiver, so that the end of the intercom can be triggered only when both meet the first preset condition. , enhance the convenience of operation and the accuracy of control.
  • the image data satisfies the first preset condition it may be when a sampled frame image in the acquired image data satisfies the first preset condition (that is, the current sampled frame image satisfies the first preset condition), the trigger is triggered.
  • the intercom request process ends or the intercom process is triggered. This method can trigger the end of the intercom relatively quickly, but there may be some misjudgments. For example, the visitor temporarily exceeds the range of the camera because of a certain posture change.
  • the image data satisfies the first preset condition it may also be when the images of the consecutive preset sampling frames meet the first preset condition.
  • the multi-sampled frame images all meet the first preset condition, thereby improving the judgment. accuracy.
  • the image data satisfies the first preset condition
  • the above methods can be used in combination.
  • the sampled frame image may be image data of some or all of the frames selected from the determined image data for analysis.
  • the sampling period can be separated by a specific number of frames, and the specific number of frames can be a positive integer greater than or equal to zero, that is, every frame can be sampled and analyzed, or analyzed at specific frame intervals; of course, sampling can also be performed at specific time intervals. analyze.
  • the image data satisfies the first preset condition, it may be that no character feature information is detected in the image data, or it may be that the preset behavior information of characters is detected in the image data, or The preset behavior information of the person is detected in the data, and then the person characteristic information is not detected in the image data, that is, if the preset time period after the preset behavior information of the person is detected in the image data, no information is detected in the image data.
  • the character feature information it is determined that the image data meets the first preset condition, which means that the character first changes the preset behavior and then leaves the monitoring area.
  • the preset time period can be set according to actual needs. This is not limited.
  • the character feature information may include: at least one of face information, body contour information, and human body infrared information; the character preset behavior information includes: at least one of back-turn information, side-turn information, and distance information; wherein,
  • the face information may include, but is not limited to, facial features information, skin color information, face contour information, pupil information, etc.; the human body contour information may be part of the human body contour or the entire human body contour, such as head contour, upper body contour, side contour, frontal contour, Back profile, etc. It can be understood that the backward turn information, the side turn information or the departure information can be obtained according to the changes of the face information and the body contour information.
  • the back turn information may be determined by detecting the change from the front profile to the side profile and then to the back profile
  • the side turn information may be determined by detecting the change from the front profile to the side profile
  • the away information may be detected by detecting the back profile in the image.
  • the proportional change is determined.
  • the image acquisition device of the doorbell outdoor unit collects image data in real time during the intercom process.
  • no character feature information is detected in the collected image data, it can be considered that the visitor has left the collection range of the image acquisition unit, that is, the visitor.
  • Leave which automatically ends the intercom, without the need for the interviewed user or visitor to end the intercom. For example, by not detecting the face information, it can be quickly determined that the visitor is about to leave or has already left.
  • the intercom when the visitor turns around, the face information cannot be detected, so the intercom is automatically ended, which can avoid the user's manual operation, and at the same time, the fastest
  • the doorbell internal unit is integrated with other devices, such as a TV
  • the intercom can be ended as soon as possible, so that the interviewed users can continue to watch TV; another example is integrated in a speaker.
  • the human body contour information is used as the character feature information. When the human body contour information is not detected in the image data, it means that the visitor has left, so that it can be more accurately judged whether the intercom needs to end.
  • the character feature information that is not detected in the image data may include at least one of the following: no character feature information is detected in a sampled frame image in the image data; continuous preset sampling frame number in the image data. No person feature information is detected in the image; no person feature information is detected in the image data for the first preset time. Among them, the collection of image data can be collected in real time. If no character feature information is detected in a sampled frame image of the image data, it can be understood that no character feature information is detected in the current sample frame image, which can trigger the end of the intercom request process or At the end of the intercom process, of course, the acquisition and analysis can be two processes, or even carried out by two devices.
  • the determined (acquired) image data can be part or all of the acquired image data.
  • the sampled data for analysis may be part or all of the determined (acquired) image data.
  • the setting of the preset number of sampling frames and the first preset time can increase the accuracy of detection.
  • the preset number of sampling frames can be completed by counting. After analyzing the image data of the continuous preset number of sampling frames, the count is re-counted. Of course, It can also start counting when it is determined that the current sampling frame image data does not detect the character feature information, and the first preset time can also refer to a similar method, of course, other methods can also be used, which are not limited here.
  • the preset behavior information of the person detected in the image data may include at least one of the following: it is detected that the outline of the human body changes from a frontal outline to a side outline (which can be considered as a A way to detect lateral turning information); in the image of the continuous sampling frame number in the image data, it is detected that the contour of the human body changes from the frontal contour to the side contour and then to the back contour (it can be considered as a way to detect the backward turning information).
  • sampling can be performed once every specific number of frames, which can be a positive integer greater than or equal to zero; Each frame of image is analyzed, and part of the frame image can also be sampled. Since a character's behavior may be a continuous action, it needs to be determined by analyzing the image data of consecutively sampled frames. For example, the image data of the continuous sampling frame number is detected, and when the preset character behavior is detected, the end of the intercom process or the end of the intercom request process is triggered. When a frame is a back profile, the end of the intercom process or the end of the intercom request process is triggered. In the behavior of moving away, it can be determined by the proportion of the human body contour in the captured image.
  • the area contained by the human body contour will become smaller in the total area of the image. Specifically, it can be reduced in the above Basically, when the occupied area is less than the preset proportion, the end of the intercom process or the end of the intercom request process is triggered; specifically, on the basis of the above reduction, the number of frames can be counted to become smaller, for example, the number of frames becomes smaller from the beginning.
  • the sampling frame count starts to count, and the related end action is triggered when the smaller sampling frame number reaches the first preset sampling frame number. From another angle, since the camera may be fixed when the person is far away, the outline of the human body that can be photographed will increase.
  • the end intercom or intercom request can be triggered.
  • the ratio is reached, the end of the intercom process or the end of the intercom request process is triggered.
  • the sampling frame number is used. Since the behavior of the character is relatively complex, directly using the collected continuous frame images may increase the misjudgment.
  • the detection effect may be related to the setting position and setting method of the image acquisition module, so in the case of no conflict, the above methods can be combined for detection to improve the applicability.
  • the contour can be the head, upper body, whole body, etc., which needs to be determined according to the situation that the image acquisition device can capture, such as the setting position of the camera and the viewing angle width of the camera.
  • the front profile, back profile or side profile can be determined according to the face information, especially the large front profile may be similar to the back profile, so in order to distinguish, it can also be distinguished by combining facial information such as facial features.
  • the side turn information, the back turn information, and the distance information can also be determined in other ways. It should be noted that, by detecting the side turn information, it will trigger a request to terminate the intercom or terminate the intercom, which can respond quickly, but it will also increase misjudgment.
  • triggering the end of the intercom process or triggering the end of the intercom request process may include: when the image data of the intercom requesting end satisfies the first preset condition and the image data of the intercom receiving end satisfies the first preset condition, the intercom process is triggered to end or the intercom request process is triggered to end.
  • the first preset condition There may be various specific implementations of the first preset condition.
  • the first preset condition satisfied by the image data of the intercom requesting end and the first preset condition satisfied by the image data of the intercom receiving end may be the same or different. That is to say, the specific conditions met by the two may be different, for example, the image data of the intercom requesting end satisfies that no person information is detected in the image data, while the image data of the intercom receiving end satisfies that the distance behavior is detected in the image data, etc.
  • the image data when no face information, human body contour information, or human body infrared information is detected in the image data, it can be considered that the person on the intercom side has left.
  • the captured image data is monitored.
  • face information is detected in a certain sampling frame image or a preset sampling frame number image
  • the end of the intercom process is triggered.
  • the preset sampling frame number image can also prevent visitors from talking during the intercom process. Misjudgment occurs due to temporary movement.
  • an appropriate preset sampling frame number can be set in combination with the accuracy of the judgment and the timeliness of the trigger.
  • the image data of the intercom requesting end is generally collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the intercom receiving end device.
  • the docking requester device can generally be installed outdoors to obtain visitor information, while the intercom receiver device can be installed indoors. It can be a specialized device, such as a doorbell, or integrated in other electronic devices. There can be one or more than one machine, speakers, etc. Furthermore, acquisition of image data and analysis of image data may be performed by different devices.
  • intercom requesting end devices and intercom receiving end devices may not have image capture functions, but some of the above products It is also possible to integrate image acquisition functions and even image display functions, such as speakers with cameras, speakers with displays, etc. That is to say, the intercom receiver equipment listed above may or may not have the image acquisition function. When it does not have the image acquisition function, if you need to obtain the image data of the intercom receiver, you can use the Other devices with image acquisition function to collect the image data of the intercom receiver, such as smart cameras.
  • the image data of the intercom receiving end needs to be used in some scenarios, it can be considered that the above-mentioned intercom requesting end device or intercom receiving end device has the image acquisition function or can obtain the intercom receiving end from the device with the image acquisition function. image data.
  • the playback mode of the image data of the intercom requesting end can be determined according to the character information data or the device state data determined by the intercom receiving end device. Specifically, it may include but is not limited to the following methods: when it is determined that the TV is in the running state, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; in this case, the viewing of the interviewed users can be reduced.
  • Influence of TV in one case, when it is determined that the TV is in running state, and it is determined that the interviewed user and the TV are within the preset range, it is determined that the image data of the intercom requester is played through the TV in a picture-in-picture manner; In this case, the positional relationship between the interviewed user and the TV is also considered, so as to infer whether the interviewed user can know the intercom request or whether it is convenient for the interviewed user to intercom.
  • the preset range can be determined according to the interviewed user himself.
  • the default is within the range that can be captured by the TV camera, that is to say, whether there is an interview is detected by the image data captured by the TV camera (such as the image data of the intercom receiver).
  • User information if any, is considered to be within the preset range.
  • the TV when it is determined that the TV is in the off state, it is determined that the image data of the intercom requester is played through the TV in a full-screen display; Display the image. If the TV is in a completely off state, start the TV to make the TV run, and play the image data of the intercom request terminal through the TV in a full-screen display mode. If the TV is in a sleep state, turn the TV on.
  • the intercom receiver device can also be a mobile phone.
  • the interviewed user when it is determined that the mobile phone is in use by the interviewed user, it is determined to play the image data of the intercom requesting end through the mobile phone; in this case, the interviewed user can be detected by To judge whether the interviewed user is using the mobile phone by touching the screen of the mobile phone or whether it is playing audio and video data.
  • the image data captured by the mobile phone camera can also be used to determine whether the interviewed user is using it. If the interviewed user information is obtained, it is determined that the interviewed user is using a mobile phone. It can also be a computer scenario, including home computers, tablet computers, etc.
  • the use state can be determined by means of a mobile phone or a TV.
  • the interview user is using a certain intercom receiver device, the image data can be played through the intercom receiver device first, so as to quickly remind the interviewed user that there is a visitor, or it is more convenient for the user to play the image data. Intercom etc.
  • the above-mentioned visiting intercom control method further includes: S20. Determine the intercom voice collected during the intercom process.
  • step S30 The above-mentioned when the image data meets the first preset condition, triggering the end of the intercom process or triggering the intercom request process to end, can be replaced with step S40: when the image data meets the first preset condition and When the intercom voice satisfies the second preset condition, the intercom process is triggered to end. That is to say, in addition to the judgment based on the image data, the judgment is also combined with the intercom voice data, thereby increasing the accuracy of the judgment.
  • the intercom voice satisfies the second preset condition, which may include: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time.
  • the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time.
  • the intercom voice is acquired, it may be that there is no voice signal in the collected sound signal, or it may not be able to collect any sound signal, or it may be that the voice signal of both parties in the intercom cannot be collected, that is to say, it is possible to identify the two parties who have just spoken to each other.
  • the voice information so as to determine whether the two sides stop the intercom, this method can improve the accuracy in the case of a noisy voice environment.
  • the TV or the remote control of the TV collects the data (such as the microphone on the TV or the remote ) the first preset voice of the interviewed user at the intercom receiver, triggers the intercom process; or, during the intercom process, if the TV or the remote The microphone collects) the second preset voice of the interviewed user at the intercom receiving end, which triggers the end of the intercom.
  • the first preset voice can be used as an intercom request, thereby triggering the entry into the intercom stage.
  • the second preset voice may be used to end the intercom, and may be a voice containing preset keywords such as "end the intercom” and "goodbye".
  • the method is controlled by voice, which is convenient for the user to control the intercom. It should be noted that, in this solution, it is clarified that the user's voice at the receiving end collected by the TV or TV remote control is not the voice of the intercom, so as to avoid being affected by the voice of the visiting customer.
  • the second preset voice used to end the intercom or the intercom request may be the interviewed user's voice collected by the TV or the remote control, or the intercom voice. The description of the second preset condition" will not be repeated here.
  • steps S10 and S30 can both be performed by the intercom requesting end device, for example
  • the image acquisition unit of the intercom requesting end device can collect the image data of the requesting end during the intercom request or the intercom process, and the image data of the receiving end can be collected by the image acquisition unit of the intercom receiving end during the intercom request or the intercom process.
  • the intercom requesting end device determines (can be understood as acquiring) the image data (including the image data of the intercom requesting end and/or the image data of the intercom receiving end), and when the intercom requesting end device determines When the image data meets the first preset condition, the intercom request process is triggered or the intercom process ends.
  • the trigger here can be the end of direct control, or a signal is sent to the intercom receiver device, which is controlled by the intercom receiver device. End, can belong to the protection scope of the trigger, there is no limit to this.
  • both steps S10 and S30 may be performed by the intercom receiving end device, for example, the image acquisition unit of the intercom requesting end device may collect the image data of the requesting end during the intercom request or the intercom process, which may be performed by the intercom requesting end device.
  • the image acquisition unit of the receiving end collects the image data of the receiving end during the intercom request or the intercom process, and then the intercom receiving end device (such as its processor) determines (can be understood as acquiring) the image data (including the image of the intercom requesting end).
  • both steps S10 and S30 may be performed by a cloud server.
  • the image acquisition unit of the intercom requesting end device may collect the image data of the requesting end during the intercom request or the intercom process, and the image data of the intercom receiving end may be collected.
  • the acquisition unit collects the image data of the receiving end during the intercom request or the intercom process, and then the cloud server determines (can be understood as acquiring) the image data (including the image data of the intercom requesting end and/or the image data of the intercom receiving end), And when the cloud server determines that the image data meets the first preset condition, it triggers the end of the intercom request process or the end of the intercom process.
  • the trigger here can be sending a signal to the intercom requesting end device or sending a signal to the intercom receiver.
  • the end device is controlled by the intercom requesting end device or the intercom receiving end device. All of the above can belong to the protection scope of the trigger, and there is no restriction on this. It should be noted that, since the electronic device may include at least one of a talk requesting end device, an intercom receiving end device, and a cloud server, steps S10 and S30 may be performed on the same or different electronic devices.
  • the image data collected during the intercom process or the intercom request process is determined, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end; when the image data satisfies the first preset condition, Triggering the end of the intercom process or triggering the end of the intercom request process. Therefore, manual operation by the user is not required, which is convenient for the user to use. Further, by setting the first preset condition, the timeliness of ending the intercom request or ending the intercom can be improved, resources can be released in time, and the impact on the user's use of other devices can be reduced. Wait.
  • the present application also provides another method for controlling a visiting intercom, which is applied to an electronic device, and the method for controlling a visiting intercom includes:
  • the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
  • the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiver.
  • this embodiment combines the intercom voice and image data collected during the intercom process to comprehensively control the intercom.
  • the accuracy is higher.
  • the intercom voice can be collected in real time by the microphone of the intercom requesting end device, or by the microphone of the intercom receiving end device.
  • Determine the intercom voice collected during the intercom process which can be Determine part or all of the collected intercom speech.
  • the image data satisfying the first preset condition may include: no character feature information is detected in the image data, and/or character preset behavior information is detected in the image data.
  • the character feature information may include at least one of face information, human body contour information, and human body infrared information; and the character preset behavior information may include at least one of backward turn information, side turn information, and distance information. Among them, the backward turn information, the side turn information or the distance information can be obtained according to the change of the face information and the body contour information. It is understandable that no character feature information is detected in the image data, which may include but is not limited to:
  • the front profile, the back profile or the side profile can be determined according to the face information.
  • the specific manners in which no character feature information is detected in various image data exemplified above, or the specific manners in which person preset behavior information is detected in various image data, can be used in combination in the case of no conflict.
  • the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end
  • the image data satisfies the first preset condition, including: the image data of the intercom requesting end and/or the image data of the intercom receiving end
  • the data satisfies the first preset condition.
  • the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end
  • the intercom voice satisfies the second preset condition, including: the voice of the intercom requesting end and/or the voice of the intercom receiving end satisfies the second preset condition condition.
  • the intercom voice satisfies the second preset condition, including: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time.
  • the further explanation of the preset keywords and the unacquired intercom voice meeting the preset time can be found in the foregoing description, which will not be repeated here.
  • the electronic device includes: at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server; optionally, the intercom requesting end device includes: a doorbell external unit, a camera, and an external access control unit.
  • the intercom receiver equipment includes: doorbell internal machine, access control internal machine, TV, router, gateway device, customer premise equipment CPE (Customer Premise Equipment), speaker, smart camera, TV box, computer, mobile phone at least one of.
  • CPE Customer Premise Equipment
  • the image data of the intercom requesting end is collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the device of the intercom receiving end.
  • the character information data and/or the device status data determined by the intercom receiving end device determine the playback mode of the image data of the intercom requesting end. Further, determine the playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device, including but not limited to:
  • the intercom process is triggered; or, during the intercom process.
  • the TV or the remote control of the TV collects the second preset voice of the interviewed user at the intercom receiving end, it triggers the end of the intercom.
  • the first preset voice and the second preset voice refer to the previous explanation, and are not repeated here.
  • This embodiment is aimed at the control of the intercom process, mainly for the control of the intercom process. If the technical features in some technical solutions are not further explained or the related technical effects are not described, please refer to the descriptions in the previous relevant parts, which will not be repeated here.
  • the technical solution in this embodiment considers not only the image data but also the intercom voice, and the intercom voice is added to control the intercom, which requires both the image data to satisfy the first preset condition and the intercom voice to satisfy the second preset condition , thus further improving the accuracy of the control.
  • the present application also provides an electronic device 500, comprising: one or more processors 510; a memory 520; one or more application programs, wherein one or more application programs are stored in the memory In 520 and configured to be executed by one or more processors 510, one or more programs are configured to perform the method of any of the above.
  • the present application also provides a computer-readable storage medium 600, where program codes 610 are stored in the computer-readable storage medium 600, and the program codes 610 can be called by the processor to execute any one of the above. item method.
  • the present application further provides a visiting intercom control device 1000 , and the visiting intercom control device 1000 includes:
  • Determining unit 1010 configured to determine the image data collected in the intercom process or the intercom request process, wherein the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
  • the triggering unit 1020 is configured to trigger the end of the intercom process or the end of the intercom request process when the image data satisfies the first preset condition.
  • the triggering unit 1020 includes a detection module configured to determine that the image data satisfies the first preset condition; further, the detection module is configured to determine that the image data does not detect character feature information, and/or the detection unit is configured to determine that a person preset behavior information is detected in the image data.
  • the character feature information includes: at least one of face information, human body contour information, and human body infrared information; and/or, the character preset behavior information includes: at least one of back-turn information, side-turn information, and distance information . Further, the backward turn information or the side turn information or the far away information is obtained according to the change of the face information and the body contour information.
  • the detection module can also be configured to determine that a frame image in the image data has not detected the character feature information; the detection module can also be configured to determine that the image of the continuous preset sampling frame number in the image data has not been detected. The detection module may also be configured to determine that no person feature information is detected in the image data for a first preset time. It can be understood that the detection module can also be configured to determine that the contour of the human body has changed from a frontal contour to a side contour in the image of the number of consecutive sampling frames in the image data; the detection module can also be configured to determine the continuous sampling in the image data.
  • the human body contour changes from the frontal contour to the side contour and then to the back contour in the image of the sampling frame number;
  • the proportion of the contour in the image becomes smaller;
  • the detection module may be further configured to determine that the proportion of the detected human contour in the image of the second consecutive preset sampling frame number in the image data increases in the proportion of the total contour of the human body.
  • the front profile, the back profile or the side profile can be determined according to the face information.
  • the triggering unit 1020 can be configured to be used when the image data of the intercom requesting end and/or the image data of the intercom receiving end satisfy Under the first preset condition, the triggering of the intercom process ends or the triggering of the intercom request process ends.
  • the visiting intercom control device can be applied to electronic devices, and the above electronic device 500 includes, but is not limited to, at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server.
  • the intercom requesting terminal equipment includes but is not limited to: at least one of the doorbell outdoor unit, the camera, and the access control outdoor unit;
  • the intercom receiving end equipment includes but is not limited to: the doorbell indoor unit, the access control indoor unit, TV, router, At least one of gateway equipment, Customer Premise Equipment (CPE), speakers, smart cameras, TV boxes, computers, and mobile phones.
  • CPE Customer Premise Equipment
  • the intercom control device can be applied to electronic equipment, and the electronic device 500 may be one type of device, or multiple devices (two or more), therefore, each unit and module in the intercom control device Can be applied to different electronic devices without conflict.
  • the intercom control device further includes a playback control unit, which can be configured to determine the playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device.
  • the intercom control device can be applied to the intercom requesting end. Although the image data of the intercom requesting end needs to be played on the intercom receiving end, when there are multiple request receiving end devices, the intercom requesting end device can be used to determine which device is used. Intercom receiver device to play. The same applies to the cloud server, which will not be repeated here.
  • the intercom receiving end device determines the playback mode of the image data of the intercom requesting end.
  • the playback control unit can be configured to play the image data of the intercom request terminal through the TV in a picture-in-picture manner when it is determined that the TV is in a running state; the playback control unit can be configured to When it is determined that the TV is in the running state, and it is determined that the interviewed user and the TV are within the preset range, the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; the playback control unit can be configured to When it is determined that the TV is turned off, the image data of the intercom requesting terminal is played through the TV in a full-screen display mode; the playback control unit can be configured to determine whether the TV is turned off, and to determine whether the interviewed user and the TV are related to each other.
  • the playback control unit can be configured to determine that the mobile phone is in the use state of the interviewed user, and then determine to play the intercom request through the mobile phone. and the playback control unit can be configured to play the image data of the intercom requester through the computer when it is determined that the computer is in the use state of the interviewed user.
  • the intercom control device further includes a start intercom control unit, which is configured to be configured to, during the intercom request process, if the TV or the remote control of the TV collects the first preset of the interviewed user at the intercom receiving end. If the voice is set, it will trigger to enter the intercom process; alternatively, the trigger unit 1020 can also be configured to be used for, during the intercom process, if the TV or the remote control of the TV collects the second data of the interviewed user at the intercom receiving end The preset voice will trigger the end of this intercom.
  • the intercom control device can be applied to a TV set, and can also be applied to other electronic devices.
  • the triggering unit 1020 can be configured to trigger the end of the intercom process when the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition. Further, the triggering unit 1020 may be configured to determine that the intercom speech includes a preset keyword or that the intercom speech has not been acquired for a preset time.
  • the above-mentioned visiting intercom control device can intelligently determine whether to trigger the end intercom request or end the intercom according to the image data of the intercom requesting end and/or the intercom receiving end, so that manual operation is not required, which is convenient for users. Use, further, can also be combined with voice control to further facilitate the user's intercom operation.
  • voice control to further facilitate the user's intercom operation.
  • the embodiment of the present application further provides another visiting intercom control device, the intercom control device includes: a determination unit configured to determine the image data collected during the intercom process and the intercom voice collected during the intercom process, wherein , the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, and the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end;
  • the triggering unit 1020 is configured to trigger the end of the intercom process when the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition.
  • the intercom control device corresponds to a related intercom control method, and other parts refer to the method part, which will not be repeated here.
  • FIG. 6, further illustrates a visiting intercom system 2000, which can provide convenient intercom for the visiting user 2010 and the interviewed user 2020. Control the experience.
  • the system 2000 includes a doorbell device 2100 and a TV 2200.
  • the doorbell device 2100 is connected to the TV 2200, either by wired connection or wireless connection.
  • the wireless connection can be connected through a common network or directly through Bluetooth, etc. At least one of them may also be connected to a cloud server (not shown), and may include other intercom receiver devices 2300, such as tablet computers, mobile phones, and the like, in addition to the television set.
  • the system can be used to perform the following steps:
  • the doorbell device 2100 or the TV 2200 determines the image data collected during the intercom process or the intercom request process, and the image data includes the image data of the intercom requester collected by the doorbell device 2100 and/or the intercom reception collected by the TV 2200 end image data;
  • the doorbell device 2100 or the TV 2200 triggers the end of the intercom process or triggers the end of the intercom request process.
  • the image data satisfying the first preset condition further includes: no character feature information is detected in the image data, and/or character preset behavior information is detected in the image data.
  • the character feature information includes: at least one of face information, body contour information, and human body infrared information; and the character preset behavior information includes: at least one of backward turn information, side turn information, and distance information. The backward turn information or the side turn information or the far away information is obtained according to the change of the face information and the body contour information.
  • No character feature information is detected in the image data, including at least one of the following: no character feature information is detected in a frame image in the image data; no character feature information is detected in an image with a continuous preset sampling number of frames in the image data; an image The person characteristic information is not detected in the data for the first preset time.
  • the preset behavior information of the person detected in the image data includes at least one of the following: it is detected that the contour of the human body changes from the frontal contour to the side contour in the image of the continuous sampling frame number in the image data; the image of the continuous sampling frame number in the image data It is detected that the contour of the human body changes from the frontal contour to the side contour and then to the back contour; the proportion of the human contour detected in the image with the first consecutive preset sampling frames in the image data becomes smaller; The proportion of the human body contour detected in the image with the continuous sampling frame number becomes smaller and smaller than the preset proportion; the proportion of the human body contour detected in the image with the continuous preset sampling frame number in the image data to the total human body contour increases.
  • the proportion of the contours of the human body to the total contours of the human body increases and is greater than the preset proportion in the images of the consecutive sampling frames in the image data.
  • the front profile, the back profile or the side profile can be determined according to the face information.
  • the doorbell device or TV when the image data includes the image data collected by the doorbell device and/or the image data collected by the TV, when the image data satisfies the first preset condition, the doorbell device or TV triggers the intercom process to end or triggers the intercom.
  • the request process ends, including: when the image data collected by the doorbell device and/or the image data collected by the TV meet the first preset condition, the doorbell device or the TV triggers the intercom process to end or the trigger intercom request process ends.
  • the above system may further include a cloud server, and the cloud server may be used to determine that the image data satisfies the first preset condition.
  • the cloud server may be used to determine that the image data satisfies the first preset condition.
  • it can also be determined by the doorbell device or the TV set to determine that the image data meets the first preset condition.
  • it can even be determined by other electronic devices and then notified to the doorbell device or TV set to trigger the intercom request.
  • the end of the process or the end of the intercom process is not limited here.
  • the doorbell device or the TV determines the playback mode of the image data collected by the doorbell device according to the character information data and/or the TV state data determined by the TV.
  • the playback method of the image data of the intercom requesting terminal is determined according to the character information data and/or device status data determined by the TV, including but not limited to:
  • the doorbell device or the TV will trigger the intercom process; or, During the intercom process, if the TV or the remote control of the TV collects the second preset voice of the interviewed user at the intercom receiver, the doorbell device or the TV triggers the end of the intercom.
  • the doorbell device or the TV set triggers the intercom process to end.
  • the intercom voice satisfies the second preset condition, including: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time.
  • the image data collected during the intercom process or the intercom request process is determined, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
  • the triggering of the intercom process ends or the triggering of the intercom request process ends Under the first preset condition, the triggering of the intercom process ends or the triggering of the intercom request process ends. Therefore, manual operation by the user is not required, which is convenient for the user to use.
  • the accuracy of the request for ending the intercom or the end of the intercom can be improved, and the resources can be reasonably released to reduce the The impact on the user's use of other devices, etc.
  • each embodiment has its own emphasis. For parts that are not described in detail in a certain embodiment, reference may be made to the relevant descriptions of other embodiments.
  • the disclosed apparatus may be implemented in other manners.
  • the device embodiments described above are only illustrative.
  • the division of the above-mentioned units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated. to another system, or some features can be ignored, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical or other forms.
  • the units described above as separate components may or may not be physically separated, and components shown as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
  • the above-mentioned integrated units if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable memory.
  • the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art, or all or part of the technical solution, and the computer software product is stored in a memory.
  • a computer device which may be a personal computer, a server, or a network device, etc.
  • the aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program codes.
  • ROM read-only memory
  • RAM random access memory
  • mobile hard disk magnetic disk or optical disk and other media that can store program codes.
  • the program can be stored in a computer-readable memory
  • the memory can include: a flash disk , Read-only memory (English: Read-Only Memory, referred to as: ROM), random access device (English: Random Access Memory, referred to as: RAM), magnetic disk or optical disk, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Closed-Circuit Television Systems (AREA)

Abstract

The embodiments of the present application disclose a visitor talkback control method, a talkback control apparatus, a system, an electronic device, and a storage medium. The talkback control method is applied to an electronic device, and comprises: determining image data acquired in a talkback process or a talkback request process, the image data comprising image data of a talkback request end and/or image data of a talkback receiving end; and when the image data satisfies a first preset condition, triggering the talkback process to end or triggering the talkback request process to end, thereby improving convenience of talkback control.

Description

来访对讲控制方法、对讲控制装置、系统、电子设备及存储介质Visiting intercom control method, intercom control device, system, electronic equipment and storage medium
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请要求于2020年12月31日提交中国专利局的申请号为202011629375.6、名称为“来访对讲控制方法、对讲控制装置、系统、电子设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese Patent Application No. 202011629375.6 and titled "Visitor Intercom Control Method, Intercom Control Device, System, Electronic Equipment and Storage Medium" filed with the China Patent Office on December 31, 2020, The entire contents of which are incorporated herein by reference.
技术领域technical field
本申请涉及通信控制领域,更具体而言,涉及一种来访对讲控制方法、对讲控制装置、系统、电子设备及存储介质。The present application relates to the field of communication control, and more particularly, to a visiting intercom control method, an intercom control device, a system, an electronic device and a storage medium.
背景技术Background technique
随着家居用品的智能化,越来越多的智能家居产品组网形成智慧家庭方便用户使用,例如智能门铃,可以进行视频拍摄、监控、对讲等功能,还可以与其他家居产品实现联动,例如进行对讲,但目前的门铃产品在对讲过程中仍然存在一些不方便。With the intelligentization of household items, more and more smart home products are networked to form a smart home for users to use, such as smart doorbells, which can perform functions such as video shooting, monitoring, and intercom, and can also be linked with other home products. For example, intercom, but the current doorbell products still have some inconvenience in the process of intercom.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供一种来访对讲控制方法、对讲控制装置、系统、电子设备及存储介质。提升对讲控制的便捷性。Embodiments of the present application provide a visiting intercom control method, an intercom control device, a system, an electronic device, and a storage medium. Improve the convenience of intercom control.
第一方面,本申请实施例提供一种来访对讲控制方法,应用于电子设备,该方法包括:确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。In a first aspect, an embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device. The method includes: determining image data collected during an intercom process or an intercom request process, and the image data includes an image of an intercom requesting end. data and/or image data of the intercom receiving end; when the image data satisfies the first preset condition, the end of the intercom process is triggered or the process of the intercom request is triggered to end.
第二方面,本申请实施例提供一种来访对讲控制方法,应用于电子设备,该方法包括:确定对讲过程中采集的图像数据和对讲过程中采集的对讲语音,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;对讲语音包括对讲请求端的语音和/或对讲接收端的语音;当图像数据满足第一预设条件且对讲语音满足第二预设条件时,触发对讲过程结束。In a second aspect, an embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device. The method includes: determining image data collected during the intercom process and intercom voice collected during the intercom process, and the image data includes a pair of The image data of the talk requesting end and/or the image data of the intercom receiving end; the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end; when the image data satisfies the first preset condition and the intercom voice satisfies the second preset When the condition is set, the intercom process is triggered to end.
第三方面,本申请实施例提供一种来访对讲控制装置,该对讲控制装置包括:确定单元,被配置用于确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;触发单元,被配置用于当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。In a third aspect, an embodiment of the present application provides a visiting intercom control device, the intercom control device includes: a determining unit configured to determine image data collected during an intercom process or an intercom request process, and the image data includes: The image data of the intercom requesting end and/or the image data of the intercom receiving end; the triggering unit is configured to trigger the end of the intercom process or the end of the intercom request process when the image data satisfies the first preset condition.
第四方面,本申请实施例提供一种来访对讲控制装置,该对讲控制装置包括:确定单元,被配置用于确定对讲过程中采集的图像数据和对讲过程中采集的对讲语音,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据,对讲语音包括对讲请求端的语音和/或对讲接收端的语音;触发单元,被配置用于当图像数据满足第一预设条件且对讲语音满足第二预设条件时,触发对讲过程结束。In a fourth aspect, an embodiment of the present application provides a visiting intercom control device, the intercom control device includes: a determining unit configured to determine image data collected during the intercom process and intercom voice collected during the intercom process , the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, and the intercom voice includes the speech of the intercom requesting end and/or the speech of the intercom receiving end; the triggering unit is configured to be used when the image data satisfies the first When a preset condition and the intercom voice satisfies the second preset condition, the end of the intercom process is triggered.
第五方面,本申请实施例提供一种来访对讲系统,该系统包括门铃设备、电视机,门铃设备与电视机连接,门铃设备或电视机被配置用于确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括门铃设备采集的对讲请求端的图像数据和/或电视机采集的对讲接收端的图像数据;当图像数据满足第一预设条件时,门铃设备或电视机被配置用于触发对讲过程结束或者触发对讲请求过程结束。In a fifth aspect, an embodiment of the present application provides a visiting intercom system, the system includes a doorbell device and a TV, the doorbell device is connected to the TV, and the doorbell device or the TV is configured to determine an intercom process or an intercom request The image data collected in the process, the image data includes the image data of the intercom requesting end collected by the doorbell device and/or the image data of the intercom receiving end collected by the TV; when the image data meets the first preset condition, the doorbell device or TV It is configured to trigger the end of the intercom procedure or to trigger the end of the intercom request procedure.
第六方面,本申请实施例提供一种电子设备,包括:一个或多个处理器;存储器;一个或多个应用程序,其中一个或多个应用程序被存储在存储器中并被配置为由一个或多个处理器执行,一个或多个程序配置用于执行如第一方面或第二方面相关的任一项的方法。In a sixth aspect, embodiments of the present application provide an electronic device, including: one or more processors; a memory; and one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by a or multiple processors executing, one or more programs configured to perform a method as related to any one of the first aspect or the second aspect.
第七方面,本申请实施例提供一种计算机可读取存储介质,计算机可读取存储介质中 存储有程序代码,程序代码可被处理器调用执行如第一方面或第二方面相关的任一项的方法。In a seventh aspect, an embodiment of the present application provides a computer-readable storage medium, where a program code is stored in the computer-readable storage medium, and the program code can be called by a processor to execute any one related to the first aspect or the second aspect. item method.
附图说明Description of drawings
本申请的上述和/或附加的方面和优点从结合下面附图对实施方式的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present application will become apparent and readily understood from the following description of embodiments taken in conjunction with the accompanying drawings, wherein:
图1是本申请实施例提供的一种来访对讲控制方法的流程示意图;1 is a schematic flowchart of a visiting intercom control method provided by an embodiment of the present application;
图2是本申请实施例提供的另一种来访对讲控制方法的流程示意图;2 is a schematic flowchart of another visiting intercom control method provided by an embodiment of the present application;
图3是本申请实施例提供的一种电子设备的示意图;3 is a schematic diagram of an electronic device provided by an embodiment of the present application;
图4是本申请实施例提供的一种计算机可读存储介质的示意图;4 is a schematic diagram of a computer-readable storage medium provided by an embodiment of the present application;
图5是本申请实施例提供的一种来访对讲控制装置功能单元框图;5 is a block diagram of functional units of a visiting intercom control device provided by an embodiment of the present application;
图6是本申请实施例提供的一种来访对讲系统架构示意图;6 is a schematic diagram of the architecture of a visiting intercom system provided by an embodiment of the present application;
图7是本申请实施例提供的一种来访对讲系统的流程示意图。FIG. 7 is a schematic flowchart of a visiting intercom system provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述。显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those skilled in the art without creative work fall within the protection scope of the present application.
在本申请的描述中,需要理解的是,术语“第一”、“第二”等仅用于描述目的,而不能理解为指示或暗示相对重要性。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本申请中的具体含义。此外,在本申请的描述中,除非另有说明,“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。本申请中的步骤编号仅用于举例,可能对应不同的实施方式,在不冲突的情况下,不限制其顺序。In the description of the present application, it should be understood that the terms "first", "second" and the like are used for descriptive purposes only, and should not be construed as indicating or implying relative importance. For those of ordinary skill in the art, the specific meanings of the above terms in this application can be understood in specific situations. Also, in the description of the present application, unless otherwise specified, "a plurality" means two or more. "And/or", which describes the association relationship of the associated objects, means that there can be three kinds of relationships, for example, A and/or B, which can mean that A exists alone, A and B exist at the same time, and B exists alone. The character "/" generally indicates that the associated objects are an "or" relationship. The step numbers in the present application are only used for example, may correspond to different embodiments, and the sequence is not limited unless there is conflict.
本申请实施例提供一种来访对讲控制方法,应用于电子设备,电子设备包括:对讲请求端设备、对讲接收端设备、云端服务器中的至少一种;其中,对讲请求端设备可以理解为来访对讲中的室外设备,主要用于控制发起对讲,可以包括:门铃外机(可包括图像采集单元)、摄像头、门禁外机中的至少一种,其中摄像头可以是猫眼摄像头也可以是监控摄像头等,对此不做限制。对讲接收端设备可以理解为来访对讲中的室内设备,例如智能家居设备,主要用于控制接受对讲,可以包括:门铃内机、门禁内机、电视机、路由器、网关设备、客户前置设备CPE(Customer Premise Equipment)、音箱、智能摄像头、电视盒、电脑、手机中的至少一种。An embodiment of the present application provides a visiting intercom control method, which is applied to an electronic device. The electronic device includes at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server; wherein the intercom requesting end device may be It is understood as an outdoor device in the visiting intercom, which is mainly used to control and initiate the intercom. It can include: at least one of a doorbell outdoor unit (which may include an image acquisition unit), a camera, and an access control outdoor unit, where the camera can be a cat-eye camera. It can be a surveillance camera, etc., which is not limited. The intercom receiving end device can be understood as the indoor device in the visiting intercom, such as smart home equipment, which is mainly used to control the receiving intercom, which can include: doorbell indoor unit, access control indoor unit, TV, router, gateway device, customer front At least one of the CPE (Customer Premise Equipment), speaker, smart camera, TV box, computer, and mobile phone.
可以理解的,在当有人来访时,发起对讲请求,一般由对讲请求端设备发起对讲请求,然后由对讲接收端设备接受对讲请求从而建立对讲,如果没有用户接收对讲请求,一般对讲请求会持续一段时间后结束,即使访客离开,如果还在设置的时间内仍然会继续处于对讲请求状态,造成资源的浪费或对周围环境不必要的干扰。该对讲可以是语音对讲也可以是视频对讲,视频对讲可以是单方视频对讲,即只有一方可以显示视频图像,也可以是双方或多方视频对讲,即可以显示多方的视频图像。由于在对讲中,一般需要用户手动才能结束对讲,例如通过按键结束对讲,不便于用户操作,特别是用户离对讲设备较远的情况下。需要说明的是,用户可以是受访用户,可以是来访用户(例如访客)。It is understandable that when someone visits, an intercom request is initiated. Generally, the intercom requester device initiates the intercom request, and then the intercom receiver device accepts the intercom request to establish the intercom. If no user receives the intercom request , Generally, the intercom request will end after a period of time. Even if the visitor leaves, if it is still in the intercom request state within the set time, it will cause waste of resources or unnecessary interference to the surrounding environment. The intercom can be a voice intercom or a video intercom. The video intercom can be a single-party video intercom, that is, only one party can display video images, or it can be two-party or multi-party video intercom, that is, multiple parties can display video images. . Because in the intercom, it is generally necessary for the user to manually end the intercom, such as ending the intercom by pressing a button, which is inconvenient for the user to operate, especially when the user is far away from the intercom device. It should be noted that the user may be a visited user or a visiting user (eg, a visitor).
为方便用户对讲,请参阅图1,本申请实施例提供的一种来访对讲控制方法,包括:步骤S10.确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;For the convenience of user intercom, please refer to FIG. 1 . A visiting intercom control method provided by an embodiment of the present application includes: step S10. Determine the image data collected during the intercom process or the intercom request process, and the image data includes a pair of Speak the image data of the requester and/or the image data of the intercom receiver;
需要说明的是,对讲过程可以理解为来访用户发起对讲但受访用户还未接受对讲的过 程,例如来访者按压门铃、或门铃设备检测到有人来访等情况下,发起来访请求,对讲请求端或对讲接收端设备生成语音或图像等提示,这个阶段可以认为是对讲请求过程,用于请求和等待对讲接收端受访用户接受对讲;示例的,步骤S10包括:S101.确定对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据。对讲接收端的受访用户在感知到上述提示后,可通过对讲接收端设备接受对讲,从而可以理解为进入对讲过程,也就是说,对讲过程可以理解为受访用户接受了对讲请求后,进入对讲双方可对讲的阶段,在对讲过程中,对讲双方可以通过语音、视频等各种方式进行沟通。示例的,步骤S10包括:S102.确定对讲过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据。其中,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据,可以是只包括对讲请求端的图像数据,或者,只包括对讲接收端的图像数据,或者,包括对讲请求端的图像数据和对讲接收端的图像数据。It should be noted that the intercom process can be understood as the process in which the visiting user initiates the intercom but the interviewed user has not yet accepted the intercom. The speaking requesting end or the intercom receiving end device generates a prompt such as voice or image. This stage can be considered as an intercom request process, which is used to request and wait for the interviewed user of the intercom receiving end to accept the intercom; for example, step S10 includes: S101 . Determine the image data collected in the intercom request process, the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end. After sensing the above prompt, the interviewed user at the intercom receiver can accept the intercom through the intercom receiver device, which can be understood as entering the intercom process, that is to say, the intercom process can be understood as the interviewed user accepts After the request is made, it enters the stage where both parties can talk to each other. During the intercom process, both parties can communicate through voice, video and other methods. Exemplarily, step S10 includes: S102. Determine the image data collected during the intercom process, where the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end. Wherein, the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, which may only include the image data of the intercom requesting end, or only include the image data of the intercom receiving end, or include the image data of the intercom requesting end. Image data and image data of the intercom receiver.
此外,对讲请求端的图像数据一般是指对讲请求端设备的图像采集单元采集的图像数据,而对讲接收端的图像数据一般是指对讲接收端设备的图像采集单元采集的图像数据,但需要说明的是,也可以是与对讲接收端设备或对讲请求端设备连接的其他设备采集的图像数据,并且对讲接收端设备可能为多个。确定对讲过程中或者对讲请求过程中采集的图像数据,可以是图像采集单元(可以是请求端的也可以是接收端的)采集图像数据,由对讲请求端设备或者对讲接收端设备或者云端服务器等获取的情况,可以是对讲请求端设备、对讲接收端设备的CPU或GPU等获取,传输可以是有线方式也可以是无线方式,采集的设备和执行上述S10的设备可能不同,此外,S10中确定的图像数据可以是对讲过程或对讲请求过程中采集的图像数据中的部分或全部数据。In addition, the image data of the intercom requesting end generally refers to the image data collected by the image acquisition unit of the intercom requesting end device, while the image data of the intercom receiving end generally refers to the image data collected by the image acquisition unit of the intercom receiving end device. It should be noted that, it may also be image data collected by other devices connected to the intercom receiving end device or the intercom requesting end device, and there may be multiple intercom receiving end devices. Determine the image data collected during the intercom process or the intercom request process. It can be an image acquisition unit (either the requester or the receiver) to collect image data, and the intercom requester device or the intercom receiver device or the cloud. In the case of acquisition by the server, etc., it can be acquired by the CPU or GPU of the intercom requesting end device and the intercom receiving end device. The transmission can be wired or wireless. The acquisition device may be different from the device that executes the above S10. In addition, , the image data determined in S10 may be part or all of the image data collected during the intercom process or the intercom request process.
可以理解的,在对讲过程和/或对讲请求过程中,对讲请求端设备的图像采集单元可以实时采集对讲请求端的图像数据,该对讲接收端的图像数据可以是对讲接收端设备的图像采集单元在对讲过程中实时采集的图像数据,其中对讲请求端的图像数据,一般是对讲请求端设备的图像采集单元能够采集到的图像数据,例如摄像头所能拍摄到的范围内的图像数据;对讲接收端的图像数据,一般是对讲接收端设备的图像采集单元能够采集到的图像数据,例如电视机摄像头能够拍摄到的范围内的图像数据。而对讲请求端设备、对讲接收端设备、云端服务器等可以获取上述所采集的图像数据。It can be understood that during the intercom process and/or the intercom request process, the image acquisition unit of the intercom requesting end device can collect the image data of the intercom requesting end in real time, and the image data of the intercom receiving end can be the intercom receiving end device. The image data collected by the image acquisition unit in real time during the intercom process, in which the image data of the intercom requester is generally the image data that can be collected by the image acquisition unit of the intercom requester device, such as the range that can be captured by the camera. The image data of the intercom receiving end is generally the image data that can be collected by the image acquisition unit of the intercom receiving end device, such as the image data within the range that can be captured by a TV camera. The intercom requesting end device, the intercom receiving end device, the cloud server, etc. can obtain the above collected image data.
示例的,对讲请求端的设备为门铃外机,门铃外机可以理解为安装在门外的门铃设备,门铃外机接收到对讲请求指令后启动采集对讲请求端的图像数据并向门铃内机发送对讲请求指令,门铃内机接收到对讲请求指令后发出提示(例如声音提示、也可以是播放对讲请求端的图像数据进行提示、还可以是震动提示等,在此不做限制)以提醒受访用户接听对讲,门铃内机接收到接听对讲的指令后进入对讲过程,以使访客和受访用户可以通过门铃外机和门铃内机进行对讲,门铃内机可以接收门铃外机的图像采集单元(例如摄像头)采集的图像数据,进行图像播放,为了保障私密性,一般情况下对讲接收端的图像不向对讲请求端播放,即访客一般看不到对讲接收端受访用户的图像。除门铃外机可以获取图像数据外,门铃内机(对讲接收端)、云端服务器等也可以接收门铃外机采集的图像数据从而获得相关图像数据,例如门铃外机的图像采集单元实时采集图像数据后通过有线或无线通信的方式发送给门铃内机或云端服务器等;其中门铃内机一般是在室内被使用,也可以将门铃内机的相关功能集成在各种终端设备中,例如电视机、音箱、手机、平板电脑等。示例的,以电视机为例,当接收到对讲请求时,则在电视画面中弹出画面(如果电视机正在使用,还可以在播放的电视画面上显示画中画)以用于显示对讲请求端的图像,电视机接收到受访用户的对讲指令后,建立与门铃端的对讲。For example, the device on the intercom request side is the doorbell external unit, which can be understood as a doorbell device installed outside the door. After receiving the intercom request command, the doorbell external unit starts to collect the image data of the intercom requester and sends it to the doorbell internal unit. Send an intercom request command, and the doorbell internal unit will issue a prompt after receiving the intercom request command (such as a voice prompt, or a prompt by playing the image data of the intercom request terminal, or a vibration prompt, etc., which is not limited here) to Remind the interviewed user to answer the intercom, the doorbell internal unit enters the intercom process after receiving the instruction to answer the intercom, so that the visitor and the interviewed user can intercom through the doorbell external unit and the doorbell internal unit, and the doorbell internal unit can receive the doorbell The image data collected by the image acquisition unit (such as a camera) of the external machine is used for image playback. In order to ensure privacy, the image of the intercom receiving end is generally not played to the intercom requesting end, that is, visitors generally cannot see the intercom receiving end. Image of interviewed user. In addition to the doorbell external unit that can obtain image data, the doorbell internal unit (intercom receiver), cloud server, etc. can also receive the image data collected by the doorbell external unit to obtain relevant image data. For example, the image acquisition unit of the doorbell external unit collects images in real time The data is then sent to the doorbell internal unit or cloud server through wired or wireless communication; the doorbell internal unit is generally used indoors, and the related functions of the doorbell internal unit can also be integrated into various terminal devices, such as TV sets , speakers, mobile phones, tablet computers, etc. For example, taking a TV as an example, when an intercom request is received, a picture will pop up on the TV screen (if the TV is in use, a picture-in-picture can also be displayed on the playing TV screen) to display the intercom. The image of the requesting end, after the TV receives the intercom instruction from the interviewed user, it establishes the intercom with the doorbell end.
需要说明的是,门铃外机的图像采集单元可以是在对讲请求发起时即开始采集图像数据,在对讲开始后继续采集,甚至更早就开始采集,例如门铃外机感应到访客到达预定区域时,启动图像采集单元。因此,该实施例的S10步骤还可以应用于对讲请求过程中, 即在对讲请求过程中,获取对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据;该图像数据可以被发送给对讲接收端也可以不被发送。It should be noted that the image acquisition unit of the doorbell outdoor unit can start to collect image data when the intercom request is initiated, continue to collect after the intercom starts, or even start acquisition earlier. region, start the image acquisition unit. Therefore, step S10 of this embodiment can also be applied to the intercom request process, that is, in the intercom request process, the image data collected in the intercom request process is acquired, and the image data includes the image data of the intercom requesting end; the image data Can be sent to the intercom receiver or not.
S30.当图像数据满足第一预设条件时,触发对讲请求过程结束或者触发对讲过程结束。S30. When the image data satisfies the first preset condition, trigger the intercom request process to end or trigger the intercom process to end.
可以理解的是,上述图像数据可以是对讲过程中或者对讲请求过程中实时采集的对讲请求端的图像数据,或者,对讲接收端的图像数据,或者,对讲请求端的图像数据和对讲接收端的图像数据。也就是说可以是对讲请求端的图像数据满足第一预设条件时,就触发对讲请求过程结束或触发对讲过程结束;也可以是对讲接收端的图像数据满足第一预设条件时,就触发对讲请求过程结束或触发对讲过程结束,还可以是对讲请求端的图像数据和对讲接收端的图像数据均满足第一预设条件时,就触发对讲请求过程结束或触发对讲过程结束。也就是说,当图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据时,若图像数据满足第一预设条件,触发对讲过程结束或者触发对讲请求过程结束,包括:当对讲请求端的图像数据和/或对讲接收端的图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。It can be understood that the above-mentioned image data may be the image data of the intercom requesting end collected in real time during the intercom process or the intercom request process, or the image data of the intercom receiving end, or the image data of the intercom requesting end and the intercom. image data at the receiving end. That is to say, when the image data of the intercom requesting end meets the first preset condition, the end of the intercom request process is triggered or the end of the intercom process is triggered; it can also be that when the image data of the intercom receiving end meets the first preset condition, Triggering the end of the intercom request process or triggering the end of the intercom process, or triggering the end of the intercom request process or triggering the intercom when both the image data of the intercom requester and the image data of the intercom receiver meet the first preset condition Process ends. That is to say, when the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, if the image data satisfies the first preset condition, triggering the end of the intercom process or triggering the end of the intercom request process, including: When the image data of the intercom requesting end and/or the image data of the intercom receiving end satisfy the first preset condition, the end of the intercom process is triggered or the end of the intercom request process is triggered.
需要说明的是,触发对讲过程结束可以理解为触发结束对讲,使得来访用户和受访用户双方无法通过上述对讲设备继续沟通。而触发对讲请求过程结束可以理解为触发结束对讲请求,也就是说,不再发出对讲请求以等待用户接受,这里一般是指受访用户未接受对讲请求的情况下,通过对图像数据分析触发的对讲请求过程结束。It should be noted that triggering the end of the intercom process can be understood as triggering the end of the intercom, so that both the visiting user and the interviewed user cannot continue to communicate through the intercom device. The end of the process of triggering the intercom request can be understood as triggering the end of the intercom request, that is to say, the intercom request is no longer issued to wait for the user to accept it. The intercom request process triggered by data analysis ends.
可以理解的,步骤S30可以包括步骤S301或步骤S302,因此,步骤S10和步骤S30可以包括以下实施方式:It can be understood that step S30 may include step S301 or step S302, therefore, step S10 and step S30 may include the following embodiments:
示例的,一种来访对讲控制方法,包括:An example, a visiting intercom control method, including:
步骤S101.确定对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;Step S101. Determine the image data collected in the intercom request process, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
步骤S301.当图像数据满足第一预设条件时,触发对讲请求过程结束;Step S301. When the image data satisfies the first preset condition, trigger the intercom request process to end;
示例的,一种来访控制方法,包括:Illustratively, an access control method includes:
步骤S102.确定对讲过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;Step S102. Determine the image data collected in the intercom process, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
步骤S302.当图像数据满足第一预设条件时,触发对讲过程结束。Step S302. When the image data satisfies the first preset condition, the intercom process is triggered to end.
一般来说,在对讲请求过程中,根据对讲请求端的图像数据来控制对讲请求可以避免访客离开等情况下,由于对讲请求仍然无意义的保持一定时间,带来的功耗问题,或对周围或室内受访用户的打扰,或者需要访客或受访用户手动关闭带来的不便等。当然,在对讲请求过程中,也可以根据对讲接收端的图像数据来控制对讲请求。例如,当接收端的图像满足第一预设条件时,例如没有包括人物特征信息等,则结束对讲请求,从而避免访客等待。当然,在对讲请求过程中,还可以根据对讲请求端的图像数据和对讲接收端的图像数据来控制对讲请求,可以是两者都满足第一预设条件时,才触发对讲请求过程结束,从而避免受访用户不在接收端设备监控的范围内或对访客或受访用户的特征信息误判等导致的误触发等情况。此外,在对讲过程中,也可以只根据对讲请求端的图像数据来控制对讲,避免访客离开后仍继续处于对讲请求状态等;或者只根据对讲接收端的图像数据来控制对讲,避免受访用户不在,但仍继续让访客等待等;或者结合对讲请求端和对讲接收端的图像数据来控制对讲,从而二者均满足第一预设条件时,才可以触发结束对讲,增强操作的便捷性以及控制的准确性等。Generally speaking, during the intercom request process, controlling the intercom request according to the image data of the intercom requester can prevent the visitor from leaving, etc., because the intercom request is still meaningless for a certain period of time, resulting in power consumption problems. Or the disturbance to the surrounding or indoor interviewed users, or the inconvenience caused by the need for visitors or interviewed users to manually shut down, etc. Of course, during the intercom request process, the intercom request can also be controlled according to the image data of the intercom receiver. For example, when the image of the receiving end satisfies the first preset condition, for example, does not include character feature information, etc., the intercom request is ended, thereby avoiding waiting for the visitor. Of course, during the intercom request process, the intercom request can also be controlled according to the image data of the intercom requesting end and the image data of the intercom receiving end, and the intercom request process can be triggered only when both meet the first preset condition. end, so as to avoid false triggers caused by the visited user being out of the monitoring range of the receiving end device or misjudging the characteristic information of the visitor or the visited user. In addition, during the intercom process, the intercom can also be controlled only according to the image data of the intercom requester, so as to avoid the visitor from continuing to be in the intercom request state after leaving; Avoid the presence of the interviewed user, but still keep the visitor waiting; or control the intercom by combining the image data of the intercom requester and the intercom receiver, so that the end of the intercom can be triggered only when both meet the first preset condition. , enhance the convenience of operation and the accuracy of control.
可以理解的,当图像数据满足第一预设条件时,可以是当获取的图像数据中的一采样帧图像满足第一预设条件(即当前采样帧图像满足第一预设条件)时,触发对讲请求过程结束或触发对讲过程结束,这种方式可以较为快速的触发结束对讲,但可能存在一些误判,例如访客因为某个姿势变化暂时超出了摄像头拍摄的范围。当图像数据满足第一预设条件时,还可以是当连续预设采样帧数图像满足第一预设条件时,这种情况要求 多采样帧图像均满足第一预设条件,从而提升了判断的准确性。当图像数据满足第一预设条件时,还可以是连续第一预设时间的图像数据满足第一预设条件,也就是说不通过帧数来衡量,而是通过持续时间来衡量,也可以提升判断的准确性。在不冲突的情况下,上述方式可以结合使用。It can be understood that when the image data satisfies the first preset condition, it may be when a sampled frame image in the acquired image data satisfies the first preset condition (that is, the current sampled frame image satisfies the first preset condition), the trigger is triggered. The intercom request process ends or the intercom process is triggered. This method can trigger the end of the intercom relatively quickly, but there may be some misjudgments. For example, the visitor temporarily exceeds the range of the camera because of a certain posture change. When the image data satisfies the first preset condition, it may also be when the images of the consecutive preset sampling frames meet the first preset condition. In this case, the multi-sampled frame images all meet the first preset condition, thereby improving the judgment. accuracy. When the image data satisfies the first preset condition, it can also be that the image data for the first preset time continuously satisfies the first preset condition, that is to say, it is not measured by the number of frames, but by the duration. Improve the accuracy of judgment. In the case of no conflict, the above methods can be used in combination.
需要说明的是,采样帧图像可以是从确定的图像数据中选择出部分或全部帧的图像数据进行分析。其中,采样周期可以间隔特定帧数,该特定帧数可以为大于或等于零的正整数,也就是说,可以每一帧都采样分析,或间隔特定帧分析;当然,还可以间隔特定时间进行采样分析。It should be noted that, the sampled frame image may be image data of some or all of the frames selected from the determined image data for analysis. Among them, the sampling period can be separated by a specific number of frames, and the specific number of frames can be a positive integer greater than or equal to zero, that is, every frame can be sampled and analyzed, or analyzed at specific frame intervals; of course, sampling can also be performed at specific time intervals. analyze.
可以理解的,图像数据满足第一预设条件,可以是在图像数据中未检测到人物特征信息,或者,可以是在图像数据中检测到人物预设行为信息等,还可以是,先在图像数据中检测到人物预设行为信息,然后再在图像数据中未检测到人物特征信息,即若在图像数据中检测到人物预设行为信息之后的预设时间段内,在图像数据中未检测到人物特征信息,则判定图像数据满足第一预设条件,表示人物先做出了预设行为的变化,随即离开监控区域的过程等,其中,预设时间段可以根据实际需求而设定,在此不做限定。其中,人物特征信息可以包括:人脸信息、人体轮廓信息、人体红外信息中的至少一种;人物预设行为信息包括:后转信息、侧转信息、远离信息中的至少一种;其中,人脸信息可以包括但不限于五官信息、肤色信息、人脸轮廓信息、瞳孔信息等;人体轮廓信息可以是部分人体轮廓或者全部人体轮廓,例如头部轮廓、上半身轮廓、侧面轮廓、正面轮廓、背面轮廓等。可以理解的,后转信息或侧转信息或离开信息都可以根据人脸信息和人体轮廓信息的变化获得。例如后转信息可以是检测到正面轮廓到侧面轮廓再到背面轮廓的变化确定,侧转信息可以是检测到正面轮廓到侧面轮廓的变化确定的,而远离信息可以是检测到背面轮廓在图像中的比例变化确定的。It can be understood that if the image data satisfies the first preset condition, it may be that no character feature information is detected in the image data, or it may be that the preset behavior information of characters is detected in the image data, or The preset behavior information of the person is detected in the data, and then the person characteristic information is not detected in the image data, that is, if the preset time period after the preset behavior information of the person is detected in the image data, no information is detected in the image data. When the character feature information is reached, it is determined that the image data meets the first preset condition, which means that the character first changes the preset behavior and then leaves the monitoring area. The preset time period can be set according to actual needs. This is not limited. Wherein, the character feature information may include: at least one of face information, body contour information, and human body infrared information; the character preset behavior information includes: at least one of back-turn information, side-turn information, and distance information; wherein, The face information may include, but is not limited to, facial features information, skin color information, face contour information, pupil information, etc.; the human body contour information may be part of the human body contour or the entire human body contour, such as head contour, upper body contour, side contour, frontal contour, Back profile, etc. It can be understood that the backward turn information, the side turn information or the departure information can be obtained according to the changes of the face information and the body contour information. For example, the back turn information may be determined by detecting the change from the front profile to the side profile and then to the back profile, the side turn information may be determined by detecting the change from the front profile to the side profile, and the away information may be detected by detecting the back profile in the image. The proportional change is determined.
示例的,门铃外机的图像采集设备在对讲过程中实时采集图像数据,当在采集的图像数据中检测不到人物特征信息时,可以认为访客已经离开图像采集单元的可采集范围,即访客离开,从而自动结束对讲,无需受访用户或访客主动结束对讲。例如,通过未检测到人脸信息,可以较快速的判断访客将要离开或者已经离开,例如当访客转身后就无法检测到人脸信息,因此自动结束对讲,可以避免用户手动操作,同时最快的让出资源供门铃内机实现其他功能,特别是门铃内机集成在其他设备上时,例如集成在电视机上,可以尽早的结束对讲,方便受访用户继续看电视;又例如集成在音箱上,可以尽早的结束对讲,方便受访用户听音乐等。而将人体轮廓信息作为人物特征信息,当图像数据中未检测到人体轮廓信息时,说明访客已经离开,从而可以更准确的判断对讲是否需要结束,但对于那种可以拍摄较远距离的场景,可能会导致结束不够及时,因此也可以通过检测到人物预设行为信息来判断,或者结合人物特征信息和人物预设行为信息来综合判断,从而能够在对讲请求或者对讲过程中,自动结束对讲请求或结束对讲,无需用户或访客手动关闭,同时也避免访客离开或受访用户离开后对讲或对讲请求仍然保持导致的资源占用等问题。For example, the image acquisition device of the doorbell outdoor unit collects image data in real time during the intercom process. When no character feature information is detected in the collected image data, it can be considered that the visitor has left the collection range of the image acquisition unit, that is, the visitor. Leave, which automatically ends the intercom, without the need for the interviewed user or visitor to end the intercom. For example, by not detecting the face information, it can be quickly determined that the visitor is about to leave or has already left. For example, when the visitor turns around, the face information cannot be detected, so the intercom is automatically ended, which can avoid the user's manual operation, and at the same time, the fastest In particular, when the doorbell internal unit is integrated with other devices, such as a TV, the intercom can be ended as soon as possible, so that the interviewed users can continue to watch TV; another example is integrated in a speaker. On, you can end the intercom as soon as possible to facilitate the interviewed users to listen to music, etc. The human body contour information is used as the character feature information. When the human body contour information is not detected in the image data, it means that the visitor has left, so that it can be more accurately judged whether the intercom needs to end. , which may cause the end to be not timely enough, so it can also be judged by detecting the preset behavior information of the character, or combined with the character characteristic information and the preset behavior information of the character to make a comprehensive judgment, so that it can automatically To end the intercom request or end the intercom, there is no need to manually close the user or visitor, and it also avoids the problem of resource occupation caused by the intercom or intercom request still remaining after the visitor leaves or the interviewed user leaves.
可以理解的,在图像数据中未检测到人物特征信息,可以包括以下至少一种:在图像数据中的一采样帧图像中未检测到人物特征信息;在图像数据中连续预设采样帧数的图像中未检测到人物特征信息;在图像数据中未检测到人物特征信息持续第一预设时间。其中,图像数据的采集可以是实时采集的,在图像数据的一采样帧图像中未检测到人物特征信息可以理解为当前采样帧图像未检测到人物特征信息,则可以触发对讲请求过程结束或对讲过程结束,当然采集和分析可以是两个过程,甚至可是由两个设备来进行。因此,从采集到当前图像数据,到完成对当前图像数据的分析会有一定的时间差,但一般影响较小,并且确定(获取)的图像数据可以是采集的图像数据的部分或全部图像数据,进一步的,用于分析的采样数据可以是该确定(获取)的图像数据的部分或全部图像数据。It can be understood that the character feature information that is not detected in the image data may include at least one of the following: no character feature information is detected in a sampled frame image in the image data; continuous preset sampling frame number in the image data. No person feature information is detected in the image; no person feature information is detected in the image data for the first preset time. Among them, the collection of image data can be collected in real time. If no character feature information is detected in a sampled frame image of the image data, it can be understood that no character feature information is detected in the current sample frame image, which can trigger the end of the intercom request process or At the end of the intercom process, of course, the acquisition and analysis can be two processes, or even carried out by two devices. Therefore, there will be a certain time difference from the acquisition of the current image data to the completion of the analysis of the current image data, but generally the impact is small, and the determined (acquired) image data can be part or all of the acquired image data. Further, the sampled data for analysis may be part or all of the determined (acquired) image data.
此外,预设采样帧数和第一预设时间的设置可以增加检测的准确性,预设采样帧数可以通过计数来完成,每分析完连续预设采样帧数的图像数据就重新计数,当然也可以当确定当前采样帧图像数据未检测到人物特征信息时开始计数,第一预设时间也可以参照类似方式,当然还可以通过其他方式,在此不做限制。In addition, the setting of the preset number of sampling frames and the first preset time can increase the accuracy of detection. The preset number of sampling frames can be completed by counting. After analyzing the image data of the continuous preset number of sampling frames, the count is re-counted. Of course, It can also start counting when it is determined that the current sampling frame image data does not detect the character feature information, and the first preset time can also refer to a similar method, of course, other methods can also be used, which are not limited here.
可以理解的,在图像数据中检测到人物预设行为信息,可以包括以下至少一种:在图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓(可以认为是检测侧转信息的一种方式);在图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓再变为背面轮廓(可以认为是检测后转信息的一种方式);在图像数据中的连续第一预设采样帧数的图像中检测到人体轮廓在图像中的占比变小(可以认为是检测远离信息的一种方式,该方式能够检测出变化趋势,且可在满足连续第一预设采样帧数的图像变小时触发结束动作);图像数据中的连续采样帧数的图像中检测到人体轮廓在图像中的占比变小且小于预设占比(可以认为是检测远离信息的一种方式);图像数据中的连续第二预设采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加(可以认为是检测远离信息的一种方式,该方式能够检测出变化趋势,且可在连续第二预设采样帧数的图像比例增加时触发结束动作);图像数据中的连续预设采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加且大于预设比例(可以认为是检测远离信息的一种方式)。It can be understood that the preset behavior information of the person detected in the image data may include at least one of the following: it is detected that the outline of the human body changes from a frontal outline to a side outline (which can be considered as a A way to detect lateral turning information); in the image of the continuous sampling frame number in the image data, it is detected that the contour of the human body changes from the frontal contour to the side contour and then to the back contour (it can be considered as a way to detect the backward turning information). ); in the image of the continuous first preset sampling frame number in the image data, it is detected that the proportion of the human body contour in the image becomes smaller (it can be considered as a way to detect the distance information, which can detect the change trend, And can trigger the end action when the image that meets the first consecutive preset sampling frame number becomes smaller); in the image with the continuous sampling frame number in the image data, it is detected that the proportion of human silhouette in the image becomes smaller and smaller than the preset proportion (It can be considered as a way to detect the distance information); the proportion of the human body contour detected in the image of the second consecutive preset sampling frame number in the image data to the total human body contour increases (it can be considered as a way to detect the distance information) , this method can detect the change trend, and can trigger the end action when the proportion of the image with the second consecutive preset sampling frame number increases); in the image of the continuous preset sampling frame number in the image data, it is detected that the contour of the human body accounts for all the human body The scale of the contour increases and is larger than a preset scale (which can be considered as a way to detect far-away information).
需要说明的是,在对图像数据进行分析时,可以每隔特定帧数采样一次,该特定帧数可以为大于或等于零的正整数;也可以每隔特定时间采样一次,也就是说,可以对每一帧图像进行分析,也可以采样其中的部分帧图像。由于人物行为可能是一个连续的动作,因此需要对连续采样帧数的图像数据进行分析才能确定。例如检测连续采样帧数的图像数据,当检测到完成预设人物行为后则触发对讲过程结束或对讲请求过程结束,比如,检测后转时,若检测到上述连续采样帧图像中的某一帧为背面轮廓时,则触发对讲过程结束或对讲请求过程结束。而远离行为中,可以通过人体轮廓在拍摄的图像中的占比来确定,一般来说远离时,人体轮廓所包含的面积占图像的总面积会变小,具体的,可以在上述变小的基础上,当所占面积小于预设占比时触发对讲过程结束或对讲请求过程结束;具体的,还可以在上述变小的基础上,计数变小的帧数,例如从开始变小的采样帧开始计数,当变小的采样帧数达到第一预设采样帧数时则触发相关结束动作。另一个角度,由于人远离时,摄像头可能是固定的,因此能够拍到的人体轮廓会增加,例如近的时候只能拍到人头部轮廓、再远一点可以拍到上半身轮廓、更远可能可以拍到人体全部轮廓,因此可以通过这种方式来判断人物的远离,可以检测到这样的一个趋势就触发结束对讲或对讲请求,也可以是在这个趋势的基础上当检测到大于预设比例时就触发对讲过程结束或对讲请求过程结束,还可以通过对增大比例的采样帧数进行计数,例如从开始增大比例的采样帧开始计数,当大于第二预设采样帧数时则触发相关结束动作。可以看到,在判断人物行为时,采用的是采样帧数,由于人物行为相对复杂,直接用采集的连续帧图像可能会增加误判。此外,检测效果可能跟图像采集模块的设置位置,设置方式有关,因此在不冲突的情况下可以结合上面多种方式来检测,提升适用性。It should be noted that when analyzing the image data, sampling can be performed once every specific number of frames, which can be a positive integer greater than or equal to zero; Each frame of image is analyzed, and part of the frame image can also be sampled. Since a character's behavior may be a continuous action, it needs to be determined by analyzing the image data of consecutively sampled frames. For example, the image data of the continuous sampling frame number is detected, and when the preset character behavior is detected, the end of the intercom process or the end of the intercom request process is triggered. When a frame is a back profile, the end of the intercom process or the end of the intercom request process is triggered. In the behavior of moving away, it can be determined by the proportion of the human body contour in the captured image. Generally speaking, when moving away, the area contained by the human body contour will become smaller in the total area of the image. Specifically, it can be reduced in the above Basically, when the occupied area is less than the preset proportion, the end of the intercom process or the end of the intercom request process is triggered; specifically, on the basis of the above reduction, the number of frames can be counted to become smaller, for example, the number of frames becomes smaller from the beginning. The sampling frame count starts to count, and the related end action is triggered when the smaller sampling frame number reaches the first preset sampling frame number. From another angle, since the camera may be fixed when the person is far away, the outline of the human body that can be photographed will increase. For example, when the person is close, only the outline of the head can be photographed, and the outline of the upper body can be photographed further away. The entire outline of the human body can be photographed, so the distance of the person can be judged in this way. When such a trend is detected, the end intercom or intercom request can be triggered. When the ratio is reached, the end of the intercom process or the end of the intercom request process is triggered. You can also count the number of sampling frames that increase the proportion, for example, start counting from the sampling frame that increases the proportion. When the corresponding end action is triggered. It can be seen that when judging the behavior of the character, the sampling frame number is used. Since the behavior of the character is relatively complex, directly using the collected continuous frame images may increase the misjudgment. In addition, the detection effect may be related to the setting position and setting method of the image acquisition module, so in the case of no conflict, the above methods can be combined for detection to improve the applicability.
可以理解的,轮廓可以是头部、上半身、全身等,需要根据图像采集设备能够采集的情况确定,例如与摄像头的设置位置和摄像头的视角广度等有关。此外,正面轮廓或背面轮廓或侧面轮廓能够根据人脸信息确定,特别是正面的大轮廓与背面的大轮廓有可能相似,因此为了区分还可以结合五官等人脸信息来区分。当然,侧转信息、后转信息、远离信息等还可以通过其他的方式来确定。需要说明的是,通过检测到侧转信息,则触发终止对讲请求或终止对讲,可以较快速的响应,但是也会增加误判,例如有可能访客或受访对象(受访用户)只是调整了姿势,而非想结束对讲或对讲请求;而通过检测到后转信息,来触发终止对讲请求或终止对讲,相比侧转信息的可靠性要强,进一步地检测远离信息大概率说明访客离开从而关闭对讲或对讲请求准确性相对更高。但上述效果 差异是相对的,在其他一些场景中效果差异可能不同。It can be understood that the contour can be the head, upper body, whole body, etc., which needs to be determined according to the situation that the image acquisition device can capture, such as the setting position of the camera and the viewing angle width of the camera. In addition, the front profile, back profile or side profile can be determined according to the face information, especially the large front profile may be similar to the back profile, so in order to distinguish, it can also be distinguished by combining facial information such as facial features. Of course, the side turn information, the back turn information, and the distance information can also be determined in other ways. It should be noted that, by detecting the side turn information, it will trigger a request to terminate the intercom or terminate the intercom, which can respond quickly, but it will also increase misjudgment. Adjusted the posture, rather than trying to end the intercom or intercom request; but by detecting the back turn information, to trigger the termination intercom request or terminate the intercom, the reliability is stronger than the side turn information, and it is further detected that the distance information is large. The probability indicates that the visitor leaves to close the intercom or the intercom request is relatively more accurate. However, the above differences in effects are relative and may differ in some other scenarios.
可以理解的,当图像数据包括对讲请求端的图像数据和对讲接收端的图像数据时,若图像数据满足第一预设条件,触发对讲过程结束或者触发对讲请求过程结束,可以包括:当对讲请求端的图像数据满足第一预设条件和对讲接收端的图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束,其中,由于如上所述,图像数据满足第一预设条件的具体实施方式可能有多种,因此,可以理解的,对讲请求端的图像数据满足的第一预设条件和对讲接收端的图像数据满足的第一预设条件可以相同或不同。也就是说,二者满足的具体条件可以是不同的,例如对讲请求端的图像数据满足图像数据中未检测到人物信息,而对讲接收端的图像数据满足图像数据中检测到远离行为等。It can be understood that when the image data includes the image data of the intercom requesting end and the image data of the intercom receiving end, if the image data satisfies the first preset condition, triggering the end of the intercom process or triggering the end of the intercom request process may include: when When the image data of the intercom requesting end satisfies the first preset condition and the image data of the intercom receiving end satisfies the first preset condition, the intercom process is triggered to end or the intercom request process is triggered to end. There may be various specific implementations of the first preset condition. Therefore, it can be understood that the first preset condition satisfied by the image data of the intercom requesting end and the first preset condition satisfied by the image data of the intercom receiving end may be the same or different. That is to say, the specific conditions met by the two may be different, for example, the image data of the intercom requesting end satisfies that no person information is detected in the image data, while the image data of the intercom receiving end satisfies that the distance behavior is detected in the image data, etc.
示例的,当在图像数据中未检测到人脸信息或者人体轮廓信息或者人体红外信息时,可以认为对讲端的人物离开,例如在门铃对讲场景中,在对讲过程中对门铃外机实时拍摄的图像数据进行监测,当在某一采样帧图像或者预设采样帧数图像中检测到人脸信息,则触发对讲过程结束,预设采样帧数图像还可以防止访客在对讲过程中因暂时移动导致发生误判,对此,可以结合判断的准确性和触发的及时性来设置合适的预设采样帧数。For example, when no face information, human body contour information, or human body infrared information is detected in the image data, it can be considered that the person on the intercom side has left. The captured image data is monitored. When face information is detected in a certain sampling frame image or a preset sampling frame number image, the end of the intercom process is triggered. The preset sampling frame number image can also prevent visitors from talking during the intercom process. Misjudgment occurs due to temporary movement. In this regard, an appropriate preset sampling frame number can be set in combination with the accuracy of the judgment and the timeliness of the trigger.
在实际应用中,一般对讲请求端的图像数据由对讲请求端设备采集;对讲接收端的图像数据由对讲接收端的设备采集。对接请求端设备一般可以安装在室外用来获取来访者的信息,而对讲接收端设备可以安装在室内,可以是专门的设备,例如门铃内机,也可以集成在其他电子设备中,如果电视机、音箱等,可以是一个也可以是多个。此外,图像数据的采集和图像数据的分析可以是不同设备来执行。In practical applications, the image data of the intercom requesting end is generally collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the intercom receiving end device. The docking requester device can generally be installed outdoors to obtain visitor information, while the intercom receiver device can be installed indoors. It can be a specialized device, such as a doorbell, or integrated in other electronic devices. There can be one or more than one machine, speakers, etc. Furthermore, acquisition of image data and analysis of image data may be performed by different devices.
需要说明的是,上述例举的对讲请求端设备和对讲接收端设备中,传统的路由器、网关、CPE、音箱或电视盒等设备中可能不具备图像采集功能,但有的上述产品中也可以集成图像采集功能,甚至图像显示功能,例如带摄像头的音箱、带显示屏的音箱等。也就是说,上述列举的对讲接收端设备可以具备图像采集功能,也可以不具备图像采集功能,当其不具备图像采集功能时,如果需要获取对讲接收端的图像数据,可以通过与其连接的其他具备图像采集功能的设备来采集对讲接收端的图像数据,例如智能摄像头。因此,如果某些场景下需要使用对讲接收端的图像数据,则可以认为上述对讲请求端设备或对讲接收端设备具备图像采集功能或者能够从具备图像采集功能的设备获取到对讲接收端的图像数据。It should be noted that among the above-mentioned intercom requesting end devices and intercom receiving end devices, traditional routers, gateways, CPEs, speakers or TV boxes may not have image capture functions, but some of the above products It is also possible to integrate image acquisition functions and even image display functions, such as speakers with cameras, speakers with displays, etc. That is to say, the intercom receiver equipment listed above may or may not have the image acquisition function. When it does not have the image acquisition function, if you need to obtain the image data of the intercom receiver, you can use the Other devices with image acquisition function to collect the image data of the intercom receiver, such as smart cameras. Therefore, if the image data of the intercom receiving end needs to be used in some scenarios, it can be considered that the above-mentioned intercom requesting end device or intercom receiving end device has the image acquisition function or can obtain the intercom receiving end from the device with the image acquisition function. image data.
在实际应用中,可以根据对讲接收端设备确定的人物信息数据或者设备状态数据确定对讲请求端的图像数据的播放方式。具体的,可以包括但不限于以下方式:当确定电视机处于运行状态,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;这种情况下,可以减少对受访用户观看电视的影响;一种情况下,当确定电视机处于运行状态,且确定受访用户与电视机处于预设范围内,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;这种情况下,还有考虑受访用户与电视机的位置关系,从而推断受访用户是否能够获知到对讲请求或者是否方便受访用户进行对讲,该预设范围可以根据受访用户自己需要设置,也可以出厂时默认一个范围,例如默认为电视机摄像头可拍摄到的范围内,也就是说通过电视机摄像头拍摄的图像数据(如对讲接收端的图像数据)来检测是否有受访用户信息,如果有则认为在预设范围内。一种情况,当确定电视机处于关闭状态,则确定通过电视机采用全屏显示的方式播放对讲请求端的图像数据;此处的关闭状态可以是完全关闭,也可以是休眠状态,即电视屏幕不显示图像,若电视机处于完全关闭状态,则启动电视机以使电视机处于运行状态,通过电视机采用全屏显示的方式播放对讲请求端的图像数据,若电视机处于休眠状态,则将电视机唤醒以使电视机处于运行状态,通过电视机采用全屏显示的方式播放对讲请求端的图像数据。因此,可以全屏方式来显示对讲请求端的图像数据,也不会影响受访用户看电视。一种情况,当确定电视机处于关闭状态,且确定受访用户与电视机处于预设范围内,则确定 通过电视机采用全屏显示的方式播放对讲请求端的图像数据;该方法与前面类似都考虑到了受访用户与电视机的位置关系,在此不再赘述。除了电视场景,对讲接收端设备还可以是手机,一种情况,当确定手机处于受访用户使用状态,则确定通过手机播放对讲请求端的图像数据;该情况下,可以通过检测受访用户对手机显示屏的触摸动作或者是否在播放音视频数据等来判断受访用户是否在使用手机,当然也可以利用手机摄像头拍摄的图像数据来判断受访用户是否在使用,如果拍摄的图像数据有受访用户信息,则判定受访用户在使用手机。还可以是电脑场景,包括家用电脑、平板电脑等,当确定电脑处于受访用户使用状态,则确定通过电脑播放对讲请求端的图像数据,使用状态的确定可以采用手机或者电视的方式,在此不再赘述。上述几种方式在不冲突的情况下可以组合使用,以增加针对不同场景的适用性。因此,当受访用户在使用某个对讲接收端设备时,可以优先通过该对讲接收端设备来进行图像数据的播放,从而更快速的提醒受访用户有访客来访,或者更方便用户进行对讲等。In practical applications, the playback mode of the image data of the intercom requesting end can be determined according to the character information data or the device state data determined by the intercom receiving end device. Specifically, it may include but is not limited to the following methods: when it is determined that the TV is in the running state, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; in this case, the viewing of the interviewed users can be reduced. Influence of TV; in one case, when it is determined that the TV is in running state, and it is determined that the interviewed user and the TV are within the preset range, it is determined that the image data of the intercom requester is played through the TV in a picture-in-picture manner; In this case, the positional relationship between the interviewed user and the TV is also considered, so as to infer whether the interviewed user can know the intercom request or whether it is convenient for the interviewed user to intercom. The preset range can be determined according to the interviewed user himself. It needs to be set, or a range can be defaulted at the factory, for example, the default is within the range that can be captured by the TV camera, that is to say, whether there is an interview is detected by the image data captured by the TV camera (such as the image data of the intercom receiver). User information, if any, is considered to be within the preset range. In one case, when it is determined that the TV is in the off state, it is determined that the image data of the intercom requester is played through the TV in a full-screen display; Display the image. If the TV is in a completely off state, start the TV to make the TV run, and play the image data of the intercom request terminal through the TV in a full-screen display mode. If the TV is in a sleep state, turn the TV on. Wake up to make the TV in the running state, and play the image data of the intercom requester through the TV in a full-screen display mode. Therefore, the image data of the intercom requesting terminal can be displayed in a full-screen mode, and it will not affect the interviewed users watching TV. In one case, when it is determined that the TV is turned off, and the interviewed user and the TV are determined to be within the preset range, it is determined that the image data of the intercom requesting end is played through the TV in a full-screen display mode; this method is similar to the previous method. Considering the positional relationship between the interviewed user and the TV set, details are not repeated here. In addition to the TV scene, the intercom receiver device can also be a mobile phone. In one case, when it is determined that the mobile phone is in use by the interviewed user, it is determined to play the image data of the intercom requesting end through the mobile phone; in this case, the interviewed user can be detected by To judge whether the interviewed user is using the mobile phone by touching the screen of the mobile phone or whether it is playing audio and video data. Of course, the image data captured by the mobile phone camera can also be used to determine whether the interviewed user is using it. If the interviewed user information is obtained, it is determined that the interviewed user is using a mobile phone. It can also be a computer scenario, including home computers, tablet computers, etc. When it is determined that the computer is in the use state of the interviewed user, it is determined to play the image data of the intercom requester through the computer, and the use state can be determined by means of a mobile phone or a TV. Here No longer. The above methods can be used in combination without conflict to increase the applicability for different scenarios. Therefore, when the interviewed user is using a certain intercom receiver device, the image data can be played through the intercom receiver device first, so as to quickly remind the interviewed user that there is a visitor, or it is more convenient for the user to play the image data. Intercom etc.
基于前面描述的技术方案,为了进一步方便用户的使用,上述来访对讲控制方法还包括:S20.确定对讲过程中采集的对讲语音。在对讲过程中,步骤S30.上述当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束,可以替换为步骤S40:当图像数据满足第一预设条件且对讲语音满足第二预设条件时,则触发对讲过程结束。也就是说,除了根据图像数据进行判断,还结合对讲语音数据进行判断,从而增加判断的准确性。此处,对讲语音满足第二预设条件,可以包括:对讲语音中包括预设关键词或未获取到对讲语音满足预设时间。例如,检测到语音中有“结束对讲”、“再见”等预设关键词时,则认为对讲语音满足第二预设条件,或者,当对讲双方沟通完毕后,预设时间内没有获取到对讲语音,可以是采集的声音信号中没有语音信号,也可以是采集不到任何声音信号,还可以是采集不到对讲双方的语音信号,也就是说可以识别出刚刚对讲双方的声音信息,从而判断双方是否停止对讲,这种方式在语音环境嘈杂的情况下能够提高准确性。Based on the technical solutions described above, in order to further facilitate the use of the user, the above-mentioned visiting intercom control method further includes: S20. Determine the intercom voice collected during the intercom process. In the intercom process, step S30. The above-mentioned when the image data meets the first preset condition, triggering the end of the intercom process or triggering the intercom request process to end, can be replaced with step S40: when the image data meets the first preset condition and When the intercom voice satisfies the second preset condition, the intercom process is triggered to end. That is to say, in addition to the judgment based on the image data, the judgment is also combined with the intercom voice data, thereby increasing the accuracy of the judgment. Here, the intercom voice satisfies the second preset condition, which may include: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time. For example, when it is detected that there are preset keywords such as "end intercom" and "goodbye" in the speech, it is considered that the intercom speech satisfies the second preset condition, or, after the two parties communicate with each other, there is no such thing as a preset time. If the intercom voice is acquired, it may be that there is no voice signal in the collected sound signal, or it may not be able to collect any sound signal, or it may be that the voice signal of both parties in the intercom cannot be collected, that is to say, it is possible to identify the two parties who have just spoken to each other. The voice information, so as to determine whether the two sides stop the intercom, this method can improve the accuracy in the case of a noisy voice environment.
可以理解的,为了提升使用的便携性,以电视机作为接收端设备为例,在对讲请求过程中,若电视机或电视机的遥控器采集到(如电视上或者遥控器上的麦克风采集)对讲接收端的受访用户的第一预设语音,则触发进入对讲过程;或者,在对讲过程中,若电视机或电视机的遥控器采集到(如电视上或者遥控器上的麦克风采集)对讲接收端的受访用户的第二预设语音,则触发结束本次对讲。其中,第一预设语音可以作为对讲请求,从而触发进入对讲阶段,例如检测到受访用户说“接收对讲”“开启对讲”等预设语音,具体内容在此不做限制。而第二预设语音则可以用于结束对讲,可以是包含“结束对讲”、“再见”等预设关键词的语音。该方法通过语音进行控制,方便用户控制对讲。需要说明的是,此方案中,明确了是电视机或电视机遥控器采集的接收端用户语音,而非对讲语音,从而避免受来访客户的语音影响。而对于用于结束对讲或对讲请求的第二预设语音可以是电视或遥控器采集的受访用户语音,也可以对讲语音,其中对讲语音的方案类似前面关于“对讲语音满足第二预设条件”的描述,在此不再赘述。It can be understood that, in order to improve the portability of use, taking the TV as the receiving end device as an example, during the intercom request process, if the TV or the remote control of the TV collects the data (such as the microphone on the TV or the remote ) the first preset voice of the interviewed user at the intercom receiver, triggers the intercom process; or, during the intercom process, if the TV or the remote The microphone collects) the second preset voice of the interviewed user at the intercom receiving end, which triggers the end of the intercom. Among them, the first preset voice can be used as an intercom request, thereby triggering the entry into the intercom stage. For example, it is detected that the interviewed user said preset voices such as "receive intercom" and "open intercom", and the specific content is not limited here. The second preset voice may be used to end the intercom, and may be a voice containing preset keywords such as "end the intercom" and "goodbye". The method is controlled by voice, which is convenient for the user to control the intercom. It should be noted that, in this solution, it is clarified that the user's voice at the receiving end collected by the TV or TV remote control is not the voice of the intercom, so as to avoid being affected by the voice of the visiting customer. The second preset voice used to end the intercom or the intercom request may be the interviewed user's voice collected by the TV or the remote control, or the intercom voice. The description of the second preset condition" will not be repeated here.
需要说明的是,由于电子设备可以是对讲请求端设备、也可以是对讲接收端设备、还可以是云端服务器,也就是说,步骤S10和S30可以均由对讲请求端设备执行,例如可以由对讲请求端设备的图像采集单元在对讲请求或者对讲过程中采集请求端的图像数据,可以由对讲接收端的图像采集单元在对讲请求或者对讲过程中采集接收端的图像数据,然后由对讲请求端设备(例如其处理器)来确定(可以理解为获取)图像数据(包括对讲请求端的图像数据和/或对讲接收端的图像数据),并当对讲请求端设备确定图像数据满足第一预设条件时,触发对讲请求过程结束或对讲过程结束,此处的触发可以是直接控制结束,或者发出一个信号给对讲接收端设备,由对讲接收端设备控制结束,都可以属于触发的保护范围,对此不做限制。可选的,步骤S10和S30可以均由对讲接收端设备来执行,例如可以由对讲请求端设备的图像采集单元在对讲请求或者对讲过程中采 集请求端的图像数据,可以由对讲接收端的图像采集单元在对讲请求或者对讲过程中采集接收端的图像数据,然后由对讲接收端设备(例如其处理器)来确定(可以理解为获取)图像数据(包括对讲请求端的图像数据和/或对讲接收端的图像数据),并当对讲接收端设备确定图像数据满足第一预设条件时,触发对讲请求过程结束或对讲过程结束,此处的触发可以是直接控制结束,或者发出一个信号给对讲请求端设备,由对讲请求端设备控制结束,以上都可以属于触发的保护范围,对此不做限制。可选的,步骤S10和S30可以均由云端服务器来执行,例如可以由对讲请求端设备的图像采集单元在对讲请求或者对讲过程中采集请求端的图像数据,可以由对讲接收端的图像采集单元在对讲请求或者对讲过程中采集接收端的图像数据,然后由云端服务器来确定(可以理解为获取)图像数据(包括对讲请求端的图像数据和/或对讲接收端的图像数据),并当云端服务器确定图像数据满足第一预设条件时,触发对讲请求过程结束或对讲过程结束,此处的触发可以是发出一个信号给对讲请求端设备或者发出一个信号给对讲接收端设备,由对讲请求端设备或对讲接收端设备控制结束,以上都可以属于触发的保护范围,对此不做限制。需要说明的是,由于电子设备可以包括讲请求端设备、对讲接收端设备、云端服务器中的至少一个,因此S10和S30步骤可以在相同或不同的电子设备上执行。It should be noted that since the electronic device can be an intercom requesting end device, an intercom receiving end device, or a cloud server, that is to say, steps S10 and S30 can both be performed by the intercom requesting end device, for example The image acquisition unit of the intercom requesting end device can collect the image data of the requesting end during the intercom request or the intercom process, and the image data of the receiving end can be collected by the image acquisition unit of the intercom receiving end during the intercom request or the intercom process. Then the intercom requesting end device (such as its processor) determines (can be understood as acquiring) the image data (including the image data of the intercom requesting end and/or the image data of the intercom receiving end), and when the intercom requesting end device determines When the image data meets the first preset condition, the intercom request process is triggered or the intercom process ends. The trigger here can be the end of direct control, or a signal is sent to the intercom receiver device, which is controlled by the intercom receiver device. End, can belong to the protection scope of the trigger, there is no limit to this. Optionally, both steps S10 and S30 may be performed by the intercom receiving end device, for example, the image acquisition unit of the intercom requesting end device may collect the image data of the requesting end during the intercom request or the intercom process, which may be performed by the intercom requesting end device. The image acquisition unit of the receiving end collects the image data of the receiving end during the intercom request or the intercom process, and then the intercom receiving end device (such as its processor) determines (can be understood as acquiring) the image data (including the image of the intercom requesting end). data and/or image data of the intercom receiver), and when the intercom receiver device determines that the image data meets the first preset condition, it triggers the end of the intercom request process or the end of the intercom process, where the trigger can be direct control end, or send a signal to the intercom requester device, which is controlled by the intercom requester device to end, all of the above can belong to the protection scope of the trigger, and there is no restriction on this. Optionally, both steps S10 and S30 may be performed by a cloud server. For example, the image acquisition unit of the intercom requesting end device may collect the image data of the requesting end during the intercom request or the intercom process, and the image data of the intercom receiving end may be collected. The acquisition unit collects the image data of the receiving end during the intercom request or the intercom process, and then the cloud server determines (can be understood as acquiring) the image data (including the image data of the intercom requesting end and/or the image data of the intercom receiving end), And when the cloud server determines that the image data meets the first preset condition, it triggers the end of the intercom request process or the end of the intercom process. The trigger here can be sending a signal to the intercom requesting end device or sending a signal to the intercom receiver. The end device is controlled by the intercom requesting end device or the intercom receiving end device. All of the above can belong to the protection scope of the trigger, and there is no restriction on this. It should be noted that, since the electronic device may include at least one of a talk requesting end device, an intercom receiving end device, and a cloud server, steps S10 and S30 may be performed on the same or different electronic devices.
可以看出,确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。从而可以无需用户手动操作,方便用户使用,进一步的,通过对第一预设条件的设置,可以提升结束对讲请求或结束对讲的及时性,及时释放资源,降低对用户使用其他设备的影响等。It can be seen that the image data collected during the intercom process or the intercom request process is determined, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end; when the image data satisfies the first preset condition, Triggering the end of the intercom process or triggering the end of the intercom request process. Therefore, manual operation by the user is not required, which is convenient for the user to use. Further, by setting the first preset condition, the timeliness of ending the intercom request or ending the intercom can be improved, resources can be released in time, and the impact on the user's use of other devices can be reduced. Wait.
请参阅图2,本申请还提供另一种来访对讲控制方法,应用于电子设备,该来访对讲控制方法包括:Referring to FIG. 2, the present application also provides another method for controlling a visiting intercom, which is applied to an electronic device, and the method for controlling a visiting intercom includes:
S100.确定对讲过程中采集的图像数据和对讲过程中采集的对讲语音,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;对讲语音包括对讲请求端的语音和/或对讲接收端的语音。S100. Determine the image data collected in the intercom process and the intercom voice collected during the intercom process, the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end; the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiver.
需要说明的是,在前面实施例的基础上,本实施例结合对讲过程中采集的对讲语音和图像数据来综合控制对讲,相比仅采用图像数据来控制对讲,在某些情况下其准确性更高,对讲语音可以是对讲请求端设备的麦克风实时采集的,也可以是对讲接收端设备的麦克风实时采集的,确定对讲过程中采集的对讲语音,可以是确定采集的对讲语音中的部分或全部语音。此外,图像数据满足第一预设条件可以包括:图像数据未检测到人物特征信息,和/或,图像数据中检测到人物预设行为信息。人物特征信息可以包括:人脸信息、人体轮廓信息、人体红外信息中的至少一种;而人物预设行为信息可以包括:后转信息、侧转信息、远离信息中的至少一种。其中,后转信息或侧转信息或远离信息可根据人脸信息和人体轮廓信息的变化获得。可以理解的,图像数据中未检测到人物特征信息,可以包括但不限于:It should be noted that, on the basis of the previous embodiment, this embodiment combines the intercom voice and image data collected during the intercom process to comprehensively control the intercom. The accuracy is higher. The intercom voice can be collected in real time by the microphone of the intercom requesting end device, or by the microphone of the intercom receiving end device. Determine the intercom voice collected during the intercom process, which can be Determine part or all of the collected intercom speech. In addition, the image data satisfying the first preset condition may include: no character feature information is detected in the image data, and/or character preset behavior information is detected in the image data. The character feature information may include at least one of face information, human body contour information, and human body infrared information; and the character preset behavior information may include at least one of backward turn information, side turn information, and distance information. Among them, the backward turn information, the side turn information or the distance information can be obtained according to the change of the face information and the body contour information. It is understandable that no character feature information is detected in the image data, which may include but is not limited to:
(1)图像数据中的一采样帧图像未检测到人物特征信息;或者,(1) No character feature information is detected in a sampled frame image in the image data; or,
(2)图像数据中连续预设采样帧数的图像未检测到人物特征信息;或者,(2) No character feature information is detected in the image of the continuous preset sampling frame number in the image data; or,
(3)图像数据中未检测到人物特征信息持续第一预设时间。(3) No person feature information is detected in the image data for the first preset time.
可以理解的,图像数据中检测到人物预设行为信息,可以包括但不限于:It can be understood that the preset behavior information of characters detected in the image data may include but not limited to:
(1)图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓;或者,(1) It is detected that the contour of the human body changes from the frontal contour to the side contour in the image of the consecutive sampling frames in the image data; or,
(2)图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓再变为背面轮廓;或者,(2) It is detected that the human body contour changes from the frontal contour to the side contour and then to the back contour in the image of the consecutive sampling frames in the image data; or,
(3)图像数据中的连续第一预设采样帧数的图像中检测到人体轮廓在图像中的占比变小;或者,(3) It is detected that the proportion of the human body contour in the image in the image of the continuous first preset sampling frame number in the image data becomes smaller; or,
(4)图像数据中的连续采样帧数的图像中检测到人体轮廓在图像中的占比变小 且小于预设占比;或者,(4) In the image of the continuous sampling frame number in the image data, it is detected that the proportion of the human body contour in the image becomes smaller and smaller than the preset proportion; or,
(5)图像数据中的连续预设第二采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加;或者,(5) It is detected that the proportion of the contour of the human body to the total contour of the human body is increased in the image of the consecutive preset second sampling frame number in the image data; or,
(6)图像数据中的连续采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加。(6) The proportion of the human body contour detected in the image of the continuous sampling frame number in the image data to the total human body contour increases.
需要说明的是,其中,正面轮廓或背面轮廓或侧面轮廓能够根据人脸信息确定。上述例举的各种图像数据中未检测到人物特征信息的具体方式,或,各种图像数据中检测到人物预设行为信息的具体方式,在不冲突的情形下可以组合使用。It should be noted that, the front profile, the back profile or the side profile can be determined according to the face information. The specific manners in which no character feature information is detected in various image data exemplified above, or the specific manners in which person preset behavior information is detected in various image data, can be used in combination in the case of no conflict.
S300.当图像数据满足第一预设条件且对讲语音满足第二预设条件时,触发对讲过程结束。S300. When the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition, trigger the end of the intercom process.
需要说明的,当图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据时,图像数据满足第一预设条件,包括:对讲请求端的图像数据和/或对讲接收端的图像数据满足第一预设条件。当对讲语音包括对讲请求端的语音和/或对讲接收端的语音时,对讲语音满足第二预设条件,包括:对讲请求端的语音和/或对讲接收端的语音满足第二预设条件。It should be noted that when the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, the image data satisfies the first preset condition, including: the image data of the intercom requesting end and/or the image data of the intercom receiving end The data satisfies the first preset condition. When the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end, the intercom voice satisfies the second preset condition, including: the voice of the intercom requesting end and/or the voice of the intercom receiving end satisfies the second preset condition condition.
可以理解的,对讲语音满足第二预设条件,包括:对讲语音中包括预设关键词或未获取到对讲语音满足预设时间。其中,预设关键词和未获取到对讲语音满足预设时间的进一步解释详见前面的描述,此处不再赘述。It is understandable that the intercom voice satisfies the second preset condition, including: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time. Wherein, the further explanation of the preset keywords and the unacquired intercom voice meeting the preset time can be found in the foregoing description, which will not be repeated here.
可以理解的,电子设备包括:对讲请求端设备、对讲接收端设备、云端服务器中的至少一种;可选的,对讲请求端设备包括:门铃外机、摄像头、门禁外机中的至少一种;对讲接收端设备包括:门铃内机、门禁内机、电视机、路由器、网关设备、客户前置设备CPE(Customer Premise Equipment)、音箱、智能摄像头、电视盒、电脑、手机中的至少一种。It can be understood that the electronic device includes: at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server; optionally, the intercom requesting end device includes: a doorbell external unit, a camera, and an external access control unit. At least one; the intercom receiver equipment includes: doorbell internal machine, access control internal machine, TV, router, gateway device, customer premise equipment CPE (Customer Premise Equipment), speaker, smart camera, TV box, computer, mobile phone at least one of.
可以理解的,对讲请求端的图像数据由对讲请求端设备采集;对讲接收端的图像数据由对讲接收端的设备采集。It can be understood that the image data of the intercom requesting end is collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the device of the intercom receiving end.
可以理解的,对讲接收端设备确定的人物信息数据和/或设备状态数据确定对讲请求端的图像数据的播放方式。进一步的,根据对讲接收端设备确定的人物信息数据和/或设备状态数据确定对讲请求端的图像数据的播放方式,包括但不限于:It can be understood that the character information data and/or the device status data determined by the intercom receiving end device determine the playback mode of the image data of the intercom requesting end. Further, determine the playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device, including but not limited to:
当确定电视机处于运行状态,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in the running state, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; or,
当确定电视机处于运行状态,且确定受访用户与电视机处于预设范围内,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in the running state, and it is determined that the interviewed user and the TV are within the preset range, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; or,
当确定电视机处于关闭状态,则确定通过电视机采用全屏显示的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in an off state, it is determined that the image data of the intercom requesting end is played through the TV in a full-screen display mode; or,
当确定电视机处于关闭状态,且确定受访用户与电视机处于预设范围内,则确定通过电视机采用全屏显示的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is turned off, and it is determined that the interviewed user and the TV are within the preset range, it is determined to play the image data of the intercom requesting end through the TV in a full-screen display mode; or,
当确定手机处于受访用户使用状态,则确定通过手机播放对讲请求端的图像数据;或者,When it is determined that the mobile phone is in use by the interviewed user, it is determined to play the image data of the intercom requester through the mobile phone; or,
当确定电脑处于受访用户使用状态,则确定通过电脑播放对讲请求端的图像数据。When it is determined that the computer is in the use state of the interviewed user, it is determined that the image data of the intercom requesting end is played through the computer.
需要说明的,上述例举的具体播放方式在不冲突的情况下可以相互组合使用。It should be noted that the specific playback modes exemplified above can be used in combination with each other without conflict.
可以理解的,在对讲请求过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第一预设语音,则触发进入对讲过程;或者,在对讲过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第二预设语音,则触发结束本次对讲。其中第一预设语音和第二预设语音参考前面的解释,在此不再赘述。It is understandable that during the intercom request process, if the TV or the remote control of the TV collects the first preset voice of the interviewed user at the intercom receiving end, the intercom process is triggered; or, during the intercom process. , if the TV or the remote control of the TV collects the second preset voice of the interviewed user at the intercom receiving end, it triggers the end of the intercom. The first preset voice and the second preset voice refer to the previous explanation, and are not repeated here.
本实施例针对对讲过程的控制,主要针对对讲过程的控制,有些技术方案中技术特征未进一步解释的或者相关技术效果未描述的请参考前面相关部分的描述,在此不再赘 述。本实施例中的技术方案不仅考虑了图像数据还考虑了对讲语音,增加了对讲语音来控制对讲,既需要图像数据满足第一预设条件也需要对讲语音满足第二预设条件,因此进一步提升了控制的准确性。This embodiment is aimed at the control of the intercom process, mainly for the control of the intercom process. If the technical features in some technical solutions are not further explained or the related technical effects are not described, please refer to the descriptions in the previous relevant parts, which will not be repeated here. The technical solution in this embodiment considers not only the image data but also the intercom voice, and the intercom voice is added to control the intercom, which requires both the image data to satisfy the first preset condition and the intercom voice to satisfy the second preset condition , thus further improving the accuracy of the control.
可以理解的,请参阅图3,本申请还提供一种电子设备500,包括:一个或多个处理器510;存储器520;一个或多个应用程序,其中一个或多个应用程序被存储在存储器520中并被配置为由一个或多个处理器510执行,一个或多个程序配置用于执行上述任一项的方法。It can be understood that, referring to FIG. 3 , the present application also provides an electronic device 500, comprising: one or more processors 510; a memory 520; one or more application programs, wherein one or more application programs are stored in the memory In 520 and configured to be executed by one or more processors 510, one or more programs are configured to perform the method of any of the above.
可以理解的,请参阅图4,本申请还提供一种计算机可读取存储介质600,该计算机可读取存储介质600中存储有程序代码610,程序代码610可被处理器调用执行上述任一项的方法。It can be understood that, referring to FIG. 4 , the present application also provides a computer-readable storage medium 600, where program codes 610 are stored in the computer-readable storage medium 600, and the program codes 610 can be called by the processor to execute any one of the above. item method.
请参阅图5,本申请还提供一种来访对讲控制装置1000,该来访对讲控制装置1000包括:Referring to FIG. 5 , the present application further provides a visiting intercom control device 1000 , and the visiting intercom control device 1000 includes:
确定单元1010,被配置用于确定对讲过程中或者对讲请求过程中采集的图像数据,其中图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;Determining unit 1010, configured to determine the image data collected in the intercom process or the intercom request process, wherein the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
触发单元1020,被配置用于当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。The triggering unit 1020 is configured to trigger the end of the intercom process or the end of the intercom request process when the image data satisfies the first preset condition.
可以理解的,触发单元1020包括检测模块,被配置用于确定图像数据满足第一预设条件;进一步的,检测模块被配置用于确定图像数据未检测到人物特征信息,和/或,检测单元被配置用于确定图像数据中检测到人物预设行为信息。其中,人物特征信息包括:人脸信息、人体轮廓信息、人体红外信息中的至少一种;和/或,人物预设行为信息包括:后转信息、侧转信息、远离信息中的至少一种。进一步的,后转信息或侧转信息或远离信息根据人脸信息和人体轮廓信息的变化获得。It can be understood that the triggering unit 1020 includes a detection module configured to determine that the image data satisfies the first preset condition; further, the detection module is configured to determine that the image data does not detect character feature information, and/or the detection unit is configured to determine that a person preset behavior information is detected in the image data. Wherein, the character feature information includes: at least one of face information, human body contour information, and human body infrared information; and/or, the character preset behavior information includes: at least one of back-turn information, side-turn information, and distance information . Further, the backward turn information or the side turn information or the far away information is obtained according to the change of the face information and the body contour information.
可以理解的,检测模块还可以被配置用于确定图像数据中的一采用帧图像未检测到人物特征信息;检测模块还可以被配置用于确定图像数据中连续预设采样帧数的图像未检测到人物特征信息;检测模块还可以被配置用于确定图像数据中未检测到人物特征信息持续第一预设时间。可以理解的,检测模块还可以被配置用于确定图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓;检测模块还可以被配置用于确定图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓再变为背面轮廓;检测模块还可以被配置用于确定图像数据中的连续第一预设采样帧数的图像中检测到人体轮廓在图像中的占比变小;检测模块还可以被配置用于确定图像数据中的连续第二预设采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加。其中,正面轮廓或背面轮廓或侧面轮廓能够根据人脸信息确定。It can be understood that the detection module can also be configured to determine that a frame image in the image data has not detected the character feature information; the detection module can also be configured to determine that the image of the continuous preset sampling frame number in the image data has not been detected. The detection module may also be configured to determine that no person feature information is detected in the image data for a first preset time. It can be understood that the detection module can also be configured to determine that the contour of the human body has changed from a frontal contour to a side contour in the image of the number of consecutive sampling frames in the image data; the detection module can also be configured to determine the continuous sampling in the image data. It is detected that the human body contour changes from the frontal contour to the side contour and then to the back contour in the image of the sampling frame number; The proportion of the contour in the image becomes smaller; the detection module may be further configured to determine that the proportion of the detected human contour in the image of the second consecutive preset sampling frame number in the image data increases in the proportion of the total contour of the human body. Wherein, the front profile, the back profile or the side profile can be determined according to the face information.
可以理解的,当图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据时,触发单元1020可以被配置用于当对讲请求端的图像数据和/或对讲接收端的图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。It can be understood that when the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, the triggering unit 1020 can be configured to be used when the image data of the intercom requesting end and/or the image data of the intercom receiving end satisfy Under the first preset condition, the triggering of the intercom process ends or the triggering of the intercom request process ends.
可以理解的,该来访对讲控制装置可以应用于电子设备中,上述电子设备500包括但不限于:对讲请求端设备、对讲接收端设备、云端服务器中的至少一种。其中,对讲请求端设备包括但不限于:门铃外机、摄像头、门禁外机中的至少一种;对讲接收端设备包括但不限于:门铃内机、门禁内机、电视机、路由器、网关设备、客户前置设备CPE(Customer Premise Equipment)、音箱、智能摄像头、电视盒、电脑、手机中的至少一种。由于对讲控制装置能够应用于电子设备中,而该电子设备500可以是一种设备,也可以是多种设备(两种或两种以上),因此,对讲控制装置中的各个单元、模块在不冲突的情况下可以应用于不同的电子设备。It can be understood that the visiting intercom control device can be applied to electronic devices, and the above electronic device 500 includes, but is not limited to, at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server. Among them, the intercom requesting terminal equipment includes but is not limited to: at least one of the doorbell outdoor unit, the camera, and the access control outdoor unit; the intercom receiving end equipment includes but is not limited to: the doorbell indoor unit, the access control indoor unit, TV, router, At least one of gateway equipment, Customer Premise Equipment (CPE), speakers, smart cameras, TV boxes, computers, and mobile phones. Since the intercom control device can be applied to electronic equipment, and the electronic device 500 may be one type of device, or multiple devices (two or more), therefore, each unit and module in the intercom control device Can be applied to different electronic devices without conflict.
可以理解的,对讲请求端的图像数据由对讲请求端设备采集;对讲接收端的图像数据由对讲接收端的设备采集。对讲控制装置还包括播放控制单元,可被配置用于根据对讲接收端设备确定的人物信息数据和/或设备状态数据确定对讲请求端的图像数据的播 放方式。该对讲控制装置可以应用于对讲请求端,虽然对讲请求端的图像数据需要在对讲接收端播放,但当请求接收端设备具有多个时,可以有对讲请求端设备来确定由哪个对讲接收端装置播放。对于应用于云端服务器也是类似情况,在此不再赘述。当对讲控制装置应用于对讲接收端设备时,则由对讲接收端设备来确定对讲请求端的图像数据的播放方式。It can be understood that the image data of the intercom requesting end is collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the device of the intercom receiving end. The intercom control device further includes a playback control unit, which can be configured to determine the playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device. The intercom control device can be applied to the intercom requesting end. Although the image data of the intercom requesting end needs to be played on the intercom receiving end, when there are multiple request receiving end devices, the intercom requesting end device can be used to determine which device is used. Intercom receiver device to play. The same applies to the cloud server, which will not be repeated here. When the intercom control device is applied to the intercom receiving end device, the intercom receiving end device determines the playback mode of the image data of the intercom requesting end.
可以理解的,该播放控制单元,可被配置用于当确定电视机处于运行状态,则通过电视机采用画中画的方式播放对讲请求端的图像数据;该播放控制单元,可被配置用于当确定电视机处于运行状态,且确定受访用户与电视机处于预设范围内,则通过电视机采用画中画的方式播放对讲请求端的图像数据;该播放控制单元,可被配置用于当确定电视机处于关闭状态,则通过电视机采用全屏显示的方式播放对讲请求端的图像数据;该播放控制单元,可被配置用于当确定电视机处于关闭状态,且确定受访用户与电视机处于预设范围内,则通过电视机采用全屏显示的方式播放对讲请求端的图像数据;该播放控制单元,可被配置用于当确定手机处于受访用户使用状态,则确定通过手机播放对讲请求端的图像数据;该播放控制单元,可被配置用于当确定电脑处于受访用户使用状态,则确定通过电脑播放对讲请求端的图像数据。It can be understood that the playback control unit can be configured to play the image data of the intercom request terminal through the TV in a picture-in-picture manner when it is determined that the TV is in a running state; the playback control unit can be configured to When it is determined that the TV is in the running state, and it is determined that the interviewed user and the TV are within the preset range, the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; the playback control unit can be configured to When it is determined that the TV is turned off, the image data of the intercom requesting terminal is played through the TV in a full-screen display mode; the playback control unit can be configured to determine whether the TV is turned off, and to determine whether the interviewed user and the TV are related to each other. If the mobile phone is within the preset range, the image data of the intercom requesting terminal is played in a full-screen display mode through the TV; the playback control unit can be configured to determine that the mobile phone is in the use state of the interviewed user, and then determine to play the intercom request through the mobile phone. and the playback control unit can be configured to play the image data of the intercom requester through the computer when it is determined that the computer is in the use state of the interviewed user.
可以理解的,该对讲控制装置还包括启动对讲控制单元,被配置用于在对讲请求过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第一预设语音,则触发进入对讲过程;或者,该触发单元1020,还可以被配置用于在对讲过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第二预设语音,则触发结束本次对讲。需要说明的,对讲控制装置可以应用于电视机,也可以应用于其他电子设备。It can be understood that the intercom control device further includes a start intercom control unit, which is configured to be configured to, during the intercom request process, if the TV or the remote control of the TV collects the first preset of the interviewed user at the intercom receiving end. If the voice is set, it will trigger to enter the intercom process; alternatively, the trigger unit 1020 can also be configured to be used for, during the intercom process, if the TV or the remote control of the TV collects the second data of the interviewed user at the intercom receiving end The preset voice will trigger the end of this intercom. It should be noted that the intercom control device can be applied to a TV set, and can also be applied to other electronic devices.
可以理解的,在对讲过程中,该触发单元1020,可以被配置用于当图像数据满足第一预设条件且对讲语音满足第二预设条件时,触发对讲过程结束。进一步的,该触发单元1020,可以被配置用于确定对讲语音中包括预设关键词或未获取到对讲语音满足预设时间。It can be understood that during the intercom process, the triggering unit 1020 can be configured to trigger the end of the intercom process when the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition. Further, the triggering unit 1020 may be configured to determine that the intercom speech includes a preset keyword or that the intercom speech has not been acquired for a preset time.
需要说明的是,上述来访对讲控制装置,能够根据对讲请求端和/或对讲接收端的图像数据来智能的确定是否触发结束对讲请求或结束对讲,从而可以无需手动操作,方便用户使用,进一步的,还可以结合语音控制等进一步方便用户的对讲操作。对于该装置部分未细化的技术效果,可参考相关方法部分,在此不再赘述。It should be noted that the above-mentioned visiting intercom control device can intelligently determine whether to trigger the end intercom request or end the intercom according to the image data of the intercom requesting end and/or the intercom receiving end, so that manual operation is not required, which is convenient for users. Use, further, can also be combined with voice control to further facilitate the user's intercom operation. For the technical effect of the device that is not detailed, reference may be made to the related method section, which will not be repeated here.
本申请实施例还提供另一种来访对讲控制装置,该对讲控制装置包括:确定单元,被配置用于确定对讲过程中采集的图像数据和对讲过程中采集的对讲语音,其中,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据,对讲语音包括对讲请求端的语音和/或对讲接收端的语音;The embodiment of the present application further provides another visiting intercom control device, the intercom control device includes: a determination unit configured to determine the image data collected during the intercom process and the intercom voice collected during the intercom process, wherein , the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, and the intercom voice includes the voice of the intercom requesting end and/or the voice of the intercom receiving end;
触发单元1020,被配置用于当图像数据满足第一预设条件且对讲语音满足第二预设条件时,触发对讲过程结束。The triggering unit 1020 is configured to trigger the end of the intercom process when the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition.
该对讲控制装置与相关对讲控制方法对应,其他部分参考方法部分,在此不做赘述。The intercom control device corresponds to a related intercom control method, and other parts refer to the method part, which will not be repeated here.
为了更清楚的解释本申请各实施例或实施方式,请参阅图6,进一步例举一种来访对讲系统2000,该来访对讲系统能够为来访用户2010和受访用户2020提供便捷的对讲控制体验。该系统2000包括门铃设备2100、电视机2200,门铃设备2100与电视机2200连接,可以有线连接也可以无线连接,无线连接可以通过共同的网络连接,也可以通过蓝牙等直接连接,二者中的至少一个还可以与云端服务器(未示出)连接,除了电视机外还可以包括其他对讲接收端设备2300,例如平板电脑、手机等。如图7所示,该系统可以用于执行以下步骤:In order to explain the various embodiments or implementations of the present application more clearly, please refer to FIG. 6, which further illustrates a visiting intercom system 2000, which can provide convenient intercom for the visiting user 2010 and the interviewed user 2020. Control the experience. The system 2000 includes a doorbell device 2100 and a TV 2200. The doorbell device 2100 is connected to the TV 2200, either by wired connection or wireless connection. The wireless connection can be connected through a common network or directly through Bluetooth, etc. At least one of them may also be connected to a cloud server (not shown), and may include other intercom receiver devices 2300, such as tablet computers, mobile phones, and the like, in addition to the television set. As shown in Figure 7, the system can be used to perform the following steps:
S200.门铃设备2100或电视机2200确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括门铃设备2100采集的对讲请求端的图像数据和/或电视机2200采集的对讲接收端的图像数据;S200. The doorbell device 2100 or the TV 2200 determines the image data collected during the intercom process or the intercom request process, and the image data includes the image data of the intercom requester collected by the doorbell device 2100 and/or the intercom reception collected by the TV 2200 end image data;
S400.当图像数据满足第一预设条件时,门铃设备2100或电视机2200触发对讲过程 结束或者触发对讲请求过程结束。S400. When the image data satisfies the first preset condition, the doorbell device 2100 or the TV 2200 triggers the end of the intercom process or triggers the end of the intercom request process.
可以理解的,图像数据满足第一预设条件进一步包括:图像数据未检测到人物特征信息,和/或,图像数据中检测到人物预设行为信息。人物特征信息包括:人脸信息、人体轮廓信息、人体红外信息中的至少一种;人物预设行为信息包括:后转信息、侧转信息、远离信息中的至少一种。后转信息或侧转信息或远离信息根据人脸信息和人体轮廓信息的变化获得。图像数据中未检测到人物特征信息,包括以下至少一种:图像数据中的一采用帧图像未检测到人物特征信息;图像数据中连续预设采样帧数的图像未检测到人物特征信息;图像数据中未检测到人物特征信息持续第一预设时间。图像数据中检测到人物预设行为信息,包括以下至少一种:图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓;图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓再变为背面轮廓;图像数据中的连续第一预设采样帧数的图像中检测到人体轮廓在图像中的占比变小;图像数据中的连续采样帧数的图像中检测到人体轮廓在图像中的占比变小且小于预设占比;图像数据中的连续预设采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加;图像数据中的连续采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加且大于预设比例。其中,正面轮廓或背面轮廓或侧面轮廓能够根据人脸信息确定。It can be understood that the image data satisfying the first preset condition further includes: no character feature information is detected in the image data, and/or character preset behavior information is detected in the image data. The character feature information includes: at least one of face information, body contour information, and human body infrared information; and the character preset behavior information includes: at least one of backward turn information, side turn information, and distance information. The backward turn information or the side turn information or the far away information is obtained according to the change of the face information and the body contour information. No character feature information is detected in the image data, including at least one of the following: no character feature information is detected in a frame image in the image data; no character feature information is detected in an image with a continuous preset sampling number of frames in the image data; an image The person characteristic information is not detected in the data for the first preset time. The preset behavior information of the person detected in the image data includes at least one of the following: it is detected that the contour of the human body changes from the frontal contour to the side contour in the image of the continuous sampling frame number in the image data; the image of the continuous sampling frame number in the image data It is detected that the contour of the human body changes from the frontal contour to the side contour and then to the back contour; the proportion of the human contour detected in the image with the first consecutive preset sampling frames in the image data becomes smaller; The proportion of the human body contour detected in the image with the continuous sampling frame number becomes smaller and smaller than the preset proportion; the proportion of the human body contour detected in the image with the continuous preset sampling frame number in the image data to the total human body contour increases. ; It is detected that the proportion of the contours of the human body to the total contours of the human body increases and is greater than the preset proportion in the images of the consecutive sampling frames in the image data. Wherein, the front profile, the back profile or the side profile can be determined according to the face information.
可以理解的,当图像数据包括门铃设备采集的图像数据和/或电视机采集的图像数据时,当图像数据满足第一预设条件时,门铃设备或电视机触发对讲过程结束或者触发对讲请求过程结束,包括:当门铃设备采集的图像数据和/或电视机采集的图像数据满足第一预设条件时,门铃设备或电视机触发对讲过程结束或者触发对讲请求过程结束。It can be understood that when the image data includes the image data collected by the doorbell device and/or the image data collected by the TV, when the image data satisfies the first preset condition, the doorbell device or TV triggers the intercom process to end or triggers the intercom. The request process ends, including: when the image data collected by the doorbell device and/or the image data collected by the TV meet the first preset condition, the doorbell device or the TV triggers the intercom process to end or the trigger intercom request process ends.
上述系统还可以包括云端服务器,该云端服务器可以用于确定图像数据满足第一预设条件。当然,确定图像数据满足第一预设条件也可以是该门铃设备或电视机来确定,除了云端服务器外,甚至可以是其他电子设备确定后再通知给门铃设备或电视机,以触发对讲请求过程结束或对讲过程结束,在此不做限制。The above system may further include a cloud server, and the cloud server may be used to determine that the image data satisfies the first preset condition. Of course, it can also be determined by the doorbell device or the TV set to determine that the image data meets the first preset condition. In addition to the cloud server, it can even be determined by other electronic devices and then notified to the doorbell device or TV set to trigger the intercom request. The end of the process or the end of the intercom process is not limited here.
进一步的,该门铃设备或电视机根据电视机确定的人物信息数据和/或电视机状态数据确定门铃设备采集的图像数据的播放方式。根据电视机确定的人物信息数据和/或设备状态数据确定对讲请求端的图像数据的播放方式,包括但不限于:Further, the doorbell device or the TV determines the playback mode of the image data collected by the doorbell device according to the character information data and/or the TV state data determined by the TV. The playback method of the image data of the intercom requesting terminal is determined according to the character information data and/or device status data determined by the TV, including but not limited to:
当确定电视机处于运行状态,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in the running state, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; or,
当确定电视机处于运行状态,且确定受访用户与电视机处于预设范围内,则确定通过电视机采用画中画的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in the running state, and it is determined that the interviewed user and the TV are within the preset range, it is determined that the image data of the intercom requesting terminal is played through the TV in a picture-in-picture manner; or,
当确定电视机处于关闭状态,则确定通过电视机采用全屏显示的方式播放对讲请求端的图像数据;或者,When it is determined that the TV is in an off state, it is determined that the image data of the intercom requesting end is played through the TV in a full-screen display mode; or,
当确定电视机处于关闭状态,且确定受访用户与电视机处于预设范围内,则确定通过电视机采用全屏显示的方式播放对讲请求端的图像数据。When it is determined that the TV is in an off state, and it is determined that the interviewed user and the TV are within a preset range, it is determined that the image data of the intercom requesting end is played through the TV in a full-screen display manner.
可以理解的,在对讲请求过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第一预设语音,则门铃设备或电视机触发进入对讲过程;或者,在对讲过程中,若电视机或电视机的遥控器采集到对讲接收端的受访用户的第二预设语音,则门铃设备或电视机触发结束本次对讲。It is understandable that during the intercom request process, if the TV or the remote control of the TV collects the first preset voice of the interviewed user at the intercom receiver, the doorbell device or the TV will trigger the intercom process; or, During the intercom process, if the TV or the remote control of the TV collects the second preset voice of the interviewed user at the intercom receiver, the doorbell device or the TV triggers the end of the intercom.
可以理解的,当图像数据满足第一预设条件且对讲语音满足第二预设条件时,则门铃设备或电视机触发对讲过程结束。其中,对讲语音满足第二预设条件,包括:对讲语音中包括预设关键词或未获取到对讲语音满足预设时间。It can be understood that when the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition, the doorbell device or the TV set triggers the intercom process to end. Wherein, the intercom voice satisfies the second preset condition, including: the intercom voice includes a preset keyword or the intercom voice is not acquired for a preset time.
可以看出,本申请提供的实施例,确定对讲过程中或者对讲请求过程中采集的图像数据,图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;当图像数据满足第一预设条件时,触发对讲过程结束或者触发对讲请求过程结束。从而可以无需用户手动操作,方便用户使用,进一步的,通过对第一预设条件、第二预设条件的设置,可 以提升结束对讲请求或结束对讲的准确性,并合理释放资源,降低对用户使用其他设备的影响等。It can be seen that, in the embodiment provided by this application, the image data collected during the intercom process or the intercom request process is determined, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end; Under the first preset condition, the triggering of the intercom process ends or the triggering of the intercom request process ends. Therefore, manual operation by the user is not required, which is convenient for the user to use. Further, by setting the first preset condition and the second preset condition, the accuracy of the request for ending the intercom or the end of the intercom can be improved, and the resources can be reasonably released to reduce the The impact on the user's use of other devices, etc.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例所涉及的动作和模块并不一定是本申请所必须的。It should be noted that, for the sake of simple description, the foregoing method embodiments are all expressed as a series of action combinations, but those skilled in the art should know that the present application is not limited by the described action sequence. Because in accordance with the present application, certain steps may be performed in other orders or concurrently. Secondly, those skilled in the art should also know that the actions and modules involved in the embodiments described in the specification are not necessarily required by the present application.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如上述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。上述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。上述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储器中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储器中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本申请各个实施例上述方法的全部或部分步骤。而前述的存储器包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储器中,存储器可以包括:闪存盘、只读存储器(英文:Read-Only Memory,简称:ROM)、随机存取器(英文:Random Access Memory,简称:RAM)、磁盘或光盘等。In the above-mentioned embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail in a certain embodiment, reference may be made to the relevant descriptions of other embodiments. In the several embodiments provided in this application, it should be understood that the disclosed apparatus may be implemented in other manners. For example, the device embodiments described above are only illustrative. For example, the division of the above-mentioned units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated. to another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical or other forms. The units described above as separate components may or may not be physically separated, and components shown as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment. In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units. The above-mentioned integrated units, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable memory. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art, or all or part of the technical solution, and the computer software product is stored in a memory, Several instructions are included to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the above-mentioned methods in the various embodiments of the present application. The aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program codes. Those skilled in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructing relevant hardware through a program, and the program can be stored in a computer-readable memory, and the memory can include: a flash disk , Read-only memory (English: Read-Only Memory, referred to as: ROM), random access device (English: Random Access Memory, referred to as: RAM), magnetic disk or optical disk, etc.
以上对本申请实施例进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上,本说明书内容不应理解为对本申请的限制。The embodiments of the present application have been introduced in detail above, and the principles and implementations of the present application are described in this paper by using specific examples. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present application; at the same time, for Persons of ordinary skill in the art, based on the idea of the present application, may have changes in the specific implementation manner and application scope. In conclusion, the contents of this description should not be construed as a limitation on the present application.

Claims (25)

  1. 一种来访对讲控制方法,其特征在于,应用于电子设备,所述方法包括:A visiting intercom control method, characterized in that, applied to electronic equipment, the method comprising:
    确定对讲过程中或者对讲请求过程中采集的图像数据,所述图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;Determine the image data collected during the intercom process or the intercom request process, and the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end;
    当所述图像数据满足第一预设条件时,触发所述对讲过程结束或者触发所述对讲请求过程结束。When the image data satisfies the first preset condition, the intercom process is triggered to end or the intercom request process is triggered to end.
  2. 根据权利要求1所述的方法,其特征在于,所述确定对讲过程中或者对讲请求过程中采集的图像数据,当所述图像数据满足第一预设条件时,触发所述对讲过程结束或者触发所述对讲请求过程结束,包括:The method according to claim 1, wherein the image data collected during the intercom process or the intercom request process is determined, and the intercom process is triggered when the image data satisfies a first preset condition End or trigger the end of the intercom request process, including:
    确定对讲过程中采集的图像数据;Determine the image data collected during the intercom;
    当所述图像数据满足第一预设条件时,触发所述对讲过程结束。When the image data satisfies the first preset condition, the intercom process is triggered to end.
  3. 根据权利要求1所述的方法,其特征在于,所述确定对讲过程中或者对讲请求过程中采集的图像数据,当所述图像数据满足第一预设条件时,触发所述对讲过程结束或者触发所述对讲请求过程结束,包括:The method according to claim 1, wherein the image data collected during the intercom process or the intercom request process is determined, and the intercom process is triggered when the image data satisfies a first preset condition End or trigger the end of the intercom request process, including:
    确定对讲请求过程中采集的图像数据;Determine the image data collected during the intercom request;
    当所述图像数据满足第一预设条件时,触发所述对讲请求过程结束。When the image data satisfies the first preset condition, the process of triggering the intercom request ends.
  4. 根据权利要求1-3任一所述的方法,其特征在于,所述图像数据满足第一预设条件包括:The method according to any one of claims 1-3, wherein the image data satisfying the first preset condition comprises:
    在所述图像数据中未检测到人物特征信息。No human characteristic information is detected in the image data.
  5. 根据权利要求1-3任一所述的方法,其特征在于,所述图像数据满足第一预设条件包括:The method according to any one of claims 1-3, wherein the image data satisfying the first preset condition comprises:
    在所述图像数据中检测到人物预设行为信息。Character preset behavior information is detected in the image data.
  6. 根据权利要求1-3任一所述的方法,其特征在于,所述图像数据满足第一预设条件包括:The method according to any one of claims 1-3, wherein the image data satisfying the first preset condition comprises:
    在所述图像数据中未检测到人物特征信息,以及在所述图像数据中检测到人物预设行为信息。No character feature information is detected in the image data, and character preset behavior information is detected in the image data.
  7. 根据权利要求4或6所述的方法,其特征在于,所述人物特征信息包括:人脸信息、人体轮廓信息、人体红外信息中的至少一种。The method according to claim 4 or 6, wherein the character feature information comprises: at least one of human face information, human body contour information, and human body infrared information.
  8. 根据权利要求5或6所述的方法,其特征在于,所述人物预设行为信息包括:后转信息、侧转信息、远离信息中的至少一种。The method according to claim 5 or 6, wherein the character preset behavior information includes at least one of backward turn information, side turn information, and away information.
  9. 根据权利要求8所述的方法,其特征在于,The method of claim 8, wherein:
    所述后转信息或所述侧转信息或所述远离信息根据所述人脸信息和所述人体轮廓信息的变化获得。The backward turn information or the side turn information or the distance information is obtained according to the change of the face information and the body contour information.
  10. 根据权利要求4或6所述的方法,其特征在于,在所述图像数据中未检测到人物特征信息,包括:The method according to claim 4 or 6, wherein no character feature information is detected in the image data, comprising:
    在所述图像数据中的一采样帧图像中未检测到人物特征信息;或者,Character feature information is not detected in a sample frame image in the image data; or,
    在所述图像数据中连续预设采样帧数的图像中未检测到人物特征信息;或者,No person feature information is detected in the image with the preset sampling frame number in the image data; or,
    在所述图像数据中未检测到人物特征信息持续第一预设时间。No person feature information is detected in the image data for a first preset time.
  11. 根据权利要求5或6所述的方法,其特征在于,在所述图像数据中检测到人物预设行为信息,包括:The method according to claim 5 or 6, wherein the preset behavior information of a person is detected in the image data, comprising:
    在所述图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓;或者,It is detected that the contour of the human body changes from a frontal contour to a side contour in the image of the consecutively sampled frames in the image data; or,
    在所述图像数据中的连续采样帧数的图像中检测到人体轮廓从正面轮廓变为侧面轮廓再变为背面轮廓;或者,It is detected that the contour of the human body changes from a frontal contour to a side contour and then to a back contour in the image of the consecutive sampling frames in the image data; or,
    在所述图像数据中的连续第一预设采样帧数的图像中检测到人体轮廓在图像中的占比变小;或者,It is detected that the proportion of the human body contour in the image becomes smaller in the image of the continuous first preset sampling frame number in the image data; or,
    在所述图像数据中的连续采样帧数的图像中检测到人体轮廓在图像中的占比变小且 小于预设占比;或者,In the image of the continuous sampling frame number in the image data, it is detected that the proportion of the human body contour in the image becomes smaller and smaller than the preset proportion; or,
    在所述图像数据中的连续第二预设采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加;或者,It is detected that the proportion of the outline of the human body to the total outline of the human body increases in the images of the second consecutive preset sampling frames in the image data; or,
    在所述图像数据中的连续采样帧数的图像中检测到人体轮廓占人体全部轮廓的比例增加且大于预设比例。It is detected that the proportion of the contours of the human body to the entire contours of the human body increases and is greater than a preset proportion in the images of the consecutively sampled frames in the image data.
  12. 根据权利要求11所述的方法,其特征在于,The method of claim 11, wherein:
    所述正面轮廓或所述背面轮廓或所述侧面轮廓根据人脸信息确定。The front profile or the back profile or the side profile is determined according to face information.
  13. 根据权利要求1所述的方法,其特征在于,当所述图像数据包括对讲请求端的图像数据和所述对讲接收端的图像数据时,所述当所述图像数据满足第一预设条件时,触发所述对讲结束或者触发所述对讲请求结束,包括:当所述对讲请求端的图像数据满足所述第一预设条件和所述对讲接收端的图像数据满足所述第一预设条件时,触发所述对讲过程结束或者触发所述对讲请求过程结束,所述对讲请求端的图像数据满足的所述第一预设条件和所述对讲接收端的图像数据满足的所述第一预设条件相同或不同。The method according to claim 1, wherein when the image data includes the image data of the intercom requesting end and the image data of the intercom receiving end, the when the image data satisfies a first preset condition , triggering the end of the intercom or triggering the end of the intercom request, including: when the image data of the intercom requesting end satisfies the first preset condition and the image data of the intercom receiving end satisfies the first preset condition When setting the conditions, trigger the end of the intercom process or trigger the end of the intercom request process, the first preset condition that is satisfied by the image data of the intercom requesting end and all that are satisfied by the image data of the intercom receiving end. The first preset conditions are the same or different.
  14. 根据权利要求1-13任一所述的方法,其特征在于,所述电子设备包括:对讲请求端设备、对讲接收端设备、云端服务器中的至少一种;The method according to any one of claims 1-13, wherein the electronic device comprises: at least one of an intercom requesting end device, an intercom receiving end device, and a cloud server;
  15. 根据权利要求14所述的方法,其特征在于,所述对讲请求端的图像数据由所述对讲请求端设备采集;所述对讲接收端的图像数据由所述对讲接收端的设备采集。The method according to claim 14, wherein the image data of the intercom requesting end is collected by the intercom requesting end device; the image data of the intercom receiving end is collected by the device of the intercom receiving end.
  16. 根据权利要求14所述的方法,其特征在于,所述方法还包括:根据所述对讲接收端设备确定的人物信息数据和/或设备状态数据确定所述对讲请求端的图像数据的播放方式。The method according to claim 14, wherein the method further comprises: determining a playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device .
  17. 根据权利要求14-16任一所述的方法,其特征在于,所述对讲请求端设备包括:门铃外机、摄像头、门禁外机中的至少一种;所述对讲接收端设备包括:门铃内机、门禁内机、电视机、路由器、网关设备、客户前置设备CPE(Customer Premise Equipment)、音箱、智能摄像头、电视盒、电脑、手机中的至少一种。The method according to any one of claims 14-16, wherein the intercom requesting terminal device comprises: at least one of a doorbell outdoor unit, a camera, and an access control outdoor unit; the intercom receiving terminal device comprises: At least one of the doorbell indoor unit, access control indoor unit, TV, router, gateway device, customer premise equipment (CPE), speaker, smart camera, TV box, computer, and mobile phone.
  18. 根据权利要求17所述的方法,其特征在于,所述根据所述对讲接收端设备确定的人物信息数据和/或设备状态数据确定所述对讲请求端的图像数据的播放方式,包括:The method according to claim 17, wherein, determining the playback mode of the image data of the intercom requester according to the character information data and/or device status data determined by the intercom receiver device, comprising:
    当确定所述电视机处于运行状态,则确定通过所述电视机采用画中画的方式播放所述对讲请求端的图像数据;或者,When it is determined that the television is in the running state, it is determined that the image data of the intercom requester is played by the television in a picture-in-picture manner; or,
    当确定所述电视机处于运行状态,且确定受访用户与所述电视机处于预设范围内,则确定通过所述电视机采用画中画的方式播放所述对讲请求端的图像数据;或者,When it is determined that the television set is in a running state, and it is determined that the interviewed user and the television set are within a preset range, it is determined that the image data of the intercom requesting terminal is played through the television set in a picture-in-picture manner; or ,
    当确定所述电视机处于关闭状态,则确定通过所述电视机采用全屏显示的方式播放所述对讲请求端的图像数据;或者,When it is determined that the television is in an off state, it is determined that the image data of the intercom requester is played through the television in a full-screen display mode; or,
    当确定所述电视机处于关闭状态,且确定所述受访用户与所述电视机处于预设范围内,则确定通过所述电视机采用全屏显示的方式播放所述对讲请求端的图像数据;或者,When it is determined that the television is in an off state, and it is determined that the interviewed user and the television are within a preset range, it is determined that the image data of the intercom requester is played through the television in a full-screen display manner; or,
    当确定所述手机处于受访用户使用状态,则确定通过所述手机播放所述对讲请求端的图像数据;或者,When it is determined that the mobile phone is in the use state of the interviewed user, it is determined to play the image data of the intercom requester through the mobile phone; or,
    当确定所述电脑处于受访用户使用状态,则确定通过所述电脑播放所述对讲请求端的图像数据。When it is determined that the computer is in the use state of the interviewed user, it is determined that the image data of the intercom requesting end is played through the computer.
  19. 根据权利要求17所述的方法,其特征在于,在所述对讲请求过程中,若所述电视机或所述电视机的遥控器采集到所述对讲接收端的受访用户的第一预设语音,则触发进入所述对讲过程;或者,在对讲过程中,若所述电视机或所述电视机的遥控器采集到所述对讲接收端的受访用户的第二预设语音,则触发结束本次对讲。The method according to claim 17, wherein, in the intercom request process, if the television or the remote control of the television collects the first preset of the interviewed user of the intercom receiving end If the voice is set, it will trigger to enter the intercom process; or, during the intercom process, if the TV or the remote control of the TV collects the second preset voice of the interviewed user at the intercom receiver , it triggers the end of the intercom.
  20. 一种来访对讲控制方法,其特征在于,应用于电子设备,所述方法包括:A visiting intercom control method, characterized in that, applied to electronic equipment, the method comprising:
    确定对讲过程中采集的图像数据和所述对讲过程中采集的对讲语音,其中,所述图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据,所述对讲语音包括对讲请求端的语音和/或对讲接收端的语音;Determine the image data collected in the intercom process and the intercom voice collected during the intercom process, wherein the image data includes the image data of the intercom requesting end and/or the image data of the intercom receiving end, and the intercom voice Including the voice of the intercom requester and/or the voice of the intercom receiver;
    当所述图像数据满足第一预设条件且所述对讲语音满足第二预设条件时,触发所述对讲过程结束。When the image data satisfies the first preset condition and the intercom voice satisfies the second preset condition, the intercom process is triggered to end.
  21. 根据权利要求20所述的方法,其特征在于,所述对讲语音满足第二预设条件,包括:The method according to claim 20, wherein the intercom voice satisfies a second preset condition, comprising:
    所述对讲语音中包括预设关键词或未获取到对讲语音满足预设时间。The intercom voice includes preset keywords or the intercom voice is not acquired for a preset time.
  22. 一种来访对讲控制装置,其特征在于,所述对讲控制装置包括:A visiting intercom control device, characterized in that the intercom control device comprises:
    确定单元,被配置用于确定对讲过程中或者对讲请求过程中采集的图像数据,所述图像数据包括对讲请求端的图像数据和/或对讲接收端的图像数据;a determining unit, configured to determine the image data collected during the intercom process or the intercom request process, the image data including the image data of the intercom requesting end and/or the image data of the intercom receiving end;
    触发单元,被配置用于当所述图像数据满足第一预设条件时,触发所述对讲过程结束或者触发所述对讲请求过程结束。a triggering unit, configured to trigger the end of the intercom process or trigger the end of the intercom request process when the image data satisfies a first preset condition.
  23. 一种来访对讲系统,其特征在于,所述系统包括门铃设备、电视机,所述门铃设备与所述电视机连接,所述门铃设备或所述电视机被配置用于确定对讲过程中或者对讲请求过程中采集的图像数据,所述图像数据包括所述门铃设备采集的对讲请求端的图像数据和/或所述电视机采集的对讲接收端的图像数据;当所述图像数据满足第一预设条件时,所述门铃设备或所述电视机被配置用于触发所述对讲过程结束或者触发所述对讲请求过程结束。A visiting intercom system, characterized in that the system includes a doorbell device and a TV, the doorbell device is connected to the TV, and the doorbell device or the TV is configured to determine the intercom process. Or the image data collected in the intercom request process, the image data includes the image data of the intercom requesting end collected by the doorbell device and/or the image data of the intercom receiving end collected by the TV; Under the first preset condition, the doorbell device or the television set is configured to trigger the end of the intercom process or trigger the end of the intercom request process.
  24. 一种电子设备,其特征在于,包括:An electronic device, comprising:
    一个或多个处理器;存储器;one or more processors; memory;
    一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个程序配置用于执行如权利要求1-19或20-21任一项所述的方法。One or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs are configured to perform such as The method of any one of claims 1-19 or 20-21.
  25. 一种计算机可读取存储介质,其特征在于,所述计算机可读取存储介质中存储有程序代码,所述程序代码可被处理器调用执行如权利要求1-19或20-21任一项所述的方法。A computer-readable storage medium, characterized in that the computer-readable storage medium stores program codes, and the program codes can be invoked by a processor to execute any one of claims 1-19 or 20-21 the method described.
PCT/CN2021/140086 2020-12-31 2021-12-21 Visitor talkback control method, talkback control apparatus, system, electronic device, and storage medium WO2022143300A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011629375.6A CN114697611B (en) 2020-12-31 2020-12-31 Visiting intercom control method, intercom control device, system, electronic equipment and storage medium
CN202011629375.6 2020-12-31

Publications (1)

Publication Number Publication Date
WO2022143300A1 true WO2022143300A1 (en) 2022-07-07

Family

ID=82133615

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/140086 WO2022143300A1 (en) 2020-12-31 2021-12-21 Visitor talkback control method, talkback control apparatus, system, electronic device, and storage medium

Country Status (2)

Country Link
CN (1) CN114697611B (en)
WO (1) WO2022143300A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101557332A (en) * 2009-02-17 2009-10-14 刘利华 Intelligent household information management system
CN102610015A (en) * 2012-03-13 2012-07-25 浙江万里学院 Multimedia visual entrance guard system
US20130017812A1 (en) * 2011-07-14 2013-01-17 Colin Foster Remote access control to residential or office buildings
CN104504793A (en) * 2014-12-19 2015-04-08 天津市亚安科技股份有限公司 Intelligent door safety control system and method based on video service
CN106060463A (en) * 2016-06-12 2016-10-26 合肥日进软件技术开发有限公司 Remotely-controllable indoor machine control system for building interphone
CN108156387A (en) * 2018-01-12 2018-06-12 深圳奥比中光科技有限公司 Terminate the device and method of camera shooting automatically by detecting eye sight line
CN108683703A (en) * 2018-04-08 2018-10-19 陕西科技大学 A kind of intelligence message board information interaction system and application method

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5112905B2 (en) * 2008-02-20 2013-01-09 アイホン株式会社 Intercom device
US8842815B2 (en) * 2009-07-29 2014-09-23 Comcast Cable Communications, Llc Identity management and service access for local user group based on network-resident user profiles
US9349129B2 (en) * 2011-10-17 2016-05-24 Yahoo! Inc. Media enrichment system and method
CN104010154B (en) * 2013-02-27 2019-03-08 联想(北京)有限公司 Information processing method and electronic equipment
CN103955970A (en) * 2014-03-25 2014-07-30 京东方科技集团股份有限公司 Entrance guard system and control method thereof
EP3562150A1 (en) * 2018-04-24 2019-10-30 Panasonic Intellectual Property Management Co., Ltd. Intercom system
CN110009779A (en) * 2019-03-28 2019-07-12 武汉恒大智邦科技有限公司 A kind of building conversational system and method
CN110534109B (en) * 2019-09-25 2021-12-14 深圳追一科技有限公司 Voice recognition method and device, electronic equipment and storage medium
CN111405225A (en) * 2020-02-28 2020-07-10 北京爱接力科技发展有限公司 Method, device and system for realizing visual intercom service of access control and intelligent robot

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101557332A (en) * 2009-02-17 2009-10-14 刘利华 Intelligent household information management system
US20130017812A1 (en) * 2011-07-14 2013-01-17 Colin Foster Remote access control to residential or office buildings
CN102610015A (en) * 2012-03-13 2012-07-25 浙江万里学院 Multimedia visual entrance guard system
CN104504793A (en) * 2014-12-19 2015-04-08 天津市亚安科技股份有限公司 Intelligent door safety control system and method based on video service
CN106060463A (en) * 2016-06-12 2016-10-26 合肥日进软件技术开发有限公司 Remotely-controllable indoor machine control system for building interphone
CN108156387A (en) * 2018-01-12 2018-06-12 深圳奥比中光科技有限公司 Terminate the device and method of camera shooting automatically by detecting eye sight line
CN108683703A (en) * 2018-04-08 2018-10-19 陕西科技大学 A kind of intelligence message board information interaction system and application method

Also Published As

Publication number Publication date
CN114697611A (en) 2022-07-01
CN114697611B (en) 2023-07-14

Similar Documents

Publication Publication Date Title
US10514881B2 (en) Information processing device, information processing method, and program
US11570354B2 (en) Display assistant device having a monitoring mode and an assistant mode
US9263044B1 (en) Noise reduction based on mouth area movement recognition
US8665306B2 (en) Communication terminal, communication system, communication method, and medium storing communication control program
TW201923737A (en) Interactive Method and Device
WO2020082902A1 (en) Sound effect processing method for video, and related products
CN105513596B (en) Voice control method and control equipment
US8860771B2 (en) Method and system for making video calls
WO2020076365A1 (en) Display assistant device for home monitoring
US11429192B2 (en) Confidence-based application-specific user interactions
WO2021147480A1 (en) Live broadcast assistance method and electronic device
JP2012186622A (en) Information processing apparatus, information processing method, and program
WO2021190545A1 (en) Call processing method and electronic device
WO2024103926A1 (en) Voice control methods and apparatuses, storage medium, and electronic device
CN106412712A (en) Video playing method and apparatus
CN111630413B (en) Confidence-based application-specific user interaction
US12001614B2 (en) Confidence-based application-specific user interactions
WO2019227552A1 (en) Behavior recognition-based speech positioning method and device
CN113301372A (en) Live broadcast method, device, terminal and storage medium
CN109032554A (en) A kind of audio-frequency processing method and electronic equipment
CN111968680A (en) Voice processing method, device and storage medium
WO2022143300A1 (en) Visitor talkback control method, talkback control apparatus, system, electronic device, and storage medium
US20230179855A1 (en) Display assistant device having a monitoring mode and an assistant mode
CN105244037B (en) Audio signal processing method and device
US10838741B2 (en) Information processing device, information processing method, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21914039

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21914039

Country of ref document: EP

Kind code of ref document: A1