CN112992122A - Privacy security protection method and device for television camera - Google Patents

Info

Publication number
CN112992122A
Authority
CN
China
Prior art keywords
sound information
sound
television camera
preset
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110246744.1A
Other languages
Chinese (zh)
Inventor
李康
朱丽华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jovision Technology Co ltd
Original Assignee
Jovision Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Studio Devices (AREA)

Abstract

The application provides a privacy security protection method and device for a television camera. The method comprises: acquiring first sound information collected by an audio collection device, the first sound information indicating that the lens of the television camera is to be shielded; determining, based on the first sound information, whether a light shielding sheet of the television camera is at a first preset position, the light shielding sheet being arranged on a moving plate of the television camera and the moving plate being arranged in a housing of the television camera; and, when the light shielding sheet is not at the first preset position, generating a first control instruction and sending it to a driving mechanism, so that the driving mechanism operates according to the first control instruction and moves the moving plate connected to it until the light shielding sheet arranged on the moving plate is at the first preset position, thereby shielding the lens of the television camera. The method improves the user's privacy security and user experience.

Description

Privacy security protection method and device for television camera
Technical Field
The application relates to the technical field of privacy security protection, in particular to a privacy security protection method and equipment for a television camera.
Background
With the development of science and technology and the gradual improvement of living standards, more and more functions are being integrated into televisions to meet growing user demands. Existing smart televisions are often equipped with cameras to support functions such as large-screen video calls, AI fitness, and games.
While the television camera brings convenience to users, it also conceals serious security risks. In particular, televisions are generally installed in areas such as bedrooms and living rooms, so the television camera poses a considerable threat to the privacy of family life.
To protect their privacy, some users cover the television camera directly with an object such as a black cloth, which gives a poor user experience and looks unattractive. Alternatively, some existing television cameras can be rotated to face the wall, but this is cumbersome to operate and does not actually shield the camera, so the risk of leaking user privacy remains.
Therefore, a technical solution for protecting the privacy security of the television camera is urgently needed, so as to improve the user's privacy security and user experience and to prevent leakage of user privacy.
Disclosure of Invention
The embodiments of the present application provide a privacy security protection method and device for a television camera, which are used to solve the problem of user privacy leakage caused by existing television cameras.
In one aspect, the present application provides a privacy security protection method for a television camera, including: acquiring first sound information, the first sound information indicating that the lens of the television camera is to be shielded; determining, based on the first sound information, whether a light shielding sheet of the television camera is at a first preset position, the light shielding sheet being arranged on a moving plate of the television camera and the moving plate being arranged in a housing of the television camera; and, when the light shielding sheet is not at the first preset position, generating a first control instruction and sending it to a driving mechanism, so that the driving mechanism operates according to the first control instruction and moves the moving plate connected to it until the light shielding sheet arranged on the moving plate is at the first preset position, thereby shielding the lens of the television camera.
In the embodiments of the present application, the lens of the television camera is shielded by sound control, which frees the user's hands, improves the user experience, and genuinely shields the television camera. Moreover, because the light shielding sheet is arranged inside the housing of the television camera, the appearance of the television camera is not affected, which further improves the user experience.
In one implementation of the present application, a user level corresponding to the first sound information is determined. An execution time corresponding to the first sound information is determined according to a preset rule and that user level. The acquisition time of the first sound information is obtained, and it is determined whether the acquisition time matches the execution time corresponding to the first sound information. When the acquisition time matches the execution time, it is determined whether the light shielding sheet of the television camera is at the first preset position.
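The check described above can be sketched as follows. This is a minimal illustration only: the application does not disclose the preset rule, so the level names, the time windows, and the function name `acquisition_time_matches` are all hypothetical assumptions.

```python
from datetime import time

# Hypothetical preset rule: each user level maps to a daily time window
# during which that user's shielding command is allowed to execute.
EXECUTION_WINDOWS = {
    "admin": (time(0, 0), time(23, 59, 59)),
    "member": (time(8, 0), time(22, 0)),
}


def acquisition_time_matches(user_level: str, acquired_at: time) -> bool:
    """Return True if the acquisition time lies in the level's execution window."""
    window = EXECUTION_WINDOWS.get(user_level)
    if window is None:
        return False  # unknown user level: the command is never executed
    start, end = window
    return start <= acquired_at <= end
```

Only when this check succeeds would the position of the light shielding sheet be examined, which mirrors the order of operations in the paragraph above.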
In one implementation of the present application, second sound information is acquired, the second sound information indicating that the lens of the television camera is to be opened. Based on the second sound information, it is determined whether the light shielding sheet of the television camera is at a second preset position. When the light shielding sheet is not at the second preset position, a second control instruction is generated and sent to the driving mechanism, so that the driving mechanism operates according to the second control instruction and moves the moving plate connected to it until the light shielding sheet is at the second preset position, thereby opening the lens of the television camera.
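The shield and open flows share one pattern: check the current position of the light shielding sheet, and send a control instruction only if the sheet is not already where it should be. The sketch below illustrates that pattern under stated assumptions; `DriveMechanism`, `Position`, and `handle_sound` are hypothetical names, and the driving mechanism is simulated by simply updating a field.

```python
from dataclasses import dataclass, field
from enum import Enum


class Position(Enum):
    SHIELDED = "first preset position"   # sheet covers the lens
    OPEN = "second preset position"      # sheet is clear of the lens


@dataclass
class DriveMechanism:
    """Hypothetical stand-in for the driving mechanism and moving plate."""
    position: Position = Position.OPEN
    log: list = field(default_factory=list)

    def run(self, instruction: str, target: Position) -> None:
        self.log.append(instruction)
        self.position = target  # the plate moves until the sheet reaches target


def handle_sound(kind: str, drive: DriveMechanism) -> bool:
    """Move the sheet if needed; return True if an instruction was sent."""
    if kind not in ("first", "second"):
        raise ValueError("kind must be 'first' or 'second'")
    target = Position.SHIELDED if kind == "first" else Position.OPEN
    if drive.position is target:
        return False  # sheet already at the required preset position
    drive.run(f"{kind} control instruction", target)
    return True
```

Repeating the same command is therefore a no-op, which matches the "not at the preset position" condition in the text.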
In one implementation of the present application, the number of pieces of first sound information and of second sound information acquired within a corresponding preset time period is determined. When the number of pieces of first sound information is larger than a second preset threshold and/or the number of pieces of second sound information is larger than the second preset threshold, it is determined whether the light shielding sheet of the television camera is at the second preset position. When the light shielding sheet is not at the second preset position, a second control instruction is generated and sent to the driving mechanism so that the light shielding sheet is moved to the second preset position. When the light shielding sheet is at the second preset position, a third control instruction is generated so that the television camera captures a corresponding image and/or video, and the image and/or video is sent to the user terminal for display to the user.
In one implementation of the present application, sound information collected by an audio collection device is acquired, and a first sound feature corresponding to the sound information is extracted. The first sound feature is matched against a preset sound feature, where the preset sound feature comprises at least sound frequency and voiceprint information. When the first sound feature is successfully matched with the preset sound feature, the sound information corresponding to the first sound feature is recognized to determine the first sound information.
In one implementation of the present application, the sound frequency of the sound information is obtained. According to a preset sound frequency interval, the portion of the sound information whose sound frequency lies within that interval is intercepted as the sound information to be extracted, thereby denoising the sound information. Sound features are then extracted from the sound information to be extracted, and the extracted sound features are taken as the first sound feature corresponding to the sound information.
In the embodiments of the present application, a large amount of noise may exist in the environment where the television is located. Setting the sound frequency interval eliminates part of the irrelevant sound, which improves the accuracy of controlling the television camera by sound.
In one implementation of the present application, the sound information corresponding to the first sound feature is converted into a digital signal by an analog-to-digital conversion device, and the digital signal is divided into a plurality of audio blocks. The plurality of audio blocks are matched against the keyword sequences in a preset keyword sequence library, where the keyword sequences are a plurality of preset digital signal sequences. When the audio blocks of the sound information are successfully matched with a keyword sequence in the preset keyword sequence library, the sound information corresponding to the first sound feature is determined to be the first sound information.
By matching the plurality of audio blocks of the sound information against the keyword sequences in the preset keyword sequence library, the present application prevents incorrect sound information from triggering shielding of the television camera and improves the accuracy of sound information recognition.
In one implementation of the present application, feature extraction is performed on the sound information to determine a first voiceprint feature vector of the sound information. Based on a preset voiceprint sample library, it is determined whether the first voiceprint feature vector exists in the voiceprint sample library, where the voiceprint sample library is a set of at least one pre-recorded voiceprint feature vector. When the first voiceprint feature vector exists in the voiceprint sample library, it is determined that the first sound feature is successfully matched with the preset sound feature.
In one implementation of the present application, when the first voiceprint feature vector of the sound information does not exist in the voiceprint sample library, the number of times that sound information corresponding to the first voiceprint feature vector is acquired within a preset time is recorded. When the number of acquisitions within the preset time is greater than or equal to a first preset threshold, warning information is generated and sent to the user terminal. The warning information is text information and/or sound information.
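The counting rule for unrecognized voiceprints can be illustrated with a sliding time window. The following is a minimal sketch, not the application's implementation: the class name `UnknownVoiceMonitor`, the use of timestamps in seconds, and the choice of one monitor per unrecognized voiceprint are assumptions made for illustration.

```python
from collections import deque


class UnknownVoiceMonitor:
    """Counts acquisitions of one unrecognized voiceprint within a preset time."""

    def __init__(self, window_seconds: float, threshold: int):
        self.window_seconds = window_seconds  # the preset time
        self.threshold = threshold            # the first preset threshold
        self._timestamps = deque()

    def record(self, timestamp: float) -> bool:
        """Record one unmatched acquisition; return True if a warning is due."""
        self._timestamps.append(timestamp)
        # Discard acquisitions that fall outside the preset time.
        while timestamp - self._timestamps[0] > self.window_seconds:
            self._timestamps.popleft()
        return len(self._timestamps) >= self.threshold
```

When `record` returns True, the caller would generate the warning information and send it to the user terminal.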
In one implementation of the present application, when the light shielding sheet has moved to the first preset position, first prompt information is generated and sent to an early warning device, so that the early warning device issues early warning information according to the first prompt information, where the early warning information is sound information and/or light information. The first prompt information indicates that the lens of the television camera has been shielded.
In another aspect, the present application further provides a privacy security protection device for a television camera, including: at least one processor; and a memory communicatively coupled to the at least one processor. The memory stores instructions executable by the at least one processor to enable the at least one processor to: acquire first sound information, the first sound information indicating that the lens of the television camera is to be shielded; determine, based on the first sound information, whether a light shielding sheet of the television camera is at a first preset position, the light shielding sheet being arranged on a moving plate of the television camera and the moving plate being arranged in a housing of the television camera; and, when the light shielding sheet is not at the first preset position, generate a first control instruction and send it to a driving mechanism, so that the driving mechanism operates according to the first control instruction and moves the moving plate connected to it until the light shielding sheet arranged on the moving plate is at the first preset position, thereby shielding the lens of the television camera.
In this scheme, the sound information indicating that the television camera is to be shielded or opened is identified from the sound information collected by the audio collection device, and the driving mechanism in the television camera is controlled according to that sound information, so that the moving plate, on which the light shielding sheet is arranged, shields or opens the lens of the television camera. Because the user shields or opens the television camera by voice, the user's hands are freed, shielding or opening the camera becomes more efficient, and the experience of using the television camera is improved. The scheme also extracts sound features based on the voiceprint information and sound frequency of the sound, which prevents the television camera from being operated maliciously and improves the user's privacy security.
Drawings
The accompanying drawings are included to provide a further understanding of the application and constitute a part of this application. The illustrative embodiments and their description serve to explain the application and do not limit it. In the drawings:
Fig. 1 is a schematic structural diagram of a filter switcher of a television camera according to an embodiment of the present application;
Fig. 2 is a flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 3 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 4 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 5 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 6 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 7 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 8 is another flowchart of a privacy security protection method for a television camera according to an embodiment of the present application;
Fig. 9 is a schematic structural diagram of a privacy security protection device for a television camera according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be described in detail and completely with reference to the following specific embodiments of the present application and the accompanying drawings. It should be apparent that the described embodiments are only some of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
With technological progress and rising living standards, more and more functions are being integrated into televisions, bringing convenience to users' lives. Smart televisions are often provided with a camera to meet user needs such as video calls, AI fitness, and games.
While the television camera brings convenience to users, it also conceals serious security risks. Because televisions are generally installed in bedrooms, living rooms, and similar areas, the television camera poses a considerable threat to the privacy of family life. When the television camera is not in use, some users cover it with an object, and others use a rotatable television camera and turn it to face the wall. Both approaches give a poor user experience and protect privacy poorly.
Based on this, the embodiments of the present application provide a privacy security protection method and device for a television camera, which improve the privacy security of a user in front of the television camera and improve the user experience.
Various embodiments of the present application are described in detail below with reference to the accompanying drawings.
The embodiments of the present application provide a privacy security protection method for a television camera. It should be noted that the execution subject in the embodiments of the present application is described by taking a server as an example. Those skilled in the art will appreciate that the execution subject may also be another device, and the present application does not specifically limit this.
In the embodiments of the present application, the television camera is provided with a corresponding filter switcher. Fig. 1 is a schematic structural diagram of the filter switcher according to an embodiment of the present application. As shown in Fig. 1, the filter switcher includes: a moving plate 1, a light shielding sheet 2, an infrared filter 3, a connecting piece 4 of the driving mechanism, a supporting plate 5, sliding rails 6, and a driving mechanism 7.
Two sliding rails 6 are arranged on the supporting plate 5 in parallel along its length direction. The moving plate 1 is arranged above the sliding rails 6 and can slide back and forth along them. The light shielding sheet 2 and the infrared filter 3 are arranged in sequence on the moving plate 1. One end of the connecting piece 4 of the driving mechanism is connected to one end of the moving plate 1, and the other end of the connecting piece 4 is connected to the driving mechanism (not shown in the figure).
Fig. 2 is a flowchart of a privacy security protection method for a television camera according to an embodiment of the present application. As shown in Fig. 2, the privacy security protection method for a television camera provided in the embodiments of the present application may include S201 to S203:
S201, the server acquires first sound information.
In the embodiments of the present application, the first sound information instructs the server to shield the lens of the television camera.
Televisions are generally installed in areas such as bedrooms and living rooms. When there are children at home, or when many people at home create a noisy environment, the television camera may be turned off or on unintentionally, which can cause unnecessary trouble for users. To avoid this problem, the present application specifies the content and the sound features of the first sound information, and determines the first sound information by sound matching before shielding the television camera. As shown in Fig. 3, this can be implemented through the following steps:
S301, the server acquires sound information collected by the audio collection device and extracts a first sound feature corresponding to the sound information.
In the embodiments of the present application, after a user speaks to the audio collection device, the audio collection device collects the sound information, and the server takes the sound frequency and voiceprint information in the sound information as the first sound feature.
In some embodiments of the present application, irrelevant sound in the sound information collected by the audio collection device may cause sound recognition errors at the server. Noise reduction can therefore be achieved by keeping only the sound within a preset sound frequency interval.
First, the server acquires the sound frequency of the sound information.
Second, according to the preset sound frequency interval, the server intercepts from the sound information the portion whose sound frequency lies within that interval, as the sound information to be extracted.
Because the environment of the television is uncertain, extracting only the sound within the preset sound frequency interval achieves a degree of noise reduction on the sound information.
Specifically, a sound frequency interval for extracting the sound information is preset in the server, for example [a, b] Hz. The server determines the sound frequency of the sound information and then, according to the preset interval, eliminates the sound that does not fall within it.
Finally, the server extracts sound features from the sound information to be extracted and takes the extracted sound features as the first sound feature corresponding to the sound information.
Because the environment of the television is uncertain, various noises may exist around it. This scheme eliminates some or all of the irrelevant sound in the sound information collected by the audio collection device and thereby further improves the accuracy of sound information recognition.
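One possible way to keep only the sound within the preset frequency interval is to zero the out-of-band bins of a discrete Fourier transform. The sketch below is an assumption about how such interception could be done, not the method disclosed in the application; the function name `band_limit` and its parameters are hypothetical, and the values of a and b are left to the caller because the application does not specify them. A naive O(n²) transform is used so that the example needs only the standard library.

```python
import cmath
import math


def band_limit(samples, sample_rate, low_hz, high_hz):
    """Zero DFT bins outside [low_hz, high_hz] and return the filtered samples."""
    n = len(samples)
    # Forward DFT (O(n^2); acceptable for a short illustrative frame).
    spectrum = [
        sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    for k in range(n):
        # Bins k and n - k represent the same physical frequency.
        freq = min(k, n - k) * sample_rate / n
        if not (low_hz <= freq <= high_hz):
            spectrum[k] = 0
    # Inverse DFT; the input is real, so only the real part is kept.
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]
```

For example, a frame containing a 5 Hz tone and a 20 Hz tone, filtered with the interval [3, 8] Hz, retains only the 5 Hz component. A production system would more likely use an FFT or a time-domain band-pass filter.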
S302, the server matches the first sound feature against a preset sound feature.
The preset sound feature comprises at least sound frequency and voiceprint information.
In the embodiments of the present application, the server stores one or more sound features in advance, each comprising sound frequency and voiceprint information. The preset sound features are matched against the sound feature of the sound information collected by the audio collection device, and the matching result determines whether the television camera is to be shielded or opened.
In an embodiment of the present application, the preset sound feature may include sound frequency and voiceprint information, and may further include information such as the loudness and timbre of the sound, all of which are used to screen the sound information collected by the audio collection device. The embodiments of the present application match the first sound feature against the preset sound feature on items such as sound frequency and voiceprint information.
For example, the sound frequency of the preset sound feature may be set to a preset interval of [a, b] Hz. The server determines whether the sound frequency of the first sound feature lies within [a, b] Hz. If it does not, the first sound feature does not match the preset sound feature.
If the sound frequency of the first sound feature lies within [a, b] Hz, the voiceprint of the first sound feature is matched against the voiceprint information in the preset sound feature. The preset sound feature may include a pre-recorded voiceprint database, so that the first sound features of sound information uttered by different users can be matched.
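The two-stage matching described in S302 (frequency interval first, voiceprint second) can be sketched as below. This is an illustrative assumption only: `SoundFeature` and `matches_preset` are hypothetical names, and the voiceprint is reduced to a placeholder label rather than real voiceprint data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoundFeature:
    frequency_hz: float
    voiceprint: str  # placeholder label standing in for real voiceprint data


def matches_preset(feature, interval_hz, voiceprint_database):
    """Stage 1: frequency interval check. Stage 2: voiceprint lookup."""
    low, high = interval_hz
    if not (low <= feature.frequency_hz <= high):
        return False  # outside [a, b] Hz, so no voiceprint check is made
    return feature.voiceprint in voiceprint_database
```

Rejecting on frequency first means the voiceprint comparison runs only for sounds that already fall inside the preset interval.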
S303, when the first sound feature is successfully matched with the preset sound feature, the server recognizes the sound information corresponding to the first sound feature to determine the first sound information.
In an embodiment of the present application, the process of recognizing the sound information corresponding to the first sound feature, as shown in Fig. 4, may include the following steps:
S401, the server converts the sound information corresponding to the first sound feature into a digital signal through an analog-to-digital conversion device, and divides the digital signal into a plurality of audio blocks.
In the embodiments of the present application, the analog-to-digital conversion device converts the sound information collected by the audio collection device into a digital signal, and the digital signal is divided into a plurality of audio blocks, that is, a framing operation is performed on the sound. The analog-to-digital conversion device may be a direct analog-to-digital converter or an indirect analog-to-digital converter, and the present application does not limit its type.
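The framing operation in S401 can be illustrated with a short sketch. The application does not state the block length or whether adjacent blocks overlap, so `block_size` and `hop_size` below are hypothetical parameters, and dropping an incomplete trailing block is an assumption.

```python
def split_into_blocks(samples, block_size, hop_size=None):
    """Split a digital signal into fixed-length audio blocks (framing).

    hop_size is the step between block starts; it defaults to block_size,
    which gives non-overlapping blocks. A trailing partial block is dropped.
    """
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    hop = hop_size or block_size
    return [
        samples[start:start + block_size]
        for start in range(0, len(samples) - block_size + 1, hop)
    ]
```

Speech front ends commonly use overlapping blocks, which corresponds to choosing a `hop_size` smaller than `block_size`.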
S402, the server matches the audio blocks against the keyword sequences in a preset keyword sequence library.
The keyword sequences are a plurality of preset digital signal sequences. In one embodiment of the present application, the plurality of audio blocks are matched against the keyword sequences in the preset keyword sequence library to determine whether the user has input a television camera control command. For example, if the combined audio blocks form the sequence "camera shading", the server searches the preset keyword sequence library for a keyword sequence that corresponds to it. If such a keyword sequence exists in the library, the matching is successful.
Specifically, the server may convert the sound information collected by the audio collection device into digital information and perform a framing operation to obtain a plurality of corresponding audio blocks. The server then extracts Mel-frequency cepstral coefficient (MFCC) features from the audio blocks to complete acoustic feature extraction. MFCC feature extraction on the sound information yields an N-dimensional matrix whose number of columns corresponds to the number of audio blocks. Each column of the matrix is input in turn into a trained acoustic model and language model to obtain the text information corresponding to the sound information, such as "camera open". Keyword sequences are preset in the keyword sequence library of the server, and the server matches the text information corresponding to the sound information against these keyword sequences to determine whether the sound information is first sound information with which the user triggers shielding of the television camera.
S403, when the audio blocks of the sound information are successfully matched with a keyword sequence in the preset keyword sequence library, the server determines that the sound information corresponding to the first sound feature is the first sound information.
In the embodiments of the present application, after determining that the sound information collected by the audio collection device matches a keyword sequence in the preset keyword sequence library, the server determines that the sound information is the first sound information, which is an instruction that triggers shielding of the television camera. On determining that the first sound information has been received, the server generates a corresponding control instruction so that the driving mechanism in the television camera operates and moves the light shielding sheet in front of the lens of the television camera. For example, the keyword sequences in the keyword sequence library may include "camera", "mask", "open", or other keyword sequences.
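Once the acoustic and language models have produced text, the final matching step in S402 and S403 reduces to comparing that text with the keyword sequence library. The sketch below assumes the recognized text is already available as a string; the library entries reuse the example phrases "camera shading" and "camera open" from the text, while the names `KEYWORD_LIBRARY`, `contains_in_order`, and `classify_command` are hypothetical.

```python
# Hypothetical keyword sequence library: each sequence of keywords maps to
# the kind of sound information it triggers.
KEYWORD_LIBRARY = {
    ("camera", "shading"): "first",   # shield the lens
    ("camera", "open"): "second",     # open the lens
}


def contains_in_order(words, keywords):
    """Return True if every keyword occurs in words, in the given order."""
    remaining = iter(words)
    return all(keyword in remaining for keyword in keywords)


def classify_command(text):
    """Return 'first', 'second', or None for text with no keyword match."""
    words = text.lower().split()
    for keywords, kind in KEYWORD_LIBRARY.items():
        if contains_in_order(words, keywords):
            return kind
    return None
```

Requiring the keywords in order means that unrelated speech which merely contains one of the words, such as "open the window", does not trigger the camera.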
In another embodiment of the present application, the server may provide a voiceprint recognition function to distinguish between the different users who control shielding or opening of the television camera. Accordingly, the process of recognizing the sound information corresponding to the first sound characteristic may further include S501-S503:
S501, the server performs feature extraction on the sound information to determine a first voiceprint feature vector of the sound information.
In this embodiment of the present application, the same method may be used for extracting the sound information features as the acoustic feature extraction in S402, and the server records the acoustic features after the feature extraction as the first voiceprint feature vector.
S502, based on a preset voiceprint sample library, the server determines whether a first voiceprint feature vector of the voice information exists in the voiceprint sample library.
The voiceprint sample library is a set of at least one pre-entered voiceprint feature vector. In the embodiment of the application, the first voiceprint feature vector is matched against the voiceprint features in the preset voiceprint sample library, and if the matching succeeds, the first voiceprint feature vector is determined to exist in the voiceprint sample library.
In an embodiment of the present application, a neural network model may also be established in advance and trained with the voiceprints of the members of the user's household as input samples. For example, if the household includes three persons A, B, and C, the voices of the three persons are input into the neural network model as samples to obtain a trained neural network model capable of recognizing voiceprints. If the sound collected by the audio collection device is uttered by user A, the corresponding first voiceprint feature vector is input into the trained neural network model, and the server determines that the sound carries the voiceprint of user A.
S503, under the condition that the first voiceprint feature vector of the voice information exists in the voiceprint sample library, the server determines that the first voice feature is successfully matched with the preset voice feature.
And under the condition that the first sound characteristic is successfully matched with the preset sound characteristic, the server determines the sound information acquired by the audio acquisition equipment as the first sound information so as to correspondingly control the television camera through the content of the first sound information.
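Steps S501-S503 can be sketched as a lookup in the voiceprint sample library. The application does not state how two voiceprint feature vectors are compared, so the cosine similarity, the 0.9 threshold, and the names cosine_similarity and find_enrolled_speaker below are assumptions chosen only for illustration.

```python
import math
from typing import Dict, Optional, Sequence

def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def find_enrolled_speaker(vector: Sequence[float],
                          sample_library: Dict[str, Sequence[float]],
                          threshold: float = 0.9) -> Optional[str]:
    """Return the best-matching enrolled user, or None if no entry reaches the threshold."""
    best_name, best_score = None, threshold
    for name, reference in sample_library.items():
        score = cosine_similarity(vector, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name
```

A return value of None corresponds to the case in which the first voiceprint feature vector does not exist in the voiceprint sample library, so the sound information is not treated as the first sound information.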
In actual use of the television, someone may maliciously operate the shielding or opening state of the television camera, so that the privacy of the user is leaked. To prevent such a person from achieving this purpose, the application provides the following embodiment: in the case that the first voiceprint feature vector of the sound information does not exist in the voiceprint sample library, the server records the number of times the sound information corresponding to the first voiceprint feature vector is acquired within a preset time.
If the person who issues the sound information intends to operate the television camera maliciously, the server determines that the person's voiceprint information is not in the voiceprint sample library and does not perform the shielding operation. On seeing that the television camera has not been shielded, the person may issue the shielding instruction several more times. The server can then record the number of times sound information issued by the person is acquired within the preset time, for example, 15 times within two minutes. When someone maliciously operates the television camera, the user can be prompted as follows:
In the case that the number of times the sound information corresponding to the first voiceprint feature vector is acquired within the preset time is greater than or equal to a first preset threshold, warning information is generated and sent to the user terminal. The warning information is text information and/or sound information.
Specifically, so that the user can learn of and stop malicious operation of the television camera in time, a first preset threshold may be set for the number of times the sound information is acquired within the preset time. For example, if the first preset threshold is 10, then when sound information whose voiceprint feature vector is not in the voiceprint sample library is acquired 10 or more times within the preset time, the server generates the warning information and sends it to the user terminal.
With this scheme, when someone maliciously operates the television camera, the user can discover the malicious behavior in time through the warning information received by the user terminal and take corresponding measures. The scheme improves the privacy security of the user and the user's experience of the television camera.
It should be noted that the user terminal may be the user's mobile phone, smart watch, smart bracelet, or other device, which is not limited in this application. Further, the warning information may be text information and/or sound information, with content such as "an illegal user is attempting to use the camera".
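The counting of rejected commands described above can be sketched as a sliding-window counter. The class name UnknownVoiceMonitor and its interface are hypothetical; the two-minute window and the threshold of 10 are the example values from the description, and the sketch tracks a single unrecognized voiceprint for simplicity.

```python
from collections import deque
from typing import Deque

class UnknownVoiceMonitor:
    """Counts commands from an unenrolled voiceprint within a sliding time window."""

    def __init__(self, window_seconds: float = 120.0, threshold: int = 10) -> None:
        self.window_seconds = window_seconds
        self.threshold = threshold
        self._timestamps: Deque[float] = deque()

    def record(self, timestamp: float) -> bool:
        """Record one rejected command; return True if warning information should be sent."""
        self._timestamps.append(timestamp)
        # Discard acquisitions that have fallen out of the preset time window.
        while timestamp - self._timestamps[0] > self.window_seconds:
            self._timestamps.popleft()
        return len(self._timestamps) >= self.threshold
```

With the default values, the tenth rejected command within two minutes makes record return True, at which point the server would generate the warning information and send it to the user terminal.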
S202, whether a light shielding sheet of the television camera is located at a first preset position or not is determined based on the first sound information.
The light shielding sheet is arranged on a moving plate of the television camera, and the moving plate is arranged in the housing of the television camera. In the embodiment of the present application, the filter switcher corresponding to fig. 1 is inside the housing of the television camera, and the moving plate carrying the light shielding sheet is disposed in front of the lens of the television camera.
In the embodiment of the application, the first preset position can be that the light shielding sheet is positioned in front of the lens of the television camera and completely shields the lens of the television camera. The position detection equipment detects the position of the movable plate, so that whether the light shielding sheet of the television camera is in a first preset position or not is determined.
Televisions are commonly used for entertainment, and family members may use them at different times; in some families, parents limit the time children spend on such entertainment. Therefore, the server may set a corresponding user level for each family member, or a user may set the user levels through the server, so as to allocate the time during which each member may control the television camera.
In an embodiment of the present application, controlling the television camera according to the user level specifically includes the following steps, as shown in fig. 6:
S601, the server determines the user level corresponding to the first sound information.
The user level is determined according to the voiceprint information and indicates the instruction execution level for shielding the television camera; each instruction execution level corresponds to a time period in which the shielding instruction may be executed.
In the embodiment of the application, the server may set user levels for different users, such as a first level and a second level, whose instruction execution time periods are 0:00-24:00 and 19:00-21:00, respectively. In actual use, a parent's user level may be the first level and a child's user level the second level, the child's instruction execution level being lower than the parent's.
The server determines a corresponding user grade according to the voiceprint information corresponding to the first sound information, for example, if the first sound information is a voiceprint of a child, the corresponding user grade is a second grade.
S602, according to a preset rule and the user level corresponding to the first sound information, the server determines the execution time corresponding to the first sound information.
In the embodiment of the application, the preset rule is that the server stores the user level and the time period within which the instruction corresponding to the user level can be executed in advance, and according to the user level, the server determines the execution time corresponding to the first sound information. In the actual use process, the user can set the user level for each family member to distinguish the control time of the television camera, so that the conflict of the television use time is avoided.
S603, the server acquires the acquisition time of the first sound information and determines whether the acquisition time of the first sound information is matched with the execution time corresponding to the first sound information.
In the embodiment of the application, the server determines whether the acquisition time of the first sound information is within the execution time corresponding to the first sound information, and then executes the corresponding operation. If the acquisition time of the first sound information is 6:00 and the execution time corresponding to the first sound information is 7:00 to 9:00, the server determines that the acquisition time of the first sound information does not match the execution time corresponding to the first sound information.
S604, under the condition that the acquisition time of the first sound information is matched with the execution time of the first sound information, the server determines whether a light shielding sheet of the television camera is located at a first preset position.
In practical use, the user levels of the family members are stored in the server in advance. If the server determines that the first sound information comes from a child, the instruction execution time period corresponding to the child's user level is 19:00-21:00, and the acquisition time of the first sound information is 20:00, the server controls the driving mechanism to shield the lens of the television camera.
In the embodiment of the application, the acquisition time is matched against the execution time corresponding to the user level, so that a usage time is set for each user. Users can thereby be supervised and prevented from using the television camera for entertainment and the like for long periods. With this scheme, the server reasonably allocates the time each user may use the television camera, which avoids family disharmony caused by family members competing for it.
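Steps S601-S604 amount to a table lookup followed by a time comparison, which can be sketched as follows. The table LEVEL_WINDOWS and the function command_allowed are hypothetical names; the two windows are the example values given above (first level 0:00-24:00, second level 19:00-21:00), and treating both ends of a window as inclusive is an assumption.

```python
from datetime import time
from typing import Dict, Tuple

# Assumed preset rule: user level -> (start, end) of the instruction execution time period.
LEVEL_WINDOWS: Dict[int, Tuple[time, time]] = {
    1: (time.min, time.max),          # first level, e.g. a parent: whole day
    2: (time(19, 0), time(21, 0)),    # second level, e.g. a child: 19:00-21:00
}

def command_allowed(user_level: int, acquired_at: time,
                    windows: Dict[int, Tuple[time, time]] = LEVEL_WINDOWS) -> bool:
    """Return True if the acquisition time falls within the level's execution period."""
    window = windows.get(user_level)
    if window is None:
        return False  # unknown level: do not execute the instruction
    start, end = window
    return start <= acquired_at <= end
```

Under these assumed windows, a child's instruction acquired at 20:00 is executed, while the same instruction acquired at 6:00 is not.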
S203, in the case that the light shielding sheet is not at the first preset position, a first control instruction is generated and sent to the driving mechanism, so that the driving mechanism operates according to the first control instruction and moves the moving plate connected to it until the light shielding sheet arranged on the moving plate is at the first preset position, thereby shielding the lens of the television camera.
That is, when the light shielding sheet is not at the first preset position, the first control instruction is generated and sent to the driving mechanism to move the light shielding sheet to the first preset position.
In an embodiment of the application, after the light shielding sheet has moved to the first preset position, the server may generate first prompt information and send it to the early warning device. The early warning device may be a device pre-installed on the television camera or the television that can emit sound information and/or light information to notify the user that shielding of the television camera is complete.
When the light shielding sheet is already at the first preset position, the server may perform no operation, may generate position information of the light shielding sheet and send it to the user terminal, or may generate light information according to the position information and convey it to the user. The television camera may be provided with a light emitting diode for emitting the light information.
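The position check in S202 and the conditional control instruction in S203 can be sketched as below. ShadePosition, DriveMechanism, and shield_lens are hypothetical stand-ins for the position detection device, the driving mechanism, and the server logic; the sketch simply records the instruction instead of driving real hardware.

```python
from enum import Enum
from typing import List

class ShadePosition(Enum):
    FIRST_PRESET = 1    # light shielding sheet fully covers the lens
    SECOND_PRESET = 2   # lens is open

class DriveMechanism:
    """Hypothetical stand-in for the motor that slides the moving plate."""

    def __init__(self, position: ShadePosition) -> None:
        self.position = position
        self.instructions: List[str] = []

    def execute(self, instruction: str, target: ShadePosition) -> None:
        # Record the instruction and assume the plate reaches the target position.
        self.instructions.append(instruction)
        self.position = target

def shield_lens(drive: DriveMechanism) -> bool:
    """Send the first control instruction only if the sheet is not already shielding."""
    if drive.position is ShadePosition.FIRST_PRESET:
        return False  # already shielded: no instruction is generated
    drive.execute("first_control_instruction", ShadePosition.FIRST_PRESET)
    return True
```

Calling shield_lens twice on a camera that starts open sends exactly one instruction, which mirrors the case above in which the server performs no operation when the sheet is already at the first preset position.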
Based on the above method, the server determines, from the sound information collected by the audio collection device, the first sound information that indicates shielding of the television camera, and performs the shielding operation on the television camera according to the first sound information. With this scheme, the user controls the driving mechanism in the television camera by voice so that the moving plate provided with the optical filter moves, which frees the user's hands and improves the experience of using the television camera.
By denoising the sound information, extracting its keywords, and extracting its voiceprint features, the server determines whether the sound information is a preset sound and, in turn, whether it is the first sound information. The server can thus identify the user operating the television camera, which prevents the user's privacy from being leaked through malicious operation of the television camera.
The optical filter switcher is arranged in the shell of the television camera, the position of the optical filter cannot be seen from the outside, and the appearance of the television camera cannot be influenced when the lens of the television camera is shielded or opened, so that the attractiveness of the television camera can be ensured.
In some embodiments of the present application, the server may also perform an open operation on the television camera, where the open operation of the television camera includes the following steps:
S701, the server obtains second sound information.
In the embodiment of the application, the second sound information indicates that the lens of the television camera is to be opened. The second sound information is acquired in the same manner as the first sound information and, like the first sound information, has a corresponding user level and instruction execution time period.
S702, based on the second sound information, the server determines whether the shading sheet of the television camera is located at a second preset position.
In the embodiment of the application, when the light shielding sheet is at the second preset position, the infrared filter is at the lens position of the television camera, so that the television camera can capture images or video. It should be noted that the infrared filter may be a transparent filter, which is not limited in the present application.
S703, in the case that the light shielding sheet is not at the second preset position, the server generates a second control instruction and sends it to the driving mechanism, so that the driving mechanism operates according to the second control instruction and moves the moving plate connected to it until the light shielding sheet is at the second preset position, thereby opening the lens of the television camera.
In an embodiment of the application, after the light shielding sheet has moved to the second preset position, the server may generate second prompt information and send it to the early warning device. The early warning device may be a device pre-installed on the television camera or the television that can emit sound information and/or light information to notify the user that opening of the television camera is complete.
With the above scheme, the embodiment of the present application can implement the operations of shielding and opening the television camera. In an embodiment of the present application, to further strengthen the privacy security protection function of the television camera, the privacy security protection method further includes the following steps, as shown in fig. 8:
S801, the server determines the number of pieces of first sound information and of second sound information within the corresponding preset time period.
To prevent a person from intentionally interfering with normal use of the television camera, the server may obtain the numbers of pieces of first sound information and second sound information acquired within a preset time period, for example, 2 minutes, where the first sound information and the second sound information may come from different users. This embodiment can be used in a scenario where, for example, a naughty child deliberately interferes with a parent's use of the television camera; it prevents the shielding or opening of the television camera from being operated repeatedly and normal use from being affected.
S802, under the condition that the number of the first sound information is larger than a second preset threshold value, and/or under the condition that the number of the second sound information is larger than the second preset threshold value, whether a light shielding sheet of the television camera is located at a second preset position is determined.
In this embodiment of the application, the second preset threshold may be set to 3. When the first sound information and/or the second sound information is acquired more than 3 times within a preset time period, for example, 20 seconds, the server may determine that the television camera is being used maliciously. The server then determines whether the light shielding sheet is at the second preset position and, if so, causes the television camera to take a picture or record a video.
S803, in the case that the light shielding sheet is not at the second preset position, a second control instruction is generated and sent to the driving mechanism so that the light shielding sheet is moved to the second preset position.
S804, in the case that the light shielding sheet is at the second preset position, a third control instruction is generated so that the television camera captures a corresponding image and/or video, and the image and/or video is sent to the user terminal so that the user terminal displays it to the user.
In the embodiment of the application, for safety, the server has the television camera capture an image and/or video when the television camera is being maliciously operated and sends it to the user terminal, so that the user can promptly discover problems at home from the image information displayed by the user terminal. The user can also be reminded to use the television camera normally, which prevents damage to the television camera caused by the moving plate sliding many times within a short period to switch the position of the optical filter.
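The decision flow of S801-S804 can be sketched as a function that maps the two counts and the current sheet position to the control instructions to be issued. The function name, the instruction strings, and the reading that the capture in S804 follows immediately once S803 has opened the lens are assumptions; the threshold of 3 is the example value from the description.

```python
from typing import List

def respond_to_repeated_commands(first_count: int, second_count: int,
                                 at_second_preset: bool,
                                 second_threshold: int = 3) -> List[str]:
    """Return the control instructions issued for the counts observed in the preset period."""
    # S802: act only if either count exceeds the second preset threshold.
    if first_count <= second_threshold and second_count <= second_threshold:
        return []
    instructions: List[str] = []
    if not at_second_preset:
        # S803: move the light shielding sheet to the second preset position first.
        instructions.append("second_control_instruction")
    # S804: capture an image and/or video to be sent to the user terminal.
    instructions.append("third_control_instruction")
    return instructions
```

For example, five shielding commands within the period while the lens is shielded would yield the second control instruction followed by the third, whereas two commands of each kind yield no instruction at all.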
The present application further provides a privacy security protection device for a television camera, as shown in fig. 9, the device includes:
at least one processor; and a memory communicatively coupled to the at least one processor. The memory stores instructions executable by the at least one processor to enable the at least one processor to: acquire first sound information collected by the audio collection device, the first sound information indicating that the lens of the television camera is to be shielded; determine, based on the first sound information, whether a light shielding sheet of the television camera is at a first preset position, the light shielding sheet being arranged on a moving plate of the television camera and the moving plate being arranged in the housing of the television camera; and, in the case that the light shielding sheet is not at the first preset position, generate a first control instruction and send it to the driving mechanism, so that the driving mechanism operates according to the first control instruction and moves the moving plate connected to it until the light shielding sheet arranged on the moving plate is at the first preset position, thereby shielding the lens of the television camera.
The embodiments in the present application are described in a progressive manner, and the same and similar parts among the embodiments can be referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the apparatus embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
The devices and the methods provided by the embodiment of the application are in one-to-one correspondence, so the devices also have beneficial technical effects similar to the corresponding methods.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The above description is only an example of the present application and is not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (10)

1. A privacy security protection method of a television camera is characterized by comprising the following steps:
acquiring first sound information; the first sound information is used for indicating that the lens of the television camera is shielded;
determining whether a light shielding sheet of the television camera is located at a first preset position or not based on the first sound information; the light shielding sheet is arranged on a moving plate of the television camera, and the moving plate is arranged in a shell of the television camera;
and under the condition that the light shielding sheet is not located at the first preset position, generating a first control instruction and sending the first control instruction to a driving mechanism so that the driving mechanism operates according to the first control instruction and enables the moving plate connected with the driving mechanism to move until the light shielding sheet arranged on the moving plate is located at the first preset position, and shielding of a lens of the television camera is achieved.
2. The method according to claim 1, wherein the determining whether a light shielding sheet of the television camera is located at a first preset position based on the first sound information specifically includes:
determining a user level corresponding to the first sound information;
determining the execution time corresponding to the first sound information according to a preset rule and the user level corresponding to the first sound information;
acquiring the acquisition time of the first sound information, and determining whether the acquisition time of the first sound information is matched with the execution time corresponding to the first sound information;
and under the condition that the acquisition time of the first sound information is matched with the execution time of the first sound information, determining whether a light shielding sheet of the television camera is at a first preset position.
3. The method of claim 1, further comprising:
acquiring second sound information; the second sound information is used for indicating that a lens of the television camera is opened;
determining whether a light shielding sheet of the television camera is located at a second preset position or not based on the second sound information;
and under the condition that the light shielding sheet is not located at the second preset position, generating a second control instruction and sending the second control instruction to a driving mechanism so that the driving mechanism operates according to the second control instruction and enables a moving plate connected with the driving mechanism to move until the light shielding sheet is located at the second preset position, and therefore the lens of the television camera is opened.
4. The method of claim 3, further comprising:
determining the number of the first sound information and the second sound information in a corresponding preset time period;
determining whether a light shielding sheet of the television camera is located at a second preset position or not under the condition that the number of the first sound information is larger than a second preset threshold value and/or the number of the second sound information is larger than the second preset threshold value;
under the condition that the light shielding sheet is not located at the second preset position, generating a second control instruction and sending the second control instruction to a driving mechanism so that the light shielding sheet is located at the second preset position;
and under the condition that the light shielding sheet is at the second preset position, generating a third control instruction to enable the television camera to shoot to obtain a corresponding image and/or video, and sending the image and/or video to a user terminal to enable the user terminal to display the image and/or video to a user.
5. The method according to claim 1, wherein the acquiring the first sound information specifically includes:
acquiring sound information acquired by audio acquisition equipment, and extracting first sound characteristics corresponding to the sound information;
matching the first sound characteristic with a preset sound characteristic; the preset sound characteristics at least comprise sound frequency and voiceprint information;
and under the condition that the first sound characteristic is successfully matched with the preset sound characteristic, identifying the sound information corresponding to the first sound characteristic to determine the first sound information.
6. The method according to claim 5, wherein the acquiring of the sound information acquired by the audio acquisition device and the extracting of the first sound feature corresponding to the sound information specifically includes:
acquiring the sound frequency of the sound information;
intercepting the sound information to be extracted, of which the sound frequency is in a preset sound frequency interval, in the sound information according to the preset sound frequency interval;
and performing sound feature extraction on the sound information to be extracted, and taking the extracted sound feature as a first sound feature corresponding to the sound information.
7. The method according to claim 5, wherein, when the first sound feature is successfully matched with a preset sound feature, identifying sound information corresponding to the first sound feature to determine the first sound information specifically includes:
converting the sound information corresponding to the first sound characteristic into a digital signal through analog-to-digital conversion equipment, and dividing the digital signal into a plurality of audio blocks;
matching the plurality of audio blocks with keyword sequences in a preset keyword sequence library; the keyword sequence is a plurality of preset digital signal sequences;
and under the condition that the audio blocks of the sound information are successfully matched with the keyword sequences in a preset keyword sequence library, determining the sound information corresponding to the first sound characteristic as the first sound information.
8. The method of claim 5, further comprising:
performing feature extraction on the sound information to determine a first voiceprint feature vector of the sound information;
determining whether a first voiceprint feature vector of the sound information exists in a voiceprint sample library based on a preset voiceprint sample library; the voiceprint sample library is a set with at least one voiceprint characteristic vector which is input in advance;
and under the condition that the first voiceprint feature vector of the voice information exists in the voiceprint sample library, determining that the first voice feature is successfully matched with a preset voice feature.
9. The method of claim 8, further comprising:
under the condition that a first voiceprint feature vector of the voice information does not exist in the voiceprint sample library, recording the acquisition times of the voice information corresponding to the first voiceprint feature vector in a preset time;
generating warning information and sending the warning information to a user terminal under the condition that the acquisition times of the sound information corresponding to the first voiceprint feature vector in a preset time are greater than or equal to a first preset threshold value; wherein, the warning information is character information and/or sound information.
10. A privacy securing apparatus for a television camera, the apparatus comprising:
at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to:
acquiring first sound information; the first sound information is used for indicating that the lens of the television camera is shielded;
determining whether a light shielding sheet of the television camera is located at a first preset position or not based on the first sound information; the light shielding sheet is arranged on a moving plate of the television camera, and the moving plate is arranged in a shell of the television camera;
and under the condition that the light shielding sheet is not located at the first preset position, generating a first control instruction and sending the first control instruction to a driving mechanism so that the driving mechanism operates according to the first control instruction and enables the moving plate connected with the driving mechanism to move until the light shielding sheet arranged on the moving plate is located at the first preset position, and shielding of a lens of the television camera is achieved.
CN202110246744.1A 2021-03-05 2021-03-05 Privacy security protection method and device for television camera Pending CN112992122A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110246744.1A CN112992122A (en) 2021-03-05 2021-03-05 Privacy security protection method and device for television camera

Publications (1)

Publication Number Publication Date
CN112992122A (en) 2021-06-18

Family

ID=76353194

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110246744.1A Pending CN112992122A (en) 2021-03-05 2021-03-05 Privacy security protection method and device for television camera

Country Status (1)

Country Link
CN (1) CN112992122A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103730120A (en) * 2013-12-27 2014-04-16 深圳市亚略特生物识别科技有限公司 Voice control method and system for electronic device
CN106971718A (en) * 2017-04-06 2017-07-21 绵阳美菱软件技术有限公司 Air conditioner and control method for air conditioner
CN208074467U (en) * 2018-03-06 2018-11-09 陆虎保罗(广州)智能科技有限公司 Intelligent voice recognition camera
CN108831461A (en) * 2018-06-22 2018-11-16 安徽江淮汽车集团股份有限公司 Sunroof voice control method and system
CN108877790A (en) * 2018-05-21 2018-11-23 江西午诺科技有限公司 Speaker control method, device, readable storage medium and mobile terminal
WO2019201304A1 (en) * 2018-04-20 2019-10-24 比亚迪股份有限公司 Face recognition-based voice processing method, and device
CN111145759A (en) * 2019-12-27 2020-05-12 周洋 Voice alarm system based on voiceprint feature recognition for travel industry
CN111243142A (en) * 2020-02-04 2020-06-05 浙江大华技术股份有限公司 Failure processing method and device for intelligent door lock and storage medium
US20200311290A1 (en) * 2019-03-29 2020-10-01 Lenovo (Singapore) Pte. Ltd. Apparatus, method, and program product for operating a display in privacy mode

Similar Documents

Publication Publication Date Title
EP3418881B1 (en) Information processing device, information processing method, and program
JP6635049B2 (en) Information processing apparatus, information processing method and program
KR101749100B1 (en) System and method for integrating gesture and sound for controlling device
US10834456B2 (en) Intelligent masking of non-verbal cues during a video communication
CN102522102A (en) Intelligent determination of replays based on event identification
CN207399418U (en) Television based on face recognition
CN101925915A (en) Device access control
CN108962220A (en) Text display method and device in multimedia file playback scenario
CN109032345B (en) Equipment control method, device, equipment, server and storage medium
CN109240639A (en) Audio data acquisition method, device, storage medium and terminal
CN108538284A (en) Simultaneous interpretation result display method and device, simultaneous interpretation method and device
US9576587B2 (en) Example-based cross-modal denoising
CN106454462B (en) Viewing permission control method and device for smart television
CN112992122A (en) Privacy security protection method and device for television camera
Libal et al. Multimodal classification of activities of daily living inside smart homes
WO2019150708A1 (en) Information processing device, information processing system, information processing method, and program
US10838741B2 (en) Information processing device, information processing method, and program
US11315544B2 (en) Cognitive modification of verbal communications from an interactive computing device
JP2018180472A (en) Control device, control method, and control program
CN112420046A (en) Multi-person conference method, system and device suitable for hearing-impaired people to participate
US8203593B2 (en) Audio visual tracking with established environmental regions
US20210166688A1 (en) Device and method for performing environmental analysis, and voice-assistance device and method implementing same
CN111901552B (en) Multimedia data transmission method and device and electronic equipment
US20220217442A1 (en) Method and device to generate suggested actions based on passive audio
CN111639635B (en) Processing method and device for shooting pictures, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210618
