WO2023124840A1 - Video content display method and apparatus, and electronic device and storage medium - Google Patents

Publication number
WO2023124840A1
WO2023124840A1 PCT/CN2022/137028 CN2022137028W WO2023124840A1 WO 2023124840 A1 WO2023124840 A1 WO 2023124840A1 CN 2022137028 W CN2022137028 W CN 2022137028W WO 2023124840 A1 WO2023124840 A1 WO 2023124840A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
video content
identity
video
content
Application number
PCT/CN2022/137028
Other languages
French (fr)
Chinese (zh)
Inventor
朱红军
官丹
赵志东
梅君君
Original Assignee
中兴通讯股份有限公司
Application filed by 中兴通讯股份有限公司
Publication of WO2023124840A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/441 Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/441 Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card
    • H04N21/4415 Acquiring end-user identification, e.g. using personal code sent by the remote control or by inserting a card using biometric characteristics of the user, e.g. by voice recognition or fingerprint scanning
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/454 Content or additional data filtering, e.g. blocking advertisements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/454 Content or additional data filtering, e.g. blocking advertisements
    • H04N21/4542 Blocking scenes or portions of the received content, e.g. censoring scenes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/475 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/475 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
    • H04N21/4756 End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data for rating content, e.g. scoring a recommended movie

Definitions

  • The embodiments of the present application relate to the field of communication technologies, and in particular to a video content display method and apparatus, an electronic device, and a storage medium.
  • The data transmission rate is greatly improved and the data transmission delay is further reduced, which makes it possible to flexibly support communication and interaction among various devices.
  • 5G networks can also support the access of wearable devices.
  • The 5G network improves end-to-end experience and performance, and people's requirements for audio and video communication keep rising.
  • Access terminals are becoming increasingly diverse, including mobile phones, tablets, TVs, wearable devices, and so on.
  • Video conferencing application scenarios include not only multi-party video conferences but also multi-party video content sharing scenarios such as live video conferences, interactive distance-education video conferences, interactive telemedicine video conferences, and XR video conferences.
  • In existing video conferencing application systems, after a terminal accesses the system, the current video content is shared directly with the users accessing through the terminal equipment.
  • If the users accessing through terminal devices are children, teenagers, or elderly people, they may be affected by unhealthy content after joining the video conference application scenario, and may even suffer serious physical and mental harm.
  • The main purpose of the embodiments of the present application is to propose a video content display method and apparatus, an electronic device, and a storage medium, aiming to detect and block illegal content in video content so that users are not physically or mentally harmed by receiving unhealthy video content in a video conference scenario.
  • An embodiment of the present application provides a method for displaying video content, including: receiving a video content acquisition request from a user and obtaining the identity of the user; obtaining, according to the identity of the user and a preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the user's identity; and performing illegal content detection on the video content requested by the user according to the target research and judgment rules, blocking the illegal content of the video content according to the detection result, and displaying the video content with the illegal content blocked to the user.
  • An embodiment of the present application also provides a video content display device, including: a receiving module, configured to receive a user's video content acquisition request and acquire the identity of the user; an acquisition module, configured to obtain, according to the identity of the user and the preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the identity of the user; and a processing module, configured to perform illegal content detection on the video content requested by the user according to the target research and judgment rules, block the illegal content of the video content according to the detection result, and display the video content with the illegal content blocked to the user.
  • An embodiment of the present application further provides an electronic device, including: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can execute the video content display method described above.
  • An embodiment of the present application also proposes a computer-readable storage medium storing a computer program; when the computer program is executed by a processor, the above method for displaying video content is implemented.
  • FIG. 1 is a flow chart of the method for displaying video content in an embodiment of the present application
  • FIG. 2 is a schematic structural diagram of a video conferencing system in an embodiment of the present application
  • FIG. 3 is a flow chart of a non-5G terminal accessing a video conference scenario in an embodiment of the present application
  • FIG. 4 is a flow chart of a 5G terminal accessing a video conference scenario in an embodiment of the present application
  • FIG. 5 is a flow chart of a method for detecting and processing illegal content in an embodiment of the present application
  • FIG. 6 is a schematic structural diagram of a video content display device in another embodiment of the present application
  • FIG. 7 is a schematic structural diagram of an electronic device in another embodiment of the present application
  • In the current video conferencing scenario, video content is displayed to the user directly after the user accesses the conference through a terminal device.
  • Unhealthy video content may adversely affect the physical and mental health of groups such as teenagers, children, and the elderly. How to prevent video content in video conferencing scenarios from affecting or even harming the physical and mental health and development of vulnerable groups is therefore an urgent technical problem.
  • An embodiment of the present application provides a method for displaying video content, including: receiving a user's video content acquisition request and obtaining the user's identity; obtaining, according to the user's identity and a preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the user's identity; and performing illegal content detection on the video content requested by the user according to the target research and judgment rules, blocking the illegal content according to the detection result, and displaying the video content with the illegal content blocked to the user.
  • When the video conferencing application system receives a video content acquisition request from a user, it identifies the user requesting the video content and acquires the user's identity. It then obtains the corresponding target research and judgment rules according to the acquired identity, detects and blocks illegal content in the video content according to those rules, and displays the video content with the illegal content blocked to the user.
  • In this way, the video content in the video conference scene can be checked and processed before it is displayed, so that the video content users watch is as positive as possible and unhealthy content is prevented from affecting or even harming users physically and mentally, protecting both their physical and mental health and the user experience.
  • The first aspect of the embodiments of the present application provides a method for displaying video content.
  • The method is applied to a terminal deployed in a video conferencing system, which may be any electronic device with communication and processing functions, such as a mobile phone, a computer, or a server.
  • Application in a server is used as an example for illustration.
  • The video content display method includes, but is not limited to, the following steps:
  • Step 101, receiving a user's video content acquisition request, and acquiring the user's identity.
  • After receiving the user's request to access the current video conference scene and obtain the video content, the server prompts the user to input identity verification information and identifies the user's identity according to the identity verification information input by the user.
  • The identity verification information may include information such as age and gender.
  • Before acquiring the user's identity, the server further acquires the user's terminal type and obtains a target identification method according to the terminal type; acquiring the user's identity then includes acquiring the user's identity according to the target identification method.
  • For a structural diagram of the video conferencing system, refer to FIG. 2; the system includes the access terminal used by the user and the server.
  • The server can be composed of multiple sub-servers, for example a video conferencing server, a rule server, a terminal identification server, and a research and judgment server deployed on the edge cloud. After the access terminal used by the user starts, it sends a start message to the terminal identification server, and the terminal identification server starts. Then, when the access terminal initiates a video content acquisition request to the video conference server, the terminal identification server identifies the access terminal used by the user to obtain the terminal type of the user terminal.
  • After determining the terminal type of the user terminal, the terminal identification server requests the corresponding target identification method from the rule server according to the terminal type, sends an identification instruction to the user according to the target identification method fed back by the rule server, and then identifies the user according to the identity verification information input by the user.
  • The server obtains the target identification method according to the terminal type as follows. When the terminal type is a non-5G terminal, obtaining the user's identity according to the target identification method includes one or any combination of the following: identifying the user's identity according to the user's login information, identifying the user's identity based on the user's voice, and identifying the user's identity based on the user's video image. When the terminal type is a 5G terminal, obtaining the user's identity according to the target identification method includes identifying the user's identity based on the user's fingerprint information and/or the user's iris information.
  • The terminal identification server acquires the corresponding target identification method according to the user's terminal type, and acquires the user's identity according to that method.
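The selection of identification methods by terminal type can be pictured as a simple lookup table. The following is a minimal sketch in Python, not the application's implementation: the names `TerminalType`, `IDENTIFICATION_METHODS`, and `select_identification_methods`, and the idea of intersecting the allowed methods with those a terminal reports as supported, are assumptions introduced for illustration.

```python
from enum import Enum
from typing import FrozenSet


class TerminalType(Enum):
    NON_5G = "non-5G"
    FIVE_G = "5G"


# Identification methods named in the text for each terminal type.
# The string labels themselves are hypothetical.
IDENTIFICATION_METHODS = {
    TerminalType.NON_5G: frozenset({"login_info", "voice", "video_image"}),
    TerminalType.FIVE_G: frozenset({"fingerprint", "iris"}),
}


def select_identification_methods(terminal_type: TerminalType,
                                  supported: FrozenSet[str]) -> FrozenSet[str]:
    """Return the methods allowed for this terminal type that the terminal supports.

    `supported` (what the terminal can actually capture) is an assumption;
    the text only says one method or any combination may be used.
    """
    allowed = IDENTIFICATION_METHODS[terminal_type]
    chosen = allowed & supported
    if not chosen:
        raise ValueError(
            f"no usable identification method for {terminal_type.value} terminal")
    return chosen
```

For example, a 5G terminal that reports only an iris scanner would be asked for iris identification, while a non-5G terminal with a microphone and camera could be asked for voice, video, or both.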
  • The terminal identification server may include multiple components, for example a terminal type identification control component, an audio identification component, and a video identification component.
  • For a non-5G terminal, the terminal identification server requests an audio recognition algorithm, a video recognition algorithm, or a configuration information recognition algorithm from the rule server.
  • Using the recognition algorithm contained in the response returned by the rule server, it identifies the user's identity and determines the group the user belongs to according to the login information provided when the user logs in and the real-time audio or video signal that the user is requested to input.
  • For a 5G terminal, the terminal identification server requests a fingerprint identification algorithm or an iris identification algorithm from the rule server and, using the identification algorithm contained in the response returned by the rule server, identifies the user according to the user's fingerprint or iris information collected in real time.
  • Different identification methods are used to identify the user's identity for different terminal types. On the one hand, this ensures that existing terminals can still be used as access terminals, protecting the investment of operators and consumers; on the other hand, it expands the types of terminals that can access the system, ensuring wide applicability of the video content display method.
  • For 5G terminals, the user's identity is mainly identified through biometrics. The biometric information can be iris, fingerprint, or other biometric information that can identify the user; this embodiment does not limit the specific biometric information used.
  • Non-5G terminals can include traditional terminals and smart terminals.
  • Traditional terminals mainly refer to the hard terminal devices traditionally connected to video conferencing scenarios. They are dedicated video conferencing devices used by industry users such as government and enterprises, and will continue to be used as dedicated video conferencing assets. Smart terminals mainly refer to devices such as smart TVs, smart boxes, and smartphones, on which smart applications can be installed independently to serve as video conferencing clients that access video conferencing application scenarios.
  • 5G terminals mainly refer to high-end smart watches, AR, VR, and MR equipment, and so on. This classification is only for ease of understanding and distinction and does not limit the network that a terminal can use.
  • When identifying a user's identity, a single algorithm or a combination of multiple algorithms may be used, which is not limited in this embodiment.
  • Step 301, after the non-5G terminal starts, it sends a start message to the terminal identification server, and the terminal identification server starts.
  • Step 302, the terminal identification server determines the target identification method, acquires the corresponding identification rules, and issues an identification instruction.
  • The terminal identification server requests identification rules from the rule server and, according to the rule server's response, synchronizes the corresponding configuration information identification rules, audio identification rules, and video identification rules from the rule server to its cache, so that the audio identification module and the video identification module use the latest rules for end-user identification. It then selects the target identification method for identifying the user of the accessing non-5G terminal, and issues an identification instruction to the access terminal according to the selected method.
  • Step 303, the non-5G terminal uploads user identity information according to the identification instruction.
  • According to the identification instruction, the non-5G terminal determines whether the group of the user corresponding to the access terminal is to be identified directly through the configuration information, or through real-time audio recognition or real-time video recognition. It then prompts the user to input identity information and uploads the user identity information to the terminal identification server.
  • The user's group may be children, teenagers, the elderly, and so on, and can be further divided into boys, girls, teenagers (male), teenagers (female), elderly (male), elderly (female), and so on.
  • Step 304, the terminal identification server identifies the user identity according to the received user identity information, and completes the access of the non-5G terminal.
  • The terminal identification server includes multiple modules, such as an identification control module, an audio identification module, and a video identification module.
  • The terminal identification server identifies the user's identity according to the login information and the obtained configuration identification rules and sends the identification result to the video conference server, which saves the result in the terminal information on the server for use in subsequent video content identification.
  • The non-5G terminal sends an identification request message to the identification control module of the terminal identification server; the request message carries the terminal's audio channel information.
  • The identification control module establishes an audio identification channel on the audio identification module.
  • The non-5G terminal sends the received user identification audio to the audio identification module through the specified audio identification channel. The audio identification module performs audio identification through NLP intelligent-learning voiceprint technology and sends the identification result to the identification control module.
  • The identification control module sends the identification result to the non-5G terminal and the video conferencing server, and the video conferencing server saves the result in the terminal information on the server for use in subsequent video content identification.
  • The non-5G terminal sends a video identification request message to the identification control module of the terminal identification server; the request message carries the terminal's video channel information.
  • The identification control module establishes a video identification channel on the video identification module. The non-5G terminal then sends the received user identification video to the video identification module through the specified video identification channel, and the video identification module identifies the video according to the video identification rules and sends the identification result to the identification control module.
  • The terminal identification server sends the identification result to the non-5G terminal and the video conference server, and the video conference server saves the result in the terminal information on the server for use in subsequent video content identification.
  • Because voiceprint features are mainly determined by timbre, the gender and age group of a user can be distinguished through voiceprint features. When user identification is based on user video images, faces can be recognized directly and facial feature codes extracted to identify the user, or the user's age and gender can be identified directly from the face image in the registration information database. This embodiment does not limit the specific identification information and identification methods used.
  • Step 401, after the 5G terminal starts, it sends a start message to the terminal identification server, and the terminal identification server starts.
  • Step 402, the terminal identification server determines the target identification method, obtains the corresponding identification rules, and issues an identification instruction.
  • The terminal identification server includes a plurality of modules, such as an identification control module and a biometric identification module.
  • The terminal identification server requests identification rules from the rule server and, according to the rule server's response, synchronizes the corresponding biometric identification rules and algorithms to its cache. It then calls the biometric identification module, which uses the latest biometric identification rules for end-user identification.
  • The biometric identification rule here may be an iris feature recognition rule and regularization algorithm, a fingerprint feature recognition rule and regularization algorithm, or another biometric feature recognition rule and regularization algorithm.
  • Step 403, the 5G terminal uploads user identity information according to the identification instruction.
  • According to the received identification instruction, the 5G terminal determines that the user's group is to be identified through fingerprint identification or iris identification, prompts the user to enter the corresponding biometric information, and uploads the user identity information to the terminal identification server.
  • Step 404, the terminal identification server identifies the user identity according to the received user identity information, and completes the access of the 5G terminal.
  • After the 5G terminal determines that the user's group is to be identified through biometrics, it sends a biometric identification request message to the identification control module of the terminal identification server; the request message carries the terminal's biometric identification channel information.
  • The identification control module establishes a biometric identification channel on the biometric identification module. The 5G terminal then sends the received user biometric information to the biometric identification module in the terminal identification server through the biometric identification transmission channel.
  • The biometric identification module processes and regularizes the initial biometric information using the biometric identification rules and algorithms and extracts a biometric feature code. The feature code is then sent as a unique identification code to the application server, which uses it to retrieve and confirm the user's identity. The terminal identification server then sends the identification result to the application server, the 5G terminal, and the video conferencing server.
  • The application server and the video conferencing server save the identification result in the terminal information on the server for use in subsequent video content identification.
  • When the application server searches based on the feature code, if it detects that the user has already completed biometric identification and the generation date of the previous identification result meets the identification requirements, it directly uses the previous result as the result of this identification, completing this biometric identification. If it detects that the user has not completed biometric identification, or that the previous result does not meet the requirements, the biometric identification module sends the biometric identification data to a third-party identity authentication server for a new identity authentication. The third-party identity authentication server returns the authentication result and personal information to the terminal identification server.
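The reuse of an earlier biometric identification result, with fallback to third-party authentication, can be sketched as a lookup keyed by the feature code. This is an illustrative sketch only: the `IdentificationResult` type, the `identify` function, the `authenticate` callback standing in for the third-party identity authentication server, and the 30-day validity window are all assumptions, since the text says only that the generation date must meet the identification requirements.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, Dict


@dataclass
class IdentificationResult:
    feature_code: str
    group: str              # e.g. "child", "teenager", "elderly"
    generated_at: datetime


def identify(feature_code: str,
             cache: Dict[str, IdentificationResult],
             authenticate: Callable[[str], str],
             now: datetime,
             max_age: timedelta = timedelta(days=30)) -> IdentificationResult:
    """Reuse a sufficiently recent result; otherwise authenticate again.

    `max_age` is a hypothetical stand-in for "the generation date meets the
    identification requirements".
    """
    cached = cache.get(feature_code)
    if cached is not None and now - cached.generated_at <= max_age:
        return cached                      # previous result is still acceptable
    group = authenticate(feature_code)     # new third-party authentication
    result = IdentificationResult(feature_code, group, now)
    cache[feature_code] = result
    return result
```

Keying the cache by the extracted feature code mirrors the text's use of that code as a unique identifier for retrieving the user's identity.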
  • Step 102, according to the user's identity and the preset correspondence between user identities and research and judgment rules, obtain the target research and judgment rules corresponding to the user's identity.
  • The server first judges, according to the user's identity, whether the video content sent to the user needs to be detected and processed. If so, it obtains the target research and judgment rules corresponding to the user's identity according to the pre-stored correspondence between user identities and research and judgment rules.
  • The target research and judgment rules determined by the server include audio research and judgment rules, video research and judgment rules, or multidimensional research and judgment rules.
  • By performing audio, video, or multidimensional illegal content detection on video content, detection accuracy is ensured while detection remains as timely as possible, avoiding excessive delay caused by health processing of the video content.
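Step 102 amounts to a table lookup preceded by a decision on whether detection is needed at all. The sketch below is hypothetical: the group labels, the table contents of `RULES_BY_GROUP`, and the convention that a missing entry means "no detection required" are assumptions made for illustration, not values given in the application.

```python
from typing import Dict, Optional

# Hypothetical preset correspondence between user groups and the kind of
# research and judgment rules to apply (audio, video, or multidimensional).
RULES_BY_GROUP: Dict[str, str] = {
    "child": "multidimensional",
    "teenager": "video",
    "elderly": "audio",
}


def get_target_rules(group: str) -> Optional[str]:
    """Return the rule type for a user group, or None when no detection is needed.

    Treating groups absent from the table as exempt is an assumption.
    """
    return RULES_BY_GROUP.get(group)
```

A cheaper rule type for some groups reflects the trade-off the text describes between detection accuracy and added delay.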
  • Step 103 Perform illegal content detection on the video content requested by the user according to the target research and judgment rules, block the illegal content of the video content according to the detection result, and display the blocked video content to the user.
  • After the server determines the target research and judgment rules, it performs illegal content detection on the video content requested by the user according to those rules and determines whether the requested video content contains anything that does not meet them. Based on the detection results, content that does not meet the target research and judgment rules is blocked, and the blocked video content is then displayed to the user.
  • The illegal content detection that the server performs on the video content according to the target research and judgment rules includes one or any combination of the following: sensitive word detection, illegal action detection, and illegal screen detection.
  • The blocking that the server applies to illegal content according to the detection results includes one or any combination of the following: muting, deleting, or replacing sensitive words; masking, deleting, or replacing illegal actions; and masking, deleting, or replacing illegal screens. Illegal content is handled effectively by blocking, deleting, or replacing it.
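The three handling options for sensitive words (mute, delete, replace) can be illustrated on a text transcript. This sketch assumes the audio has already been transcribed to text, which the application does not state; the function name `block_sensitive_words`, the `mode` parameter, and the use of asterisks to stand for muted speech are likewise illustrative assumptions.

```python
import re
from typing import Iterable


def block_sensitive_words(text: str, sensitive_words: Iterable[str],
                          mode: str = "mute", replacement: str = "***") -> str:
    """Apply one of the three handling options to every sensitive word in text."""
    words = [w for w in sensitive_words if w]
    if not words:
        return text
    # Escape each word so it is matched literally, case-insensitively.
    pattern = re.compile("|".join(re.escape(w) for w in words), re.IGNORECASE)
    if mode == "mute":
        # Stand-in for muting: overwrite the word with same-length asterisks.
        return pattern.sub(lambda m: "*" * len(m.group()), text)
    if mode == "delete":
        return pattern.sub("", text)
    if mode == "replace":
        return pattern.sub(lambda m: replacement, text)
    raise ValueError(f"unknown mode: {mode}")
```

The same three-way choice would apply to illegal actions and screens, with masking of image regions taking the place of muting.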
  • Step 501: after the user terminal connects, start the research and judgment server and synchronize the target research and judgment rules to it.
  • The video conferencing server instructs the research and judgment server to start. The research and judgment server then sends a rule request to the rule server and, according to the rule server's response, synchronizes the target audio research and judgment rules, target video research and judgment rules, or target multi-dimensional research and judgment rules to its cache.
  • The audio research and judgment module and the video research and judgment module then use the latest rules to judge the video content requested by the user.
  • Step 502: create an audio and video media communication port for detecting and processing violating video content.
  • The research and judgment server includes a research and judgment control module, a video research and judgment module, and an audio research and judgment module.
  • the first communication port of audio and video media allocated by the video research and judgment module is sent to the user terminal.
  • the user terminal carries the audio and video media communication port information allocated by the audio and video research and judgment modules of the research and judgment server, and initiates a join request to the video conference server.
  • the video conference server returns the second communication port information of the audio and video media allocated by the video conference server to the user terminal.
  • The access terminal sends an audio and video research and judgment request message to the control module of the research and judgment server. The request carries the terminal's audio and video channel information and the server-side second communication port information of the audio and video media returned by the video conference server, so that the research and judgment server can establish the end-to-end audio and video media communication flow.
  • Step 503: the research and judgment server acquires video content through the audio and video media communication port, and detects and processes illegal content in the video content.
  • The research and judgment server establishes an audio research and judgment channel on the audio research and judgment module and receives the audio media data sent by the access terminal through the designated channel. The audio media is examined according to the audio research and judgment rules (mainly sensitive word detection) and processed based on the detection result. Audio media processing can be recording, alerting, prohibiting sending, and so on.
  • The research and judgment server establishes a video research and judgment channel on the video research and judgment module and receives the video media data sent by the access terminal through the designated channel. The video research and judgment module examines the video media according to the video research and judgment rules, mainly to detect illegal pictures and illegal actions in the video, and processes the video media according to the detection result. Video media processing can be occlusion, deletion, replacement, and so on.
  • The research and judgment server establishes an audio communication channel with the video conference server according to the audio communication port information returned by the video conference server.
  • The audio media processed under the audio research and judgment rules is sent to the video conferencing server through this audio communication channel, and the video conferencing server performs the next step of processing.
  • Audio research and judgment uses NLP intelligent-learning voiceprint technology; the downlink audio media is likewise judged and then sent to the access terminal for display.
  • The research and judgment server establishes a video communication channel with the video conference server according to the video communication port information returned by the video conference server.
  • On the one hand, the multi-dimensional video media processed under the multi-dimensional video research and judgment rules is sent to the video conferencing server through this video communication channel, and the video conferencing server performs the next step of processing. On the other hand, the downlink video media from the video conferencing server is judged according to the multi-dimensional video research and judgment rules, mainly its pictures and actions, and the judged downlink multi-dimensional video media is sent to the access terminal for display.
  • video media can be two-dimensional video media or multi-dimensional video media.
  • Multi-dimensional video mainly refers to 3D video media and video media transmitted in more dimensions.
  • Normal video media is 2D video media.
  • joint detection, multi-dimensional recognition and multi-dimensional reconstruction technologies are mainly used for multi-dimensional research and judgment.
  • After the server shows the user the video content with illegal content blocked, the method further includes: periodically obtaining the user's identity and determining the target research and judgment rules from the currently obtained identity; when the target research and judgment rules change, performing illegal-content detection on the video content according to the changed rules.
  • After displaying the video content to the user, the server periodically re-identifies the user at a preset interval, re-determines the target research and judgment rules from the currently obtained identity, and checks them for consistency against the previously obtained target research and judgment rules.
  • If they differ, the target research and judgment rules re-determined from the currently obtained identity are used for illegal-content detection, and subsequent display is based on the changed rules.
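The periodic re-identification and consistency check can be sketched as a small monitor object. This is only an illustration under stated assumptions: `RuleMonitor`, `identify_user`, and `rule_for_identity` are hypothetical names, and the preset interval is left to whatever scheduler calls `tick()`.

```python
class RuleMonitor:
    """Tracks the target rule in force and swaps it when the viewer changes."""

    def __init__(self, identify_user, rule_for_identity):
        # Both callables are assumptions standing in for the identification
        # step and the preset identity-to-rule correspondence.
        self._identify_user = identify_user
        self._rule_for_identity = rule_for_identity
        self.current_rule = rule_for_identity(identify_user())

    def tick(self):
        """Run one periodic check; return True if the target rule changed."""
        new_rule = self._rule_for_identity(self._identify_user())
        changed = new_rule != self.current_rule
        if changed:
            # Later detection and display use the re-determined rule.
            self.current_rule = new_rule
        return changed
```

A caller would invoke `tick()` once per preset interval and re-run detection whenever it returns `True`.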
  • The server also dynamically updates the research and judgment rules in the preset correspondence according to the content type of the video content. Specifically, after the server has detected and processed illegal content and detects that the content type of the video content in the video conferencing scene has been updated, it uses deep learning or a preset neural network model to generate, from the newly added video content, a hot patch for the corresponding research and judgment rules. It then dynamically maintains the existing rules through the generated hot patch and updates the preset correspondence between research and judgment rules and terminals.
  • Maintaining the research and judgment rules dynamically as hot patches keeps them timely and correct.
  • the illegal content detection of video content can be carried out through the research and judgment server deployed on the cloud, or directly through the research and judgment module deployed on the smart terminal according to the obtained target research and judgment rules.
  • Deployment in the cloud can make video content healthy even for traditional access terminals and optimizes global processing resources; deployment on smart terminals can more effectively ensure the purity of the uplink and downlink streams.
  • the architecture and deployment methods of the video conferencing system can be adjusted as needed. This embodiment does not limit specific deployment.
  • Another aspect of the embodiments of the present application provides a video content display device which, referring to FIG. 6, includes:
  • the receiving module 601 is configured to receive a user's video content acquisition request and acquire the user's identity.
  • the obtaining module 602 is used to obtain the target research and judgment rules corresponding to the user's identity according to the user's identity and the preset corresponding relationship between the user's identity and the research and judgment rules; wherein, the research and judgment rules are used to detect whether there is illegal content in the video content.
  • the processing module 603 is configured to detect the illegal content of the video content requested by the user according to the target research and judgment rules, block the illegal content of the video content according to the detection result, and display the blocked video content to the user.
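The three logical modules can be pictured as one object that wires three injected callables together. The sketch below is an assumption about how such a device might be composed; the class name, field names, and request shape are hypothetical and chosen only to mirror modules 601 to 603.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class VideoContentDisplayDevice:
    # Receiving module 601: extracts the user's identity from a request.
    identify: Callable[[Dict[str, str]], str]
    # Obtaining module 602: maps an identity to its target rule.
    lookup_rule: Callable[[str], List[str]]
    # Processing module 603: detects and blocks content under that rule.
    screen: Callable[[str, List[str]], str]

    def handle(self, request: Dict[str, str]) -> str:
        """Run one request through modules 601, 602 and 603 in order."""
        identity = self.identify(request)
        rule = self.lookup_rule(identity)
        return self.screen(request["content"], rule)
```

Keeping the modules as injected callables reflects the later remark that they are logical modules: each could be one physical unit, part of one, or a combination of several.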
  • this embodiment is an apparatus embodiment corresponding to the method embodiment, and this embodiment can be implemented in cooperation with the method embodiment.
  • the relevant technical details mentioned in the method embodiments are still valid in this embodiment, and will not be repeated here in order to reduce repetition.
  • the related technical details mentioned in this embodiment can also be applied in the method embodiment.
  • modules involved in this embodiment are logical modules.
  • a logical unit can be a physical unit, a part of a physical unit, or a combination of multiple physical units.
  • units that are not closely related to solving the technical problem proposed in the present application are not introduced in this embodiment, but this does not mean that there are no other units in this embodiment.
  • Another aspect of the embodiments of the present application provides an electronic device which, referring to FIG. 7, includes: at least one processor 701; and a memory 702 communicatively connected to the at least one processor 701; wherein the memory 702 stores instructions executable by the at least one processor 701, and the instructions are executed by the at least one processor 701 so that the at least one processor 701 can execute the video content display method described in any one of the above method embodiments.
  • the memory 702 and the processor 701 are connected by a bus, and the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors 701 and various circuits of the memory 702 together.
  • the bus may also connect together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art and therefore will not be further described herein.
  • the bus interface provides an interface between the bus and the transceivers.
  • a transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing means for communicating with various other devices over a transmission medium.
  • the data processed by the processor 701 is transmitted on the wireless medium through the antenna, and further, the antenna also receives the data and transmits the data to the processor 701 .
  • the processor 701 is responsible for managing the bus and general processing, and may also provide various functions including timing, peripheral interface, voltage regulation, power management and other control functions. And the memory 702 may be used to store data used by the processor 701 when performing operations.
  • Embodiments of the present application also provide a computer-readable storage medium storing a computer program.
  • the above method embodiments are implemented when the computer program is executed by the processor.
  • a storage medium includes several instructions that cause a device (which may be a single-chip microcomputer, a chip, etc.) or a processor to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include media that can store program code, such as a USB flash drive, removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk, or optical disc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The present application relates to the technical field of communications. Disclosed are a video content display method and apparatus, and an electronic device and a storage medium. The method comprises: receiving a video content acquisition request of a user, and acquiring the identity of the user; according to the identity of the user and a preset correspondence between the identity of the user and research and judgment rules, acquiring a target research and judgment rule corresponding to the identity of the user; and according to the target research and judgment rule, performing violation content detection on video content which is requested by the user, performing violation content shielding on the video content according to a detection result, and displaying, to the user, the video content after having been subjected to violation content shielding.

Description

Video content display method, device, electronic device and storage medium
Related Application
This application claims priority to Chinese patent application No. 202111619546.1, filed on December 27, 2021, the entire contents of which are incorporated herein by reference.
Technical Field
The embodiments of the present application relate to the field of communication technologies, and in particular to a video content display method, device, electronic device, and storage medium.
Background
In the 5G era, data transmission rates are much higher and transmission latency is further reduced, so a wide variety of devices can be flexibly supported for communication and interaction. For example, in addition to mobile phones and tablets, 5G networks can also support access by wearable devices. 5G networks will improve the end-to-end experience and performance, and people's requirements for audio and video communication keep rising. On the one hand, access terminals are becoming more varied, including mobile phones, tablets, TVs, and wearable devices, and even augmented reality (Augmented Reality, AR), virtual reality (Virtual Reality, VR), and mixed reality (Mixed Reality, MR) devices will gradually be connected. On the other hand, the application scenarios of video conferencing are becoming broader, and participants will expand from a single group to people of all ages. Video conferencing application scenarios are not limited to multi-party video conferences; they also include video content sharing scenarios with multiple participants, such as live video conferences, interactive distance-education video conferences, interactive telemedicine video conferences, and XR video conferences.
In existing video conferencing application systems, once a terminal connects, the current video content is shared directly with the user who connected through that terminal device. When that user is a child, a teenager, or an elderly person, joining the video conferencing scenario may expose them to unhealthy content, and they may even suffer serious physical and mental harm.
Summary
The main purpose of the embodiments of the present application is to propose a video content display method, device, electronic device, and storage medium that detect and block illegal content in video content, so that users are not physically or mentally harmed by receiving unhealthy video content in a video conferencing scenario.
To achieve the above purpose, an embodiment of the present application provides a video content display method, including: receiving a user's video content acquisition request and acquiring the identity of the user; acquiring, according to the identity of the user and a preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the identity of the user; and performing illegal-content detection on the video content requested by the user according to the target research and judgment rules, blocking illegal content in the video content according to the detection result, and displaying to the user the video content after the illegal content has been blocked.
To achieve the above purpose, an embodiment of the present application further provides a video content display device, including: a receiving module, configured to receive a user's video content acquisition request and acquire the identity of the user; an obtaining module, configured to acquire, according to the identity of the user and a preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the identity of the user; and a processing module, configured to perform illegal-content detection on the video content requested by the user according to the target research and judgment rules, block illegal content in the video content according to the detection result, and display to the user the video content after the illegal content has been blocked.
To achieve the above purpose, an embodiment of the present application further provides an electronic device, including: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can execute the video content display method described above.
To achieve the above purpose, an embodiment of the present application further proposes a computer-readable storage medium storing a computer program, and the computer program implements the video content display method described above when executed by a processor.
Description of Drawings
One or more embodiments are illustrated by the figures in the corresponding accompanying drawings; these illustrations do not limit the embodiments.
FIG. 1 is a flow chart of the video content display method in an embodiment of the present application;
FIG. 2 is a schematic structural diagram of the video conferencing system in an embodiment of the present application;
FIG. 3 is a flow chart of a non-5G terminal accessing a video conferencing scenario in an embodiment of the present application;
FIG. 4 is a flow chart of a 5G terminal accessing a video conferencing scenario in an embodiment of the present application;
FIG. 5 is a flow chart of the illegal-content detection and processing method in an embodiment of the present application;
FIG. 6 is a schematic structural diagram of the video content display device in another embodiment of the present application;
FIG. 7 is a schematic structural diagram of the electronic device in another embodiment of the present application.
Detailed Description
As the background shows, current video content display methods for video conferencing scenarios display video content directly to the user once the user connects through a terminal device. Unhealthy video content may adversely affect the physical and mental health of groups such as teenagers, children, and the elderly. How to prevent video content in video conferencing scenarios from affecting or even harming the well-being and development of vulnerable groups is therefore an urgent technical problem.
To solve the above problem, an embodiment of the present application provides a video content display method, including: receiving a user's video content acquisition request and acquiring the user's identity; acquiring, according to the user's identity and a preset correspondence between user identities and research and judgment rules, the target research and judgment rules corresponding to the user's identity; and performing illegal-content detection on the video content requested by the user according to the target research and judgment rules, blocking illegal content in the video content according to the detection result, and displaying to the user the video content after the illegal content has been blocked.
In the video content display method provided by the embodiments of the present application, when the video conferencing application system receives a user's video content acquisition request, it identifies the user who is requesting the video content and acquires the user's identity. It then obtains the target research and judgment rules corresponding to that identity, detects and blocks illegal content in the video content according to those rules, and displays to the user the video content after the illegal content has been blocked. Because the user is identified when the request is received and research and judgment rules appropriate to that identity are selected for violation detection and processing, video content in the video conferencing scenario is intelligently judged and made healthy before it is displayed. This ensures as far as possible that the video content the user watches is positive, prevents unhealthy content from affecting or even harming the user, and protects the user's physical and mental health and user experience.
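The preset correspondence between user identities and research and judgment rules can be pictured as a simple lookup. The sketch below is illustrative only: the age bands, the rule names, and the `Identity` and `target_rule` names are assumptions, since the application does not specify how the correspondence is stored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Identity:
    age: int  # the application mentions age and gender; only age is used here


# Hypothetical preset correspondence: (minimum age, maximum age, rule name).
PRESET_CORRESPONDENCE = [
    (0, 12, "child_rules"),
    (13, 17, "teen_rules"),
    (18, 64, "adult_rules"),
    (65, 150, "senior_rules"),
]


def target_rule(identity):
    """Return the rule name whose age band contains the user's age."""
    for low, high, rule in PRESET_CORRESPONDENCE:
        if low <= identity.age <= high:
            return rule
    raise ValueError(f"no rule configured for age {identity.age}")
```

For instance, `target_rule(Identity(age=10))` selects the child band, so stricter detection would apply before the content is shown.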
To make the purposes, technical solutions, and advantages of the embodiments of the present application clearer, the embodiments are described in detail below with reference to the accompanying drawings. Those of ordinary skill in the art will understand that many technical details are given in the embodiments so that the reader can better understand the application. However, the technical solutions claimed in this application can be realized even without these technical details and with various changes and modifications based on the following embodiments. The division into embodiments below is for convenience of description and should not limit the specific implementation of the present application; the embodiments can be combined with and refer to one another where they do not contradict.
The implementation details of the video content display method described in this application are explained below with reference to specific embodiments. The following content is provided only to aid understanding and is not required for implementing this solution.
A first aspect of the embodiments of the present application provides a video content display method; for its specific flow, refer to FIG. 1. In some embodiments, the video content display method is applied to a terminal deployed in a video conferencing system. The terminal can be any electronic device with communication and processing functions, such as a mobile phone, computer, or server. This embodiment is described using a server as an example. The video content display method includes at least, but is not limited to, the following steps:
Step 101: receive a user's video content acquisition request and acquire the user's identity.
Specifically, after receiving the user's request to access the current video conferencing scenario and obtain video content, the server prompts the user to enter identity verification information and identifies the user according to the information entered. The user's identity may include information such as age and gender.
In one example, before acquiring the user's identity, the server also: acquires the user's terminal type; and obtains a target identification method according to the terminal type. Acquiring the user's identity then includes acquiring the user's identity according to the target identification method. Specifically, for a schematic structural diagram of the video conferencing system, refer to FIG. 2. The system includes the access terminal used by the user and a server, and the server can consist of several different sub-servers, for example a video conference server and a rule server deployed in the central cloud of the core network, and a terminal identification server and a research and judgment server deployed in the edge cloud. After the access terminal used by the user starts, it sends a start message to the terminal identification server, which starts the terminal identification server. Then, when the access terminal initiates a video content acquisition request to the video conference server, the terminal identification server identifies the type of the access terminal and obtains the terminal type of the user terminal. After determining the terminal type, the terminal identification server requests the corresponding target identification method from the rule server according to the terminal type, sends an identification instruction to the user according to the target identification method returned by the rule server, and then identifies the user according to the identity verification information the user enters. Selecting the user identification method according to the user's terminal type ensures accurate identification while widening the range of terminal types with which the video content display method is compatible.
In an embodiment, the server obtains the target identification method according to the terminal type as follows. When the terminal type is a non-5G terminal, acquiring the user's identity according to the target identification method includes one or any combination of the following: identifying the user from the user's login information, identifying the user from the user's voice, and identifying the user from the user's video image. When the terminal type is a 5G terminal, acquiring the user's identity according to the target identification method includes: identifying the user from the user's fingerprint information and/or identifying the user from the user's iris information. Specifically, after the server obtains the user's terminal type, the terminal identification server obtains the corresponding target identification method according to that type and acquires the user's identity according to that method. The terminal identification server may include several components, for example a terminal type identification control component, an audio identification component, and a video identification component. When the user's terminal type is a non-5G terminal, the terminal identification server requests an audio recognition algorithm, a video recognition algorithm, or a configuration information recognition algorithm from the rule server. Using the recognition algorithm contained in the rule server's response, it identifies the user from the login information supplied at login or from a real-time audio or video signal that the user is asked to provide, and determines the group to which the user belongs. When the user's terminal type is a 5G terminal, the terminal identification server requests a fingerprint recognition algorithm or an iris recognition algorithm from the rule server and, using the recognition algorithm contained in the rule server's response, identifies the user from fingerprint or iris information collected in real time. Using different identification methods for different terminal types ensures, on the one hand, that existing terminals can still serve as access terminals, which protects the investment of operators and consumers, and, on the other hand, widens the range of terminals that can connect, so that the video content display method is broadly applicable.
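The choice of identification method by terminal type amounts to a dispatch table. The sketch below is a hedged illustration: the keys, the method labels, and the `target_identification_methods` name are hypothetical, and in the described system this lookup would be answered by the rule server rather than by a local dictionary.

```python
# Hypothetical table mirroring the two cases described above.
IDENTIFICATION_METHODS = {
    "non_5g": ("login_info", "voice", "video_image"),
    "5g": ("fingerprint", "iris"),
}


def target_identification_methods(terminal_type):
    """Return the identification methods allowed for a terminal type."""
    try:
        return IDENTIFICATION_METHODS[terminal_type]
    except KeyError:
        # An unrecognised terminal type is rejected rather than guessed at.
        raise ValueError(f"unknown terminal type: {terminal_type!r}") from None
```

Because the methods for a type can be used alone or in combination, the function returns all candidates and leaves the selection to the caller.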
It is worth mentioning that when the user's terminal type is a 5G terminal, the user is identified mainly through biometric features. The biometric feature may be an iris or a fingerprint, or any other biometric information capable of identifying the user; this embodiment does not limit the specific biometric information used.
It should be noted that non-5G terminals may include traditional terminals and smart terminals. Traditional terminals are mainly the hardware terminal devices traditionally used to access video conference scenarios; they are dedicated video conferencing equipment, widely used by industry users such as government bodies and enterprises, and will continue to be used as dedicated video conferencing assets. Smart terminals are mainly devices such as smart TVs, smart set-top boxes, and smartphones, on which smart applications can be installed independently so that the device accesses the video conference scenario as a video conference client. 5G terminals are mainly high-end smart watches and AR, VR, and MR devices. These categories are given only for ease of understanding and distinction and do not limit the network that a terminal may use. In addition, the user may be identified using a single algorithm or a combination of multiple algorithms, which is not limited in this embodiment.
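The selection of identification methods by terminal type described above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the names `TerminalType`, `METHODS_BY_TERMINAL`, and `select_identification_methods`, and the idea of filtering by what the terminal can actually capture, are assumptions introduced here.

```python
from enum import Enum
from typing import Iterable, List


class TerminalType(Enum):
    NON_5G = "non_5g"
    FIVE_G = "5g"


# Method names are hypothetical labels for the methods listed in the text.
METHODS_BY_TERMINAL = {
    TerminalType.NON_5G: ("login_info", "voice", "video_image"),
    TerminalType.FIVE_G: ("fingerprint", "iris"),
}


def select_identification_methods(
    terminal_type: TerminalType, available: Iterable[str]
) -> List[str]:
    """Return the methods allowed for this terminal type that the terminal
    can supply, in the order listed above. One or several may be combined."""
    available_set = set(available)
    chosen = [m for m in METHODS_BY_TERMINAL[terminal_type] if m in available_set]
    if not chosen:
        raise ValueError(f"no usable identification method for {terminal_type.name}")
    return chosen
```

For example, a 5G terminal that reports only an iris scanner would be assigned `["iris"]`, while a non-5G terminal offering login information and a microphone would be assigned `["login_info", "voice"]`.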
For the flow of a non-5G terminal accessing a video conference scenario, refer to FIG. 3. The flow includes at least, but is not limited to, the following steps:
Step 301: after starting up, the non-5G terminal sends a start message to the terminal identification server, which starts the terminal identification server.
Step 302: the terminal identification server determines the target identification method, obtains the corresponding identification rules, and issues an identification instruction.
Specifically, the terminal identification server requests identification rules from the rule server and, according to the rule server's response, synchronizes the corresponding configuration information identification rules, audio identification rules, and video identification rules from the rule server into its cache; the audio identification module and the video identification module then use the latest rules to identify the terminal user. The terminal identification server selects the target identification method for identifying the user of the connected non-5G terminal and issues an identification instruction to the access terminal according to the selected method.
Step 303: the non-5G terminal uploads user identity information according to the identification instruction.
Specifically, according to the received identification instruction, the non-5G terminal determines whether the group to which its user belongs is to be identified directly from configuration information, through real-time audio identification, or through real-time video identification. It then prompts the user to input identity information and uploads the user identity information to the terminal identification server. The groups to which a user may belong include children, teenagers, and the elderly, and may be further divided into boys, girls, male teenagers, female teenagers, elderly men, and elderly women.
Step 304: the terminal identification server identifies the user according to the received user identity information and completes the access of the non-5G terminal.
Specifically, the terminal identification server includes multiple modules, for example an identification control module, an audio identification module, and a video identification module. If the user is identified from the login information provided at login, the non-5G terminal sends the user's login information directly to the terminal identification server. The terminal identification server identifies the user according to the login information and the obtained configuration identification rules and sends the identification result to the video conference server, which saves the result in the terminal information on the server for use in subsequent video content identification.
If the user is identified by voice, the non-5G terminal sends an identification request message, which carries the terminal's audio channel information, to the identification control module of the terminal identification server. The identification control module establishes an audio identification channel on the audio identification module. The non-5G terminal then sends the captured user identification audio to the audio identification module over the designated audio identification channel; the audio identification module performs audio identification using NLP intelligent learning and voiceprint technology and sends the identification result to the identification control module. The identification control module sends the result to the non-5G terminal and to the video conference server, which saves it in the terminal information on the server for use in subsequent video content identification.
If the user is identified from the user's video image, the non-5G terminal sends a video identification request message, which carries the terminal's video channel information, to the identification control module of the terminal identification server. The identification control module establishes a video identification channel on the video identification module. The non-5G terminal then sends the captured user identification video to the video identification module over the designated video identification channel; the video identification module identifies the video according to the video identification rules and sends the identification result to the identification control module. The terminal identification server sends the result to the non-5G terminal and to the video conference server, which saves it in the terminal information on the server for use in subsequent video content identification.
It is worth mentioning that identification based on login information relies mainly on user identity information entered at login, such as an ID or identity card number. Identification based on the user's voice mainly collects the user's voiceprint features; these are determined mainly by timbre and can be used to distinguish the user's gender and age group. Identification based on the user's video image may recognize the face directly and extract a facial feature code to determine the user's age and gender, or may use the face image to look up the user's identity in a registration information database. This embodiment does not limit the specific identification information or identification method used.
For the flow of a 5G terminal accessing a video conference scenario, refer to FIG. 4. The flow includes at least, but is not limited to, the following steps:
Step 401: after starting up, the 5G terminal sends a start message to the terminal identification server, which starts the terminal identification server.
Step 402: the terminal identification server determines the target identification method, obtains the corresponding identification rules, and issues an identification instruction.
Specifically, the terminal identification server includes multiple modules, for example an identification control module and a biometric identification module. The terminal identification server requests identification rules from the rule server and, according to the rule server's response, synchronizes the corresponding biometric identification rules and algorithms from the rule server into its cache; it then calls the biometric identification module, which uses the latest biometric identification rules to identify the terminal user. The biometric identification rules here may be iris feature identification rules and normalization algorithms, fingerprint feature identification rules and normalization algorithms, or other biometric identification rules and normalization algorithms.
Step 403: the 5G terminal uploads user identity information according to the identification instruction.
Specifically, according to the received identification instruction, the 5G terminal determines whether the group to which the user belongs is to be identified by fingerprint or by iris, prompts the user to input the corresponding biometric information, and uploads the user identity information to the terminal identification server.
Step 404: the terminal identification server identifies the user according to the received user identity information and completes the access of the 5G terminal.
Specifically, after the 5G terminal determines that the user's group is to be identified through biometrics, it sends a biometric identification request message, which carries the terminal's biometric identification channel information, to the identification control module of the terminal identification server. The identification control module establishes a biometric identification channel on the biometric identification module. The 5G terminal then sends the captured user biometric information over the biometric identification transmission channel to the biometric identification module in the terminal identification server. Using the biometric identification rules and algorithms, the biometric identification module processes and normalizes the initial biometric information and extracts a biometric feature code. The feature code is sent to the application server as a unique identification code, and the application server uses it to look up and confirm the user's identity. The terminal identification server then sends the identification result to the application server, the 5G terminal, and the video conference server. The application server and the video conference server save the identification result in the terminal information on the server for use in subsequent video content identification.
It is worth mentioning that when the application server performs the lookup by feature code, if it detects that the user has already completed biometric identification and the generation date of the previous identification result meets the identification requirements, the previous result is reused directly as the result of the current identification, which completes the current biometric identification. If it detects that the user has not completed biometric identification, or no identification result meeting the requirements is matched, the biometric identification module sends the biometric identification data to a third-party identity authentication server for a fresh identity authentication. The third-party identity authentication server returns the authentication result and personal information to the terminal identification server.
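The reuse of a previous biometric result described above can be sketched as a freshness check in front of the third-party authentication call. This is only an illustration under stated assumptions: the text says the generation date must "meet the identification requirements" without defining them, so the 30-day `max_age` default, the `IdentificationResult` type, and the `authenticate` callback standing in for the third-party server are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, Dict


@dataclass
class IdentificationResult:
    feature_code: str
    group: str            # e.g. "child", "teenager", "elderly"
    generated_at: datetime


def identify_with_reuse(
    feature_code: str,
    cache: Dict[str, IdentificationResult],
    authenticate: Callable[[str], IdentificationResult],
    now: datetime,
    max_age: timedelta = timedelta(days=30),  # assumed requirement
) -> IdentificationResult:
    """Reuse the previous result when it is recent enough; otherwise ask the
    (hypothetical) third-party authentication callback and store its answer."""
    cached = cache.get(feature_code)
    if cached is not None and now - cached.generated_at <= max_age:
        return cached
    fresh = authenticate(feature_code)
    cache[feature_code] = fresh
    return fresh
```

Passing `now` explicitly keeps the check deterministic; a deployment would presumably supply the current server time.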
Step 102: according to the user's identity and the preset correspondence between user identities and research and judgment rules, obtain the target research and judgment rule corresponding to the user's identity.
Specifically, after identifying the user, the server first determines, according to the user's identity, whether the video content to be sent to the user needs to be detected and processed. If it does, the server obtains the target research and judgment rule corresponding to the user's identity from the pre-stored preset correspondence between user identities and research and judgment rules.
In an example, the target research and judgment rules determined by the server include audio research and judgment rules, video research and judgment rules, or multidimensional research and judgment rules. Performing audio, video, or multidimensional violation detection on the video content keeps detection accurate while keeping it as timely as possible, so that making the video content healthy does not introduce excessive delay.
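Step 102 amounts to a lookup in a preset correspondence, where the absence of an entry can stand for "no detection needed". The sketch below assumes a simple in-memory table; the group names and the assignment of rule kinds to groups are invented for illustration and are not specified in the text.

```python
from typing import Dict, FrozenSet, Optional

# Hypothetical preset correspondence between user groups and rule kinds.
RULES_BY_GROUP: Dict[str, FrozenSet[str]] = {
    "child": frozenset({"audio", "video", "multidimensional"}),
    "teenager": frozenset({"audio", "video"}),
    "elderly": frozenset({"audio"}),
}


def target_rule_kinds(group: str) -> Optional[FrozenSet[str]]:
    """Return the rule kinds for this group, or None when the content for
    this group does not need to be detected and processed."""
    return RULES_BY_GROUP.get(group)
```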
Step 103: perform violation detection on the video content requested by the user according to the target research and judgment rule, block the violating content in the video content according to the detection result, and display the video content with the violating content blocked to the user.
Specifically, after determining the target research and judgment rule, the server performs violation detection on the video content requested by the user according to that rule and determines whether the requested content contains anything that does not comply with it. According to the detection result, the server blocks the non-compliant content and then displays the resulting video content to the user. Processing the video content in this way according to the target research and judgment rule prevents unhealthy, violating content from affecting or harming the user physically or mentally, and improves the user experience.
In an example, the server's violation detection on the video content according to the target research and judgment rule includes one or any combination of the following: sensitive word detection, violating action detection, and violating picture detection. Detecting the sensitive words, violating actions, and violating pictures that may exist in the video content according to the target research and judgment rule allows violating content to be identified accurately and ensures that the content is processed accurately.
In an embodiment, the server's blocking of violating content according to the detection result includes one or any combination of the following: muting, deleting, or replacing sensitive words; pixelating, deleting, or replacing violating actions; and pixelating, deleting, or replacing violating pictures. Occluding, deleting, or replacing the violating content handles it effectively.
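As one illustration of the "muting sensitive words" option, the sketch below turns a time-aligned transcript into a list of intervals to silence. It assumes that some upstream speech recognizer (not described in the text) has already produced `(word, start_seconds, end_seconds)` tuples; the function name and data shape are hypothetical, and the actual audio editing is left out.

```python
from typing import Iterable, List, Set, Tuple

Word = Tuple[str, float, float]  # (text, start_seconds, end_seconds)


def mute_intervals(words: Iterable[Word], sensitive: Set[str]) -> List[Tuple[float, float]]:
    """Return merged time intervals covering every sensitive word, so that
    adjacent or overlapping hits are muted as one span."""
    lowered = {w.lower() for w in sensitive}
    hits = sorted((start, end) for text, start, end in words if text.lower() in lowered)
    merged: List[Tuple[float, float]] = []
    for start, end in hits:
        if merged and start <= merged[-1][1]:
            # Touches or overlaps the previous span: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged
```

Deleting or replacing the audio instead of muting it would reuse the same intervals; only the action applied to each span changes.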
For the flow of violating content detection and processing, refer to FIG. 5. The flow includes at least, but is not limited to, the following steps:
Step 501: after the user terminal is connected, start the research and judgment server and synchronize the target research and judgment rules to it.
Specifically, after the user terminal is successfully connected, the video conference server instructs the research and judgment server to start. The research and judgment server then sends a research and judgment rule request to the rule server and, according to the rule server's response, synchronizes the target audio research and judgment rules, target video research and judgment rules, or target multidimensional research and judgment rules from the rule server into its cache. The audio research and judgment module and the video research and judgment module use the latest rules to assess the video content requested by the user.
Step 502: create the audio/video media communication ports used for violating content detection and processing of the video content.
Specifically, the research and judgment server includes a research and judgment control module, a video research and judgment module, and an audio research and judgment module. When violation detection and processing is to be performed on video content, the research and judgment server first allocates resources to the audio and video research and judgment modules and then sends the first audio/video media communication ports allocated by those modules to the user terminal. The user terminal initiates a join request to the video conference server, carrying the port information allocated by the audio and video research and judgment modules. After negotiation is completed, the video conference server returns to the user terminal the information of the second audio/video media communication ports allocated on the video conference server side. The access terminal then sends an audio/video research and judgment request message to the research and judgment control module of the research and judgment server; the request carries the terminal's audio/video channel information and the second port information returned by the video conference server, so that the research and judgment server can establish the end-to-end audio/video media communication flow.
Step 503: the research and judgment server obtains the video content through the audio/video media communication ports and performs violating content detection and processing on it.
Specifically, the research and judgment server establishes an audio research and judgment channel on the audio research and judgment module and receives the audio media data sent by the access terminal over the designated channel. The audio research and judgment module assesses the audio (for example, by sensitive word detection) using NLP intelligent learning and voiceprint technology and processes the audio media according to the detection result. Audio media processing may be recording, raising an alarm, prohibiting transmission, and so on. The research and judgment server likewise establishes a video research and judgment channel on the video research and judgment module and receives the video media data sent by the access terminal over the designated channel. The video research and judgment module assesses the video media according to the video research and judgment rules, mainly by detecting violating pictures and violating actions in the video, and processes the video media according to the detection result. Video media processing may be occlusion, deletion, replacement, and so on.
That is, the research and judgment server establishes an audio communication channel with the video conference server according to the audio communication port information returned by the video conference server. In the uplink direction, the audio media processed under the audio research and judgment rules is sent over this channel to the video conference server for further processing. In the downlink direction, the audio media coming from the video conference server is assessed according to the audio research and judgment rules using NLP intelligent learning and voiceprint technology, and the assessed downlink audio media is sent to the access terminal for presentation. Similarly, the research and judgment server establishes a video communication channel with the video conference server according to the video communication port information returned by the video conference server. In the uplink direction, the multidimensional video media processed under the multidimensional video research and judgment rules is sent over this channel to the video conference server for further processing. In the downlink direction, the video media coming from the video conference server is assessed according to the multidimensional video research and judgment rules, mainly with respect to the pictures and actions in the video, and the assessed downlink multidimensional video media is sent to the access terminal for display.
It is worth mentioning that the video media may be two-dimensional or multidimensional. Multidimensional video mainly refers to 3D video media and video media transmitted in more dimensions, whereas ordinary video media is two-dimensional. For multidimensional video, multidimensional research and judgment mainly uses joint detection, multidimensional recognition, and multidimensional reconstruction techniques.
In another example, after displaying the video content with the violating content blocked to the user, the server further: periodically obtains the user's identity and determines the target research and judgment rule according to the currently obtained identity; and, when the target research and judgment rule has changed, performs violation detection on the video content according to the changed rule. After displaying the video content to the user, the server re-identifies the user periodically at a preset interval, re-determines the target research and judgment rule according to the currently obtained identity, and checks whether the newly determined rule is consistent with the previously obtained one. If the two are inconsistent, the rule re-determined from the currently obtained identity is used as the research and judgment rule for video content violation detection, and the video content subsequently displayed to the user is checked against the changed rule. Periodically verifying the user's identity and updating the target research and judgment rule avoids video content being wrongly processed or left unprocessed when the user of a terminal changes during use, which further improves the effectiveness and practicality of the video content display method.
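The periodic re-check can be sketched as a small session object that holds the rule currently in force and swaps it when re-identification yields a different one. The class and method names are hypothetical, scheduling at the preset interval is left to the caller, and `lookup` stands in for the identity-to-rule correspondence.

```python
from typing import Callable, Hashable, Optional


class RuleSession:
    """Tracks the rule currently in force for one connected terminal."""

    def __init__(self, lookup: Callable[[str], Optional[Hashable]], identity: str):
        self._lookup = lookup
        self.rule = lookup(identity)

    def recheck(self, identity: str) -> bool:
        """Call once per interval with the freshly identified user.
        Returns True when the rule changed and later content must be
        checked against the new rule."""
        new_rule = self._lookup(identity)
        if new_rule != self.rule:
            self.rule = new_rule
            return True
        return False
```

For instance, if an adult hands the terminal to a child between two checks, `recheck("child")` would return `True` and the stricter rule would apply from then on.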
In another example, the server also dynamically updates the research and judgment rules in the preset correspondence according to the content type of the video content. Specifically, after performing violating content detection and processing on the video content, when the server detects that the content type of the video content in the video conference scenario has been updated, it uses deep learning or a preset neural network model to generate a corresponding research and judgment rule hot patch from the newly added video content. It then uses the generated hot patch to dynamically maintain the existing research and judgment rules and to update the preset correspondence between research and judgment rules and terminals. Generating hot patches from the video content and maintaining the rules dynamically in this way keeps the research and judgment rules up to date and correct.
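How a hot patch might be merged into the existing rules without a restart can be sketched as below. The text does not define the patch format, so the nested-dictionary shape (group, then rule kind, then a list of entries) is purely an assumption, and the model that generates the patch is out of scope here.

```python
import copy
from typing import Dict, List

Rules = Dict[str, Dict[str, List[str]]]  # group -> rule kind -> entries


def apply_hot_patch(rules: Rules, patch: Rules) -> Rules:
    """Return a new rule table with the patch entries merged in.
    The input table is left untouched so that checks already in progress
    keep a consistent view; the caller swaps in the returned table."""
    patched = copy.deepcopy(rules)
    for group, additions in patch.items():
        target = patched.setdefault(group, {})
        for kind, entries in additions.items():
            target[kind] = sorted(set(target.get(kind, [])) | set(entries))
    return patched
```

Building a new table and swapping the reference is one simple way to approximate a hot update; an in-place update under a lock would be another.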
In addition, violation detection on the video content may be performed by a research and judgment server deployed in the cloud, or directly by a research and judgment module deployed on the smart terminal according to the obtained target research and judgment rule. Cloud deployment ensures that traditional access terminals can also benefit from healthy video content and optimizes global processing resources; deployment on the smart terminal ensures the cleanliness of the uplink and downlink streams more effectively. In practice, the architecture and deployment of the video conference system can be adjusted as needed, and this embodiment does not limit the specific deployment.
Furthermore, it should be understood that the division of the above methods into steps is only for clarity of description. In implementation, steps may be combined into one step, or a step may be split into multiple steps; as long as the same logical relationship is included, they fall within the protection scope of this patent. Adding insignificant modifications to an algorithm or flow, or introducing insignificant designs, without changing the core design of the algorithm or flow, also falls within the protection scope of this patent.
Another aspect of the embodiments of the present application provides a video content display apparatus which, referring to FIG. 6, includes:
A receiving module 601, configured to receive a user's video content acquisition request and obtain the user's identity.
An obtaining module 602, configured to obtain the target research and judgment rule corresponding to the user's identity according to the user's identity and the preset correspondence between user identities and research and judgment rules, where a research and judgment rule is used to detect whether video content contains violating content.
A processing module 603, configured to perform violation detection on the video content requested by the user according to the target research and judgment rule, block the violating content in the video content according to the detection result, and display the video content with the violating content blocked to the user.
It is not difficult to see that this embodiment is an apparatus embodiment corresponding to the method embodiments and can be implemented in cooperation with them. The relevant technical details mentioned in the method embodiments remain valid in this embodiment and are not repeated here. Correspondingly, the relevant technical details mentioned in this embodiment can also be applied to the method embodiments.
It is worth mentioning that all modules involved in this embodiment are logical modules. In practice, a logical unit may be a physical unit, part of a physical unit, or a combination of multiple physical units. In addition, to highlight the innovative part of the present application, units that are not closely related to solving the technical problem addressed by the present application are not introduced in this embodiment, but this does not mean that no other units exist in this embodiment.
Another aspect of the embodiments of the present application provides an electronic device which, referring to FIG. 7, includes at least one processor 701 and a memory 702 communicatively connected to the at least one processor 701. The memory 702 stores instructions executable by the at least one processor 701, and the instructions are executed by the at least one processor 701 so that the at least one processor 701 can perform the video content display method described in any of the above method embodiments.
其中,存储器702和处理器701采用总线方式连接,总线可以包括任意数量的互联的总线和桥,总线将一个或多个处理器701和存储器702的各种电路连接在一起。总线还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路连接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口在总线和收发机之间提供接口。收发机可以是一个元件,也可以是多个元件,比如多个接收器和发送器,提供用于在传输介质上与各种其他装置通信的单元。经处理器701处理的数据通过天线在无线介质上进行传输,进一步,天线还接收数据并将数据传输给处理器701。Wherein, the memory 702 and the processor 701 are connected by a bus, and the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors 701 and various circuits of the memory 702 together. The bus may also connect together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art and therefore will not be further described herein. The bus interface provides an interface between the bus and the transceivers. A transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing means for communicating with various other devices over a transmission medium. The data processed by the processor 701 is transmitted on the wireless medium through the antenna, and further, the antenna also receives the data and transmits the data to the processor 701 .
处理器701负责管理总线和通常的处理,还可以提供各种功能,包括定时,外围接口,电压调节、电源管理以及其他控制功能。而存储器702可以被用于存储处理器701在执行操作时所使用的数据。The processor 701 is responsible for managing the bus and general processing, and may also provide various functions including timing, peripheral interface, voltage regulation, power management and other control functions. And the memory 702 may be used to store data used by the processor 701 when performing operations.
Embodiments of the present application also provide a computer-readable storage medium storing a computer program. When executed by a processor, the computer program implements the above method embodiments.
That is, those skilled in the art will understand that all or some of the steps of the methods in the above embodiments can be carried out by a program instructing the relevant hardware. The program is stored in a storage medium and includes a number of instructions that cause a device (which may be a microcontroller, a chip, or the like) or a processor to perform all or some of the steps of the methods described in the embodiments of the present application. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
Those of ordinary skill in the art will understand that the above embodiments are specific embodiments for implementing the present application, and that in practical applications various changes in form and detail may be made to them without departing from the spirit and scope of the present application.

Claims (11)

  1. A video content display method, comprising:
    receiving a video content acquisition request from a user, and acquiring the identity of the user;
    acquiring a target judgment rule corresponding to the identity of the user according to the identity of the user and a preset correspondence between user identities and judgment rules;
    performing violating content detection on the video content requested by the user according to the target judgment rule, masking violating content in the video content according to the detection result, and displaying to the user the video content in which the violating content has been masked.
  2. The video content display method according to claim 1, wherein, before acquiring the identity of the user, the method further comprises:
    acquiring the terminal type of the user;
    acquiring a target identification method according to the terminal type;
    wherein acquiring the identity of the user comprises:
    acquiring the identity of the user according to the target identification method.
  3. The video content display method according to claim 2, wherein acquiring the target identification method according to the terminal type comprises:
    in the case where the terminal type is a non-5G terminal, acquiring the identity of the user according to the target identification method comprises one or any combination of the following:
    identifying the identity of the user according to login information of the user, identifying the identity of the user according to the voice of the user, and identifying the identity of the user according to a video image of the user;
    in the case where the terminal type is a 5G terminal, acquiring the identity of the user according to the target identification method comprises: identifying the identity of the user according to fingerprint information of the user, and/or identifying the identity of the user according to iris information of the user.
  4. The video content display method according to claim 1, wherein the target judgment rule comprises: an audio judgment rule, a video judgment rule, or a multi-dimensional judgment rule.
  5. The video content display method according to claim 4, wherein performing violating content detection on the video content according to the target judgment rule comprises one or any combination of the following:
    performing sensitive word detection on the video content, performing violating action detection on the video content, and performing violating picture detection on the video content.
  6. The video content display method according to claim 5, wherein masking violating content in the video content according to the detection result comprises one or any combination of the following:
    muting, deleting, or replacing the sensitive word; pixelating, deleting, or replacing the violating action; and pixelating, deleting, or replacing the violating picture.
  7. The video content display method according to claim 1, wherein, after displaying to the user the video content in which the violating content has been masked, the method further comprises:
    periodically acquiring the identity of the user, and determining the target judgment rule according to the currently acquired identity of the user;
    in the case where the target judgment rule has changed, performing violating content detection on the video content according to the changed target judgment rule.
  8. The video content display method according to any one of claims 1 to 7, further comprising: dynamically updating the judgment rules in the preset correspondence according to the content type of the video content.
  9. A video content display apparatus, comprising:
    a receiving module, configured to receive a video content acquisition request from a user and acquire the identity of the user;
    an obtaining module, configured to acquire a target judgment rule corresponding to the identity of the user according to the identity of the user and a preset correspondence between user identities and judgment rules;
    a processing module, configured to perform violating content detection on the video content requested by the user according to the target judgment rule, mask violating content in the video content according to the detection result, and display to the user the video content in which the violating content has been masked.
  10. An electronic device, comprising:
    at least one processor; and
    a memory communicatively connected to the at least one processor; wherein
    the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so that the at least one processor can perform the video content display method according to any one of claims 1 to 8.
  11. A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the video content display method according to any one of claims 1 to 8.
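
For illustration, the terminal-type selection of claims 2 and 3 and the periodic re-identification of claim 7 might be sketched as below. This is a hedged sketch, not the claimed implementation: the method labels, the identity and rule names, and the functions `identification_methods` and `rule_changes` are hypothetical, and the periodic acquisition is modeled simply as a sequence of identities observed at successive ticks.

```python
from typing import Dict, List, Optional, Tuple

# Identification methods listed in claim 3, keyed by terminal type.
# The string labels themselves are hypothetical.
METHODS_BY_TERMINAL: Dict[str, List[str]] = {
    "non-5g": ["login", "voice", "video_image"],
    "5g": ["fingerprint", "iris"],
}

# Hypothetical preset correspondence between identity and rule name.
RULE_BY_IDENTITY: Dict[str, str] = {"minor": "strict", "adult": "basic"}


def identification_methods(terminal_type: str) -> List[str]:
    """Return candidate identification methods for the terminal type.

    Any terminal type other than "5g" is treated as a non-5G terminal.
    """
    key = "5g" if terminal_type.lower() == "5g" else "non-5g"
    return list(METHODS_BY_TERMINAL[key])


def rule_changes(observed_identities: List[str]) -> List[Tuple[int, str]]:
    """Return (tick, rule) pairs at which detection must be (re)run.

    The first observation always counts as a change, because no target
    rule has been determined before it.
    """
    current: Optional[str] = None
    changes: List[Tuple[int, str]] = []
    for tick, identity in enumerate(observed_identities):
        rule = RULE_BY_IDENTITY[identity]
        if rule != current:
            changes.append((tick, rule))
            current = rule
    return changes
```

In this sketch, detection is repeated only at the ticks returned by `rule_changes`, which corresponds to re-running violating content detection only when the target judgment rule has changed.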
PCT/CN2022/137028 2021-12-27 2022-12-06 Video content display method and apparatus, and electronic device and storage medium WO2023124840A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111619546.1A CN116405629A (en) 2021-12-27 2021-12-27 Video content display method, device, electronic equipment and storage medium
CN202111619546.1 2021-12-27

Publications (1)

Publication Number Publication Date
WO2023124840A1 true WO2023124840A1 (en) 2023-07-06

Family

ID=86997623

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/137028 WO2023124840A1 (en) 2021-12-27 2022-12-06 Video content display method and apparatus, and electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN116405629A (en)
WO (1) WO2023124840A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105939487A (en) * 2016-06-06 2016-09-14 乐视控股(北京)有限公司 Video processing method and device
CN109800868A (en) * 2018-12-25 2019-05-24 福州瑞芯微电子股份有限公司 A kind of data encoding chip and method based on deep learning
CN110852231A (en) * 2019-11-04 2020-02-28 云目未来科技(北京)有限公司 Illegal video detection method and device and storage medium
CN111209440A (en) * 2020-01-13 2020-05-29 腾讯科技(深圳)有限公司 Video playing method, device and storage medium
CN111416997A (en) * 2020-03-31 2020-07-14 百度在线网络技术(北京)有限公司 Video playing method and device, electronic equipment and storage medium
CN111432274A (en) * 2019-01-10 2020-07-17 百度在线网络技术(北京)有限公司 Video processing method and device
WO2021087747A1 (en) * 2019-11-05 2021-05-14 深圳市欢太科技有限公司 Method and apparatus for processing push content, and electronic device and storage medium

Also Published As

Publication number Publication date
CN116405629A (en) 2023-07-07

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22914095

Country of ref document: EP

Kind code of ref document: A1