WO2022262719A1 - Live streaming processing method and apparatus, storage medium, and electronic device - Google Patents

Live streaming processing method and apparatus, storage medium, and electronic device Download PDF

Info

Publication number
WO2022262719A1
WO2022262719A1 PCT/CN2022/098645 CN2022098645W WO2022262719A1 WO 2022262719 A1 WO2022262719 A1 WO 2022262719A1 CN 2022098645 W CN2022098645 W CN 2022098645W WO 2022262719 A1 WO2022262719 A1 WO 2022262719A1
Authority
WO
WIPO (PCT)
Prior art keywords
taboo
live
action
live broadcast
recognition result
Prior art date
Application number
PCT/CN2022/098645
Other languages
French (fr)
Chinese (zh)
Inventor
刘伟科
郐洪楠
韩卫召
沈俊杰
邵京平
Original Assignee
北京沃东天骏信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京沃东天骏信息技术有限公司 filed Critical 北京沃东天骏信息技术有限公司
Publication of WO2022262719A1 publication Critical patent/WO2022262719A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44218Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV program
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4882Data services, e.g. news ticker for displaying messages, e.g. warnings, reminders

Definitions

  • the embodiments of the present application relate to the field of live broadcast technologies, for example, to a live broadcast processing method, device, storage medium, and electronic equipment.
  • Embodiments of the present application provide a live broadcast processing method, device, storage medium, and electronic equipment, so as to realize the identification of taboo behaviors in the live broadcast process and prompt the host end.
  • the embodiment of the present application provides a live broadcast processing method, including:
  • the taboo library includes taboo behavior information of a plurality of live display objects
  • a taboo prompt is sent to the live broadcast terminal.
  • the embodiment of the present application also provides a live broadcast processing device, including:
  • the live data stream acquisition module is configured to acquire the live display objects and live data streams at the live end;
  • the video frame recognition module is configured to perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
  • the taboo matching module is configured to perform taboo matching on the recognition result of the action-object and the live display object in a tabu library, wherein the taboo library includes taboo behavior information of multiple live display objects;
  • the taboo prompting module is configured to send a taboo prompt to the live broadcast terminal if the recognition result of the action-object is successfully matched in the tabu library.
  • the embodiment of the present application also provides an electronic device, including a memory, a processor, and a computer program stored on the memory and operable on the processor.
  • the processor executes the program, it implements the The live broadcast processing method provided by any embodiment.
  • the embodiment of the present application further provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the live broadcast processing method provided in any embodiment of the present application is implemented.
  • FIG. 1 is a schematic flowchart of a live broadcast processing method provided in Embodiment 1 of the present application;
  • FIG. 2 is a schematic diagram of a live broadcast scene provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of a process of generating a live data stream at a live end provided by an embodiment of the present application
  • FIG. 4 is a schematic flowchart of a live broadcast processing method provided in Embodiment 2 of the present application.
  • FIG. 5 is a schematic structural diagram of a live broadcast processing device provided in Embodiment 3 of the present application.
  • FIG. 6 is a schematic structural diagram of an electronic device provided in Embodiment 4 of the present application.
  • FIG. 1 is a schematic flow chart of a live broadcast processing method provided in Embodiment 1 of the present application.
  • This embodiment is applicable to real-time detection of taboo behavior during live broadcast during live broadcast.
  • This method can be processed by the live broadcast provided by the embodiment of the present application
  • the live broadcast processing device can be implemented by software and/or hardware, and the live broadcast processing device can be configured on an electronic computing device such as a server or a computer, including the following steps:
  • S120 Perform action recognition and object recognition on the live data stream, and determine an action-object recognition result in the live data stream.
  • FIG. 2 is a schematic diagram of a live broadcast scene provided by an embodiment of the present application.
  • the live broadcast terminal and the client are respectively connected to the live broadcast platform.
  • two different electronic devices in other embodiments, the live broadcast end and the client end can also be the same electronic device, and different user permissions can be distinguished through different display interfaces, or by logging in to different applications Programs (Application, APP) are distinguished, or areas are identified by verifying different role identities.
  • Programs Application, APP
  • the live broadcast terminal and the client can be terminal devices such as mobile phones and tablet computers, which have different permissions.
  • the live broadcast terminal is set to provide live data, and has the live background management authority and the live broadcast object setting authority.
  • the client can watch the live broadcast, and only has the right to watch the live broadcast and the transaction right to the live broadcast objects in the live broadcast.
  • the live broadcast terminal needs to be registered.
  • the live broadcast platform is equipped with a registration module, which receives the user's registration request and user information, registers the user's live broadcast identity, and has corresponding permissions after successful registration.
  • a registration module which receives the user's registration request and user information, registers the user's live broadcast identity, and has corresponding permissions after successful registration.
  • the authority of the device is determined, that is, the device is determined to be one of the live broadcast terminal and the client.
  • you can set up a live broadcast set the live broadcast object (such as the item or game to be introduced), generate the live data stream, and transmit the generated live data stream to the live broadcast platform in real time, so that the client can obtain the live data stream from the live broadcast platform.
  • FIG. 3 is a schematic diagram of a process of generating a live data stream by a live broadcast terminal provided by an embodiment of the present application.
  • the live broadcast terminal sets the live broadcast display object, where the live broadcast display object is the country that the live broadcast display is oriented to, such as country A, country B, etc., and any live broadcast terminal can set at least one live broadcast display object.
  • the live broadcast end includes local live broadcast collection equipment, for example, live broadcast collection equipment includes but is not limited to cameras, microphones, mobile phones and other terminal equipment.
  • the live broadcast application on the terminal device calls the local live broadcast collection device to collect audio and video data to form a live data stream, and sends the live data stream to the live broadcast platform (such as a live server) through the live broadcast application on the terminal device.
  • the live broadcast platform conducts authority identification on the information of the live broadcast terminal. If the live broadcast terminal has the live broadcast authority, it will receive the live data stream and store it, and encapsulate an anchor service layer in the outer layer for external services, which is convenient for receiving the live data stream sent by the client. When requested, the live data stream is sent to the client for display.
  • the live broadcast platform receives the live display objects sent by the live broadcast end.
  • Different live display objects have different taboo behaviors. For example, when the live display objects include country C, the behavior of touching Buddha statues cannot occur in the live broadcast. When country D is included, the behavior of sending even-numbered flowers cannot appear in the live broadcast.
  • pre-setting live display objects it is convenient to detect taboo behaviors in a targeted manner, so as to avoid taboo behaviors of live display objects in live video streams.
  • the taboo library is pre-set in the live broadcast platform, and the taboo library includes taboo behavior information in various countries.
  • the tabu database is stored based on a data structure of ⁇ country-action(act: thing) ⁇ , where country is country information, action is a taboo action, act is an action, and thing is an object.
  • the taboo library may include ⁇ country C-interact with Buddha statues (interaction-Buddha statues) ⁇ , ⁇ country B-display flowers (holding/holding-lotus) ⁇ and so on.
  • the taboo behavior information in the taboo database can be obtained by means of internet search and encyclopedia query.
  • the taboo behavior information in the taboo database can be edited according to needs, such as adding countries and adding country-specific taboo behavior information, or , Add, modify and delete any taboo behavior information, etc.
  • pre-setting the taboo library it is convenient to use the taboo behavior information in the taboo library to perform taboo detection on the live data stream of each live broadcast terminal, and provide taboo prompts for the live data stream of the live broadcast terminal, so as to avoid taboo behavior when the anchor does not understand the national conditions of each country Condition.
  • Taboo behavior information is formed by the combination of actions and objects. For any live data stream, action recognition and object recognition are performed, and taboo behavior information is matched based on the recognized action results and object results. At the same time, the taboo behavior information stored in the taboo library Including corresponding actions and objects, it is convenient for precise matching and improves the high-precision recognition of taboo behavior information.
  • action recognition and object recognition may be performed on each video frame in the live data stream, or action recognition and object recognition may be performed on partial video frames in the live data stream. Not limited.
  • performing action recognition and object recognition on the live data stream, and determining the action-object recognition result in the live data stream includes: determining the detected video frame in the live data stream based on a preset time interval; Perform motion recognition on the video frame to obtain a motion recognition result; perform object recognition on the video frame, the object recognition result, wherein the object recognition result includes object type and object attribute; based on the motion recognition result of the video frame and the object recognition result to obtain the current action-object recognition result of the live data stream.
  • the preset time interval can be 3s-5s, which can be set according to requirements.
  • Each video frame in the live video stream is configured with a time stamp, and each video frame to be detected can be determined according to the time stamp and a preset time interval. The local video frames to be detected are determined by the preset time interval, so as to avoid the problem of excessive calculation caused by detecting the full video frames of the live data stream and affecting the quality of the live broadcast.
  • motion recognition can be performed on video frames based on a preset motion recognition model, and the extracted video frames are input into the pre-trained motion recognition model as input information to obtain the motion type output by the motion recognition model, for example
  • the type of action may include but is not limited to walking, eating, smoking, holding, touching, etc.
  • the action recognition model may be a key point recognition model, and the key point recognition model is set to recognize the key point positions of the characters in the video frame, and determine the corresponding action type based on the combination of key point positions, wherein the key point positions may include Head, hands, legs, and multiple joint points for hands and legs.
  • the action recognition model can be a skeleton recognition model, and the skeleton recognition model is set to recognize the skeleton diagram of the person in the video frame, and determine the corresponding action type based on the positions of multiple bones in the skeleton diagram.
  • the action recognition model includes but is not limited to a neural network model, a boosted tree model, a classifier model, etc., which is not limited thereto.
  • the action recognition model can be trained based on the sample image and the action label of the sample image, wherein the sample image can be collected based on the required action classification, for example, it can be obtained by collecting specific actions of the target object at different angles through an image acquisition device such as a camera , images at different positions and different light intensities are used as sample images, and images obtained based on specific actions as search words in a search engine are used as sample images.
  • the specific action may at least include actions in taboo behavior information according to each country in the taboo library.
  • object recognition can be performed on video frames based on a preset object recognition model, and the extracted video frames are input into the pre-trained object recognition model as input information to obtain the object recognition output by the object recognition model result.
  • the object recognition result may include object type and object attribute, and object attribute may include but not limited to object quantity, object color and object size, etc.
  • object attribute may be based on taboo behavior information of each country in the taboo library Object attributes are determined.
  • the taboo behavior information corresponding to country D includes the behavior of sending even-numbered flowers.
  • the object attributes include the number of objects; the taboo behavior information corresponding to country A includes the behavior of sending green hats, that is, the corresponding , the object properties include the object color.
  • the object recognition model can be trained based on sample images and object labels of the sample images, wherein the sample images can be obtained through a search engine, and the objects in the sample images can at least include objects in the taboo behavior information of each category in the taboo library.
  • the object recognition model includes but is not limited to a neural network model, a boosted tree model, a classifier model, etc., which is not limited thereto.
  • action recognition and object recognition can be performed synchronously, that is, the video frame is synchronously input to the action recognition model and the object recognition model, and corresponding recognition results are obtained respectively, wherein the recognition results carry the time stamp of the video frame And the live identification of the live video stream to which it belongs, combine the action recognition results and object recognition results with the same live identification and timestamp to obtain the current action-object recognition result, avoiding the combination of recognition results of different video frames or different live video streams Actions that contraindicate the case of a mismatch.
  • the action-object recognition results in the taboo library are targeted for taboo matching to improve the matching accuracy and avoid problems such as omissions or false reminders caused by wrong matching.
  • performing tabu matching on the action-object recognition result and the live display object in the tabu library includes: determining a matching range in the tabu library based on the live display object, wherein the The matching range includes the taboo behavior information of the live broadcast display object; the recognition result of the action-object is matched within the matching range of the live broadcast display object.
  • the taboo behavior information of the live display object selected by the live broadcast terminal is extracted from the taboo database to form a matching range, which includes all the taboo behavior information of each live display object selected by the live broadcast terminal in the taboo database, which will be passed through
  • the action-object recognition results obtained by video frame recognition are matched within the above matching range, and the matching pertinence and matching accuracy are improved on the basis of reducing the amount of matching data.
  • the recognition result of the action-object is holding in hand-flowers (three), and sending even-numbered flowers
  • the attribute of the object in the behavior is inconsistent, which is not a taboo behavior.
  • the action-object recognition result is holding flowers (two flowers), which is consistent with the action, object, and object attributes in the behavior of sending even-numbered flowers, which is a taboo behavior.
  • the recognition accuracy is improved, and at the same time, the object and the action are matched separately, which reduces the difficulty of behavior matching and improves the accuracy of behavior matching.
  • a taboo reminder is sent to the live broadcast terminal corresponding to the live video stream, which is used to prompt the anchor to stop the taboo behavior and correct it.
  • sending a taboo reminder to the live broadcast terminal includes: extracting the prompt content of the successfully matched taboo behavior information, and sending the prompt content to the live broadcast terminal, so that the live broadcast terminal displays the prompt content.
  • the taboo library may store prompt content containing taboo behavior information, and the prompt content may include descriptive information including taboo behavior and correct behavior corresponding to the taboo behavior.
  • the taboo library may be stored in a data structure of ⁇ country-action(act: thing)-taboo-right ⁇ , where taboo is the description information of the taboo action, and right is the correct action corresponding to the taboo action.
  • ⁇ Country D-present flowers (send-flowers)-cannot be double (especially not 2)-must be singular ⁇
  • ⁇ Country B-display flowers (take/hold-flowers)-cannot be lotus-other flowers can ⁇
  • ⁇ Country C Interact with Buddha Statue (Interaction-Buddha Statue) - Cannot touch Buddha Statue (especially the head) - Need to maintain respect for Buddha Statue ⁇ .
  • the prompt content includes at least one of text, picture and video.
  • the correct behavior corresponding to the taboo behavior can be at least one form of text, picture and video.
  • the content of the text prompt can be displayed on the interface of the live broadcast terminal based on text bullet screens and text pop-up windows
  • the content of the picture prompt can be displayed on the interface of the live broadcast terminal in the form of a certain screen ratio, and the preset duration is displayed in suspension
  • the video prompt The content can be to repeatedly play the picture-in-picture video N times with a certain shielding ratio on the interface of the live broadcast terminal.
  • a transparent mask layer can be added to the live broadcast layer of the live broadcast terminal, and the specified video can be played at the above ratio and position of the transparent mask layer.
  • the technical solution of this embodiment ensures the completeness and accuracy of the taboo behavior information by setting a taboo database, including the taboo behavior information of each country.
  • Perform action recognition and object recognition on the acquired live data stream determine the action-object recognition result in the live data stream, and perform targeted taboo matching in the tabu library based on the action-object recognition result in the live broadcast display object,
  • the difficulty of behavior recognition and matching is reduced, and the recognition accuracy and matching accuracy are improved.
  • a taboo reminder is sent to the live broadcast terminal, which avoids the problem that the anchor does not understand the taboo behavior information of other countries, and the problem of live taboo occurs, and improves the standardization of live broadcast for different display objects.
  • Fig. 4 is a schematic flow chart of a live broadcast processing method provided in Embodiment 2 of the present application, which is refined on the basis of the above embodiment.
  • the recognition result of the action-object is in the taboo library
  • the method further includes: determining whether the successfully matched taboo behavior information satisfies the judgment condition, and if the successfully matched taboo behavior information does not meet the judgment condition, recording the successfully matched taboo behavior information, and continuing to Perform action recognition and object recognition in the next video frame; when the successfully matched taboo behavior information satisfies the judgment condition, execute the step of sending a taboo prompt to the live broadcast terminal.
  • the method includes:
  • S220 Perform action recognition and object recognition on the live data stream, and determine an action-object recognition result in the live data stream.
  • step S240 If the recognition result of the action-object is successfully matched in the taboo library, determine whether the successfully matched taboo behavior information satisfies the judgment condition, and if the successfully matched taboo behavior information satisfies the judgment condition, execute In step S260, if the successfully matched taboo behavior information does not satisfy the judgment condition, step S250 is executed.
  • step S250 Record the successfully matched taboo behavior information, return to step S220, and perform action recognition and object recognition on the next video frame.
  • a judgment condition is set for each taboo behavior information.
  • the judgment condition is used to determine the degree of taboo behavior. Different taboo behavior information can correspond to different judgment conditions. If the judgment conditions are met , then the live broadcast terminal is triggered to send a taboo prompt to prompt the host to make corrections. When the judgment condition is not met, it can record the taboo behavior information that matches successfully, and perform conditional accumulation on the recorded taboo behavior information, and perform a conditional accumulation on the next video frame Identify and match taboo behavior information until the judgment condition is met. By setting the judgment conditions, no prompts will be given for the live video streams that do not meet the judgment conditions, so as to avoid frequent prompts causing interference to the live broadcast.
  • the judging condition includes one or both of a duration condition and a frequency condition.
  • the judging condition may be that the cumulative duration of the successfully matched taboo behavior information exceeds the preset duration N, and/or, the successfully matched taboo behavior
  • the cumulative frequency of information exceeds the preset frequency M, where N is a natural number greater than 0, and M is a positive integer greater than or equal to 1.
  • a recognition list is formed according to an action-object recognition result of each detected video frame, and the recognition list may include a duration recognition list and/or a frequency recognition list.
  • the duration recognition list can be the data structure of List[ ⁇ action-startTimestamp-ts ⁇ ], where action is a taboo behavior, startTimestamp is the trigger start time, ts is the total duration of the behavior, and List means that the data structure is an ordered array .
  • a record will be saved as: ⁇ Dedicated Flowers-16021313211-0 ⁇ .
  • the video frame is detected based on the preset time interval (for example, every 3 seconds). If the taboo match is successful, a new record is generated and compared with the last record in the List array.
  • the comparison content includes action and startTimestamp, For example, if the action remains unchanged and the startTimestamp interval is 3 seconds (plus or minus 300ms error is allowed), startTimestamp will be overwritten and ts data will be increased by 3.
  • startTimestamp increases by 3 and ts also increases Got 3. If the action remains the same and the startTimestamp interval is greater than 3 seconds when detecting again, the startTimestamp will be overwritten, but the ts data will not be modified.
  • startTimestamp update the start time of each taboo behavior triggered, which is convenient to identify whether the current taboo behavior trigger and the previous trigger taboo behavior are continuous triggers, if the startTimestamp of the current taboo behavior trigger is the previous trigger taboo behavior
  • the sum of startTimestamp and ts data indicates a continuous behavior, and the duration can be superimposed.
  • startTimestamp of the current taboo behavior is not the sum of the startTimestamp and ts data of the previous current taboo behavior, it indicates a discontinuous behavior. If the action changes when detecting again, a new record will be created and appended to the end of the ordered list.
  • the frequency identification list can be a List[ ⁇ action-count ⁇ ] data structure, where count is the cumulative number of times.
  • the List structure is initialized and an action-count data is formed, such as ⁇ touch Buddha statue-1 ⁇ . Every time taboo content is triggered, it is judged whether there is the same action data in the List, and if there is the same action data in the List In this case, its count+1, if there is no same action data in the List, add an act-count data.
  • determining whether the successfully matched taboo behavior information satisfies the judgment condition includes: updating the duration information and/or frequency information of the taboo behavior information based on the successfully matched taboo behavior information; Whether the duration information and/or frequency information of the taboo behavior information satisfies the judgment condition corresponding to the taboo behavior information.
  • the duration identification list and/or the frequency identification list may be updated based on the successfully matched taboo behavior information, and the current duration information and/or frequency information may be determined.
  • Judgment conditions corresponding to successfully matched taboo behavior information are extracted from the taboo database, and current duration information and/or frequency information are compared with the extracted judgment conditions.
  • the judgment condition is that the accumulated duration is greater than 15s. If the current duration information is less than 15s, the judgment condition is not satisfied, and if the current duration information is greater than 15s, the judgment condition is satisfied.
  • the action-object recognition result is successfully matched in the taboo library, it is determined whether the successfully matched taboo behavior information meets the judgment condition, and when the judgment condition is met, the prompt to the live broadcast terminal is triggered. Live video streams that meet the judgment conditions will not be prompted to avoid frequent reminders causing interference to the live broadcast.
  • FIG. 4 is a schematic structural diagram of a live broadcast processing device provided in Embodiment 3 of the present application.
  • the live broadcast processing device may be configured in a host platform or a live broadcast server.
  • the device includes:
  • the live data stream obtaining module 310 is configured to obtain the live display object and the live data stream at the live end;
  • the video frame recognition module 320 is configured to perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
  • the taboo matching module 330 is configured to perform taboo matching on the action-object recognition result and the live display object in a tabu library, wherein the taboo library includes taboo behavior information of multiple live display objects;
  • the taboo prompting module 340 is configured to send a taboo prompt to the live broadcast terminal if the recognition result of the action-object is successfully matched in the tabu library.
  • the video frame identification module 320 is set to:
  • the object recognition result includes object type and object attribute
  • the current action-object recognition result of the live data stream is obtained based on the action recognition result and the object recognition result of the video frame.
  • the contraindication matching module 330 is set to:
  • the device also includes:
  • the taboo judgment module is configured to determine whether the successfully matched taboo behavior information satisfies the judgment condition after the recognition result of the action-object is successfully matched in the taboo library, and in response to the successfully matched taboo behavior information not meeting the judgment condition, Record the successfully matched taboo behavior information, and continue to perform action recognition and object recognition on the next video frame; in response to the successfully matched taboo behavior information meeting the judgment condition, execute the step of sending a taboo prompt to the live broadcast terminal.
  • the judgment conditions include one or two of duration conditions and frequency conditions
  • the taboo judgment module is set to:
  • the determining whether the matching taboo behavior information satisfies the judging conditions includes:
  • the contraindication prompting module 340 is set to:
  • the prompt content includes at least one of text, picture and video.
  • the live broadcast processing device provided in the embodiment of the present application can execute the live broadcast processing method provided in any embodiment of the present application, and has corresponding functional modules for executing the method.
  • FIG. 6 is a schematic structural diagram of an electronic device provided in Embodiment 4 of the present application.
  • FIG. 6 shows a block diagram of an electronic device 12 suitable for implementing embodiments of the present application.
  • the electronic device 12 shown in FIG. 6 is only an example, and should not limit the functions and scope of use of the embodiment of the present application.
  • Device 12 is typically an electronic device undertaking image classification functions.
  • electronic device 12 takes the form of a general-purpose computing device.
  • Components of the electronic device 12 may include, but are not limited to: at least one processor 16, a storage device 28, and a bus 18 connecting various system components (including the storage device 28 and the processor 16).
  • Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus structures.
  • these architectures include but are not limited to Industry Standard Architecture (Industry Standard Architecture, ISA) bus, Micro Channel Architecture (Micro Channel Architecture, MCA) bus, Enhanced ISA bus, Video Electronics Standards Association (Video Electronics Standards Association, VESA) local bus and peripheral component interconnect (Peripheral Component Interconnect, PCI) bus.
  • Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12 and include both volatile and nonvolatile media, removable and non-removable media.
  • Storage device 28 may include computer system readable media in the form of volatile memory, such as random access memory (Random Access Memory, RAM) 30 and/or cache memory 32 .
  • the electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media.
  • storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in FIG. 6, commonly referred to as a "hard drive").
  • a disk drive for reading and writing to a removable nonvolatile disk may be provided, as well as a removable nonvolatile disk (such as a Compact Disc- Read Only Memory, CD-ROM), Digital Video Disc (Digital Video Disc-Read Only Memory, DVD-ROM) or other optical media) CD-ROM drive.
  • each drive may be connected to bus 18 via at least one data medium interface.
  • the storage device 28 may include at least one program product having a set (for example, at least one) of program modules configured to perform the functions of the various embodiments of the present application.
  • Program modules 26 generally perform functions and/or methods in the embodiments described herein.
  • the electronic device 12 can also communicate with at least one external device 14 (such as a keyboard, a pointing device, a camera, a display 24, etc.), and also communicate with at least one device that enables a user to interact with the electronic device 12, and/or communicate with a device that enables the user to interact with the electronic device 12.
  • Electronic device 12 is capable of communicating with any device (eg, network card, modem, etc.) that communicates with at least one other computing device. Such communication may occur through input/output (I/O) interface 22 .
  • the electronic device 12 can also communicate with at least one gateway (such as a local area network (Local Area Network, LAN), wide area network, Wide Area Network, WAN) and/or a public gateway, such as the Internet, through the gateway adapter 20.
  • gateway adapter 20 communicates with other modules of electronic device 12 via bus 18 .
  • other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, disk arrays (Redundant Arrays) of Independent Disks, RAID) systems, tape drives, and data backup storage systems.
  • the processor 16 executes various functional applications and data processing by running the programs stored in the storage device 28 , for example, realizing the live broadcast processing method provided by the above-mentioned embodiments of the present application.
  • Embodiment 5 of the present application provides a computer-readable storage medium, on which a computer program is stored.
  • the program is executed by a processor, the live broadcast processing method provided in the embodiment of the present application is implemented.
  • the computer-readable storage medium provided by the embodiment of the present application is not limited to the method operation described above, and can also execute the live broadcast processing method provided by any embodiment of the present application.
  • the computer storage medium in the embodiments of the present application may use any combination of at least one computer-readable medium.
  • the computer readable medium may be a computer readable signal medium or a computer readable storage medium.
  • a computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer readable signal medium may include a data signal carrying computer readable source code in baseband or as part of a carrier wave traveling as a data signal. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can send, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device. .
  • Source code embodied on a computer readable medium may be transmitted using any appropriate medium, including - but not limited to wireless, wire, optical cable, radio frequency (RF), etc., or any suitable combination of the foregoing.
  • any appropriate medium including - but not limited to wireless, wire, optical cable, radio frequency (RF), etc., or any suitable combination of the foregoing.
  • Computer source code for carrying out the operations of this application may be written in one or more programming languages, or combinations thereof, including object-oriented programming languages—such as Java, Smalltalk, C++, and conventional Procedural Programming Language - such as "C" or a similar programming language.
  • the Source Code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer may be connected to the user computer through any kind of gateway, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (such as through an Internet Service Provider). Internet connection).
  • LAN local area network
  • WAN wide area network
  • Internet Service Provider such as AT&T, MCI, Sprint, EarthLink, MSN, GTE, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Social Psychology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • User Interface Of Digital Computer (AREA)
  • Image Analysis (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Disclosed in embodiments of the present application are a live streaming processing method and apparatus, a storage medium, and an electronic device. The method comprises: obtaining a live display object and a live data stream of a live streaming terminal; performing action recognition and object recognition on the live data stream to determine recognition results of an action and object in the live data stream; performing taboo match on the recognition results of the action and object and the live display object in a taboo database, wherein the taboo database comprises taboo behavior information of a plurality of live display objects; and in response to the action and object recognition results being successfully matched in the taboo database, sending a taboo prompt to the live streaming terminal.

Description

一种直播处理方法、装置、存储介质及电子设备A live broadcast processing method, device, storage medium and electronic equipment
本申请要求在2021年6月15日提交中国专利局、申请号为202110662255.4的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。This application claims priority to a Chinese patent application with application number 202110662255.4 filed with the China Patent Office on June 15, 2021, the entire contents of which are incorporated herein by reference.
技术领域technical field
本申请实施例涉及直播技术领域,例如涉及一种直播处理方法、装置、存储介质及电子设备。The embodiments of the present application relate to the field of live broadcast technologies, for example, to a live broadcast processing method, device, storage medium, and electronic equipment.
背景技术Background technique
随着互联网技术的发展和社会文明的提高,直播行业日趋成熟。直播作为互动体验感最强的形式,得到了越来越多用户的喜爱。With the development of Internet technology and the improvement of social civilization, the live broadcast industry is becoming more and more mature. As the form with the strongest interactive experience, live broadcast has been loved by more and more users.
相关技术中至少存在以下技术问题:世界各国有很多风俗、习惯和禁忌,由于直播的准入门槛很低,不能保证主播能够通晓全世界的风俗习惯,直播领域的实时性很容易造成恶劣的影响。There are at least the following technical problems in the related technologies: There are many customs, habits and taboos in countries all over the world. Due to the low entry threshold for live broadcasting, it cannot be guaranteed that the anchor can understand the customs and habits of the world, and the real-time nature of live broadcasting can easily cause adverse effects .
发明内容Contents of the invention
本申请实施例提供一种直播处理方法、装置、存储介质及电子设备,以实现识别直播过程中的禁忌行为,对主播端进行提示。Embodiments of the present application provide a live broadcast processing method, device, storage medium, and electronic equipment, so as to realize the identification of taboo behaviors in the live broadcast process and prompt the host end.
第一方面,本申请实施例提供了一种直播处理方法,包括:In the first aspect, the embodiment of the present application provides a live broadcast processing method, including:
获取直播端的直播展示对象以及直播数据流;Obtain the live display object and live data stream of the live broadcast terminal;
对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果;Perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息;Performing tabu matching in the taboo library with the recognition result of the action-object and the live display object, wherein the taboo library includes taboo behavior information of a plurality of live display objects;
响应于所述动作-物体的识别结果在所述禁忌库中匹配成功,向所述直播端发送禁忌提示。In response to the recognition result of the action-object being successfully matched in the taboo database, a taboo prompt is sent to the live broadcast terminal.
第二方面,本申请实施例还提供了一种直播处理装置,包括:In the second aspect, the embodiment of the present application also provides a live broadcast processing device, including:
直播数据流获取模块,设置为获取直播端的直播展示对象以及直播数据流;The live data stream acquisition module is configured to acquire the live display objects and live data streams at the live end;
视频帧识别模块,设置为对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果;The video frame recognition module is configured to perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
禁忌匹配模块,设置为将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息;The taboo matching module is configured to perform taboo matching on the recognition result of the action-object and the live display object in a tabu library, wherein the taboo library includes taboo behavior information of multiple live display objects;
禁忌提示模块,设置为若所述动作-物体的识别结果在所述禁忌库中匹配成功,则向所述直播端发送禁忌提示。The taboo prompting module is configured to send a taboo prompt to the live broadcast terminal if the recognition result of the action-object is successfully matched in the tabu library.
第三方面,本申请实施例还提供了一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如本申请任一实施例提供的直播处理方法。In the third aspect, the embodiment of the present application also provides an electronic device, including a memory, a processor, and a computer program stored on the memory and operable on the processor. When the processor executes the program, it implements the The live broadcast processing method provided by any embodiment.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请任一实施例提供的直播处理方法。In a fourth aspect, the embodiment of the present application further provides a computer-readable storage medium, on which a computer program is stored, and when the program is executed by a processor, the live broadcast processing method provided in any embodiment of the present application is implemented.
附图说明Description of drawings
图1为本申请实施例一提供的一种直播处理方法的流程示意图;FIG. 1 is a schematic flowchart of a live broadcast processing method provided in Embodiment 1 of the present application;
图2是本申请实施例提供的一种直播场景的示意图;FIG. 2 is a schematic diagram of a live broadcast scene provided by an embodiment of the present application;
图3是本申请实施例提供的一种直播端生成直播数据流的过程示意图;FIG. 3 is a schematic diagram of a process of generating a live data stream at a live end provided by an embodiment of the present application;
图4是本申请实施例二提供的一种直播处理方法的流程示意图;FIG. 4 is a schematic flowchart of a live broadcast processing method provided in Embodiment 2 of the present application;
图5为本申请实施例三提供的一种直播处理装置的结构示意图;FIG. 5 is a schematic structural diagram of a live broadcast processing device provided in Embodiment 3 of the present application;
图6为本申请实施例四提供的一种电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device provided in Embodiment 4 of the present application.
具体实施方式detailed description
下面结合附图和实施例对本申请作详细说明。The application will be described in detail below in conjunction with the accompanying drawings and embodiments.
实施例一Embodiment one
图1为本申请实施例一提供的一种直播处理方法的流程示意图,本实施例可适用于在直播过程中实时检测直播过程中禁忌行为的情况,该方法可以由本申请实施例提供的直播处理装置来执行,该直播处理装置可以由软件和/或硬件来实现,该直播处理装置可以配置在诸如服务器或计算机等的电子计算设备上,包括如下步骤:Figure 1 is a schematic flow chart of a live broadcast processing method provided in Embodiment 1 of the present application. This embodiment is applicable to real-time detection of taboo behavior during live broadcast during live broadcast. This method can be processed by the live broadcast provided by the embodiment of the present application The live broadcast processing device can be implemented by software and/or hardware, and the live broadcast processing device can be configured on an electronic computing device such as a server or a computer, including the following steps:
S110、获取直播端的直播展示对象以及直播数据流。S110. Obtain a live display object and a live data stream of the live broadcast terminal.
S120、对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果。S120. Perform action recognition and object recognition on the live data stream, and determine an action-object recognition result in the live data stream.
S130、将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息。S130. Perform tabu matching on the action-object recognition result and the live display object in a tabu library, wherein the tabu library includes taboo behavior information of multiple live display objects.
S140、若所述动作-物体的识别结果在所述禁忌库中匹配成功,则向所述直播端发送禁忌提示。S140. If the recognition result of the action-object is successfully matched in the taboo library, send a taboo reminder to the live broadcast terminal.
示例性的,参见图2,图2是本申请实施例提供的一种直播场景的示意图,直播端和客户端分别与直播平台通信连接,在一些实施例中,直播端和客户端可以是分别的两端,即两个不同的电子设备;在另一些实施例中,还可以是直播端和客户端是同一电子设备,通过不同的展示界面实现不同用户权限的区分,或者通过登录不同的应用程序(Application,APP)进行区分,或者通过验证不同的角色身份进行区域。直播端和客户端可以是诸如手机、平板电脑等的终端设备,分别具有不同的权限,直播端设置为提供直播数据,具有直播后台管理权限、直播对象设置权限。客户端可观看直播,仅具有直播观看权限和对直播中直播对象的交易权限。For example, refer to FIG. 2. FIG. 2 is a schematic diagram of a live broadcast scene provided by an embodiment of the present application. The live broadcast terminal and the client are respectively connected to the live broadcast platform. two different electronic devices; in other embodiments, the live broadcast end and the client end can also be the same electronic device, and different user permissions can be distinguished through different display interfaces, or by logging in to different applications Programs (Application, APP) are distinguished, or areas are identified by verifying different role identities. The live broadcast terminal and the client can be terminal devices such as mobile phones and tablet computers, which have different permissions. The live broadcast terminal is set to provide live data, and has the live background management authority and the live broadcast object setting authority. The client can watch the live broadcast, and only has the right to watch the live broadcast and the transaction right to the live broadcast objects in the live broadcast.
直播端需要进行注册,直播平台配置有注册模块,接收用户的注册请求以及用户信息,对用户的直播身份进行注册,注册成功后具备相应的权限。对于任一设备向直播平台发起请求时,判断该设备是否登录,登录成功后,确定该设备的权限,即确定该设备为直播端和客户端中的一项。对于直播端可开设直 播,设置直播对象(例如进行介绍的物品或者游戏等),生成直播数据流,将生成的直播数据流实时传输至直播平台,便于客户端从直播平台获取直播数据流。The live broadcast terminal needs to be registered. The live broadcast platform is equipped with a registration module, which receives the user's registration request and user information, registers the user's live broadcast identity, and has corresponding permissions after successful registration. When any device initiates a request to the live broadcast platform, it is judged whether the device is logged in. After the login is successful, the authority of the device is determined, that is, the device is determined to be one of the live broadcast terminal and the client. For the live broadcast end, you can set up a live broadcast, set the live broadcast object (such as the item or game to be introduced), generate the live data stream, and transmit the generated live data stream to the live broadcast platform in real time, so that the client can obtain the live data stream from the live broadcast platform.
参见图3,图3是本申请实施例提供的一种直播端生成直播数据流的过程示意图。直播端在完成注册和登录后,设置直播展示对象,其中直播展示对象为直播展示所面向的国别,例如国家A、国家B等,任一直播端可设置至少一个直播展示对象。直播端包括本地直播采集设备,例如直播采集设备包括但不限于摄像头、麦克风以及手机等终端设备。通过终端设备上的直播应用调用本地直播采集设备采集音视频数据,形成直播数据流,并通过终端设备上的直播应用将直播数据流发送至直播平台(例如直播服务器)。直播平台对直播端信息进行权限鉴定,若该直播端具有直播权限,则接收直播数据流并存储,在外层封装一个主播服务层用于对外服务,便于在接收到客户端发送的直播数据流的请求时,将该直播数据流发送至客户端进行展示。Referring to FIG. 3 , FIG. 3 is a schematic diagram of a process of generating a live data stream by a live broadcast terminal provided by an embodiment of the present application. After completing the registration and login, the live broadcast terminal sets the live broadcast display object, where the live broadcast display object is the country that the live broadcast display is oriented to, such as country A, country B, etc., and any live broadcast terminal can set at least one live broadcast display object. The live broadcast end includes local live broadcast collection equipment, for example, live broadcast collection equipment includes but is not limited to cameras, microphones, mobile phones and other terminal equipment. The live broadcast application on the terminal device calls the local live broadcast collection device to collect audio and video data to form a live data stream, and sends the live data stream to the live broadcast platform (such as a live server) through the live broadcast application on the terminal device. The live broadcast platform conducts authority identification on the information of the live broadcast terminal. If the live broadcast terminal has the live broadcast authority, it will receive the live data stream and store it, and encapsulate an anchor service layer in the outer layer for external services, which is convenient for receiving the live data stream sent by the client. When requested, the live data stream is sent to the client for display.
本实施例中,直播平台接收直播端发送的直播展示对象,不同直播展示对象存在不同的禁忌行为,例如在直播展示对象包括国家C时,不能在直播中出现摸佛像的行为,在直播展示对象包括国家D时,不能在直播中出现送双数花朵的行为。通过预先设置直播展示对象,便于针对性的进行禁忌行为的检测,以避免直播视频流中出现直播展示对象的禁忌行为。In this embodiment, the live broadcast platform receives the live display objects sent by the live broadcast end. Different live display objects have different taboo behaviors. For example, when the live display objects include country C, the behavior of touching Buddha statues cannot occur in the live broadcast. When country D is included, the behavior of sending even-numbered flowers cannot appear in the live broadcast. By pre-setting live display objects, it is convenient to detect taboo behaviors in a targeted manner, so as to avoid taboo behaviors of live display objects in live video streams.
直播平台中预先设置禁忌库,该禁忌库中包括各个国家的禁忌行为信息。在一些实施例中,禁忌库中基于{country-action(act:thing)}的数据结构进行存储,其中,country为国别信息,action为禁忌行为,act为动作,thing为物体。示例性的,禁忌库中可以包括{国家C-与佛像互动(互动-佛像)},{国家B-展示花(拿/捧-荷花)}等。禁忌库中的禁忌行为信息可以是网络搜索、百科查询的方式获取,同时,禁忌库中的禁忌行为信息可以是根据需求进行编辑,例如新增国别以及新增国别的禁忌行为信息,或者,增加、修改和删除任一禁忌行为信息等。通过预先设置禁忌库,便于通过该禁忌库中的禁忌行为信息对每个直播端的直播数据流进行禁忌检测,对直播端的直播数据流进行禁忌提示,避免在主 播不了解各国国情时出现禁忌行为的情况。The taboo library is pre-set in the live broadcast platform, and the taboo library includes taboo behavior information in various countries. In some embodiments, the tabu database is stored based on a data structure of {country-action(act: thing)}, where country is country information, action is a taboo action, act is an action, and thing is an object. Exemplarily, the taboo library may include {country C-interact with Buddha statues (interaction-Buddha statues)}, {country B-display flowers (holding/holding-lotus)} and so on. The taboo behavior information in the taboo database can be obtained by means of internet search and encyclopedia query. At the same time, the taboo behavior information in the taboo database can be edited according to needs, such as adding countries and adding country-specific taboo behavior information, or , Add, modify and delete any taboo behavior information, etc. By pre-setting the taboo library, it is convenient to use the taboo behavior information in the taboo library to perform taboo detection on the live data stream of each live broadcast terminal, and provide taboo prompts for the live data stream of the live broadcast terminal, so as to avoid taboo behavior when the anchor does not understand the national conditions of each country Condition.
禁忌行为信息由动作和物体组合形成,对于任一直播数据流进行动作识别和物体识别,基于识别到的动作结果和物体结果进行禁忌行为信息的匹配,同时,禁忌库中存储的禁忌行为信息中包括对应的动作和物体,便于进行精准匹配,提高禁忌行为信息的高精度识别。Taboo behavior information is formed by the combination of actions and objects. For any live data stream, action recognition and object recognition are performed, and taboo behavior information is matched based on the recognized action results and object results. At the same time, the taboo behavior information stored in the taboo library Including corresponding actions and objects, it is convenient for precise matching and improves the high-precision recognition of taboo behavior information.
在一些可选实施例中,可以是对所述直播数据流中的每一个视频帧进行动作识别和物体识别,还可以是对直播数据流中的局部视频帧进行动作识别和物体识别,对此不作限定。In some optional embodiments, action recognition and object recognition may be performed on each video frame in the live data stream, or action recognition and object recognition may be performed on partial video frames in the live data stream. Not limited.
可选的,对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果包括:基于预设时间间隔确定所述直播数据流中进行检测的视频帧;对所述视频帧进行动作识别,得到动作识别结果;对所述视频帧进行物体识别,所述物体识别结果,其中,所述物体识别结果包括物体类型和物体属性;基于所述视频帧的动作识别结果和物体识别结果得到所述直播数据流在当前的动作-物体的识别结果。其中,预设时间间隔可以是3s-5s,可根据需求设置。直播视频流中的每个视频帧中均配置有时间戳,可根据时间戳以及预设时间间隔确定进行检测的每一视频帧。通过预设时间间隔确定进行检测的局部视频帧,避免对直播数据流的全量视频帧进行检测导致的计算量过大,影响直播质量的问题。Optionally, performing action recognition and object recognition on the live data stream, and determining the action-object recognition result in the live data stream includes: determining the detected video frame in the live data stream based on a preset time interval; Perform motion recognition on the video frame to obtain a motion recognition result; perform object recognition on the video frame, the object recognition result, wherein the object recognition result includes object type and object attribute; based on the motion recognition result of the video frame and the object recognition result to obtain the current action-object recognition result of the live data stream. Wherein, the preset time interval can be 3s-5s, which can be set according to requirements. Each video frame in the live video stream is configured with a time stamp, and each video frame to be detected can be determined according to the time stamp and a preset time interval. The local video frames to be detected are determined by the preset time interval, so as to avoid the problem of excessive calculation caused by detecting the full video frames of the live data stream and affecting the quality of the live broadcast.
在一些实施例中,可以是基于预先设置的动作识别模型对视频帧进行动作识别,将提取的视频帧作为输入信息输入至预先训练的动作识别模型中,得到动作识别模型输出的动作类型,示例性的,动作类型可以是包括但不限于走、吃、抽烟、捧、摸等。可选的,动作识别模型可以是关键点识别模型,关键点识别模型设置为识别视频帧中人物的关键点位置,并基于关键点位置的组合确定对应的动作类型,其中,关键点位置可以包括头、手部、腿部以及手部、腿部的多个关节点。可选的,动作识别模型可以是骨骼识别模型,骨骼识别模型设置为识别视频帧中人物的骨架图,基于骨架图中多个骨骼的位置确定对应的 动作类型。可选的,动作识别模型包括但不限于神经网络模型、提升树模型、分类器模型等,对此不作限定。In some embodiments, motion recognition can be performed on video frames based on a preset motion recognition model, and the extracted video frames are input into the pre-trained motion recognition model as input information to obtain the motion type output by the motion recognition model, for example Specifically, the type of action may include but is not limited to walking, eating, smoking, holding, touching, etc. Optionally, the action recognition model may be a key point recognition model, and the key point recognition model is set to recognize the key point positions of the characters in the video frame, and determine the corresponding action type based on the combination of key point positions, wherein the key point positions may include Head, hands, legs, and multiple joint points for hands and legs. Optionally, the action recognition model can be a skeleton recognition model, and the skeleton recognition model is set to recognize the skeleton diagram of the person in the video frame, and determine the corresponding action type based on the positions of multiple bones in the skeleton diagram. Optionally, the action recognition model includes but is not limited to a neural network model, a boosted tree model, a classifier model, etc., which is not limited thereto.
动作识别模型可以是基于样本图像和样本图像的动作标签训练得到,其中,样本图像可以是基于所需的动作分类采集得到,例如可以是通过摄像头等图像采集设备对目标对象采集特定动作在不同角度、不同位置、不同光照强度下的图像,作为样本图像,还可以是在搜索引擎中基于特定动作为搜索词得到的图像,作为样本图像。其中,特定动作可以至少包括根据禁忌库中各国别的禁忌行为信息中的动作。The action recognition model can be trained based on the sample image and the action label of the sample image, wherein the sample image can be collected based on the required action classification, for example, it can be obtained by collecting specific actions of the target object at different angles through an image acquisition device such as a camera , images at different positions and different light intensities are used as sample images, and images obtained based on specific actions as search words in a search engine are used as sample images. Wherein, the specific action may at least include actions in taboo behavior information according to each country in the taboo library.
在一些可选实施例中,可以是基于预先设置的物体识别模型对视频帧进行物体识别,将提取的视频帧作为输入信息输入至预先训练的物体识别模型中,得到物体识别模型输出的物体识别结果。其中,物体识别结果中可以是包括物体类型和物体属性,物体属性可以包括但不限于物体数量、物体颜色和物体尺寸等,其中,物体属性可以是基于禁忌库中各国别的禁忌行为信息中的物体属性确定,示例性的,国家D对应的禁忌行为信息包括送双数花朵的行为,相应的,物体属性中包括物体数量;国家A对应的禁忌行为信息包括送绿色帽子的行为,即相应的,物体属性中包括物体颜色。In some optional embodiments, object recognition can be performed on video frames based on a preset object recognition model, and the extracted video frames are input into the pre-trained object recognition model as input information to obtain the object recognition output by the object recognition model result. Wherein, the object recognition result may include object type and object attribute, and object attribute may include but not limited to object quantity, object color and object size, etc., wherein, object attribute may be based on taboo behavior information of each country in the taboo library Object attributes are determined. For example, the taboo behavior information corresponding to country D includes the behavior of sending even-numbered flowers. Correspondingly, the object attributes include the number of objects; the taboo behavior information corresponding to country A includes the behavior of sending green hats, that is, the corresponding , the object properties include the object color.
物体识别模型可以是基于样本图像和样本图像的物体标签训练得到,其中,样本图像可以是通过搜索引擎获取,该样本图像中物体可以是至少包括禁忌库中各国别的禁忌行为信息中的物体。可选的,物体识别模型包括但不限于神经网络模型、提升树模型、分类器模型等,对此不作限定。The object recognition model can be trained based on sample images and object labels of the sample images, wherein the sample images can be obtained through a search engine, and the objects in the sample images can at least include objects in the taboo behavior information of each category in the taboo library. Optionally, the object recognition model includes but is not limited to a neural network model, a boosted tree model, a classifier model, etc., which is not limited thereto.
对于任一视频帧,可以是同步执行动作识别和物体识别,即将视频帧同步输入至动作识别模型和物体识别模型,分别得到对应的识别结果,其中,该识别结果中携带有视频帧的时间戳以及所属直播视频流的直播标识,将具有相同直播标识和时间戳的动作识别结果和物体识别结果组合,得到当前的动作-物体的识别结果,避免不同视频帧或者不同直播视频流的识别结果组合动作的禁忌误匹配的情况。For any video frame, action recognition and object recognition can be performed synchronously, that is, the video frame is synchronously input to the action recognition model and the object recognition model, and corresponding recognition results are obtained respectively, wherein the recognition results carry the time stamp of the video frame And the live identification of the live video stream to which it belongs, combine the action recognition results and object recognition results with the same live identification and timestamp to obtain the current action-object recognition result, avoiding the combination of recognition results of different video frames or different live video streams Actions that contraindicate the case of a mismatch.
将得到的动作-物体的识别结果在禁忌库中进行匹配,确定该动作-物体的识别结果是否属于禁忌行为。由于不同国别存在不同的禁忌行为信息,同一动作-物体的识别结果在不同国别存在不同的匹配结果,例如,送双数花朵的行为在国家A不属于禁忌行为,而在国家D属于禁忌行为,因此,根据直播端发送的直播展示对象针对性的对动作-物体的识别结果进行禁忌匹配,提高匹配精确度,避免误匹配导致的遗漏或误提醒等问题。Match the obtained action-object recognition result in the taboo library to determine whether the action-object recognition result belongs to a taboo behavior. Since there are different taboo behavior information in different countries, the recognition results of the same action-object have different matching results in different countries. For example, the behavior of sending even-numbered flowers is not a taboo behavior in country A, but it is taboo in country D Therefore, according to the live broadcast display objects sent by the live broadcast end, the action-object recognition results are targeted for taboo matching to improve the matching accuracy and avoid problems such as omissions or false reminders caused by wrong matching.
可选的,将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,包括:基于所述直播展示对象在所述禁忌库中确定匹配范围,其中,所述匹配范围包括所述直播展示对象的禁忌行为信息;将所述动作-物体的识别结果在所述直播展示对象的匹配范围内进行匹配。示例性的,在禁忌库中提取直播端选择的直播展示对象的禁忌行为信息,形成匹配范围,该匹配范围内包括直播端选择的每一个直播展示对象在禁忌库中全部禁忌行为信息,将通过视频帧识别得到的动作-物体的识别结果在上述匹配范围内进行匹配,在减少匹配数据量的基础上,提高匹配的针对性和匹配精度。Optionally, performing tabu matching on the action-object recognition result and the live display object in the tabu library includes: determining a matching range in the tabu library based on the live display object, wherein the The matching range includes the taboo behavior information of the live broadcast display object; the recognition result of the action-object is matched within the matching range of the live broadcast display object. Exemplarily, the taboo behavior information of the live display object selected by the live broadcast terminal is extracted from the taboo database to form a matching range, which includes all the taboo behavior information of each live display object selected by the live broadcast terminal in the taboo database, which will be passed through The action-object recognition results obtained by video frame recognition are matched within the above matching range, and the matching pertinence and matching accuracy are improved on the basis of reducing the amount of matching data.
分别将动作-物体的识别结果中的动作、物体以及物体属性与匹配范围内的各禁忌行为信息进行匹配,若动作、物体以及物体属性均匹配成功,则确定视频帧中包括禁忌行为,若动作、物体以及物体属性中的至少一项未匹配成功,则确定视频帧中不包括禁忌行为,示例性的,动作-物体的识别结果为手捧-花(三朵),与送双数花朵的行为中物体属性不一致,不属于禁忌行为,动作-物体的识别结果为手捧-花(二朵),与送双数花朵的行为中动作、物体以及物体属性均一致,属于禁忌行为。Match the actions, objects, and object attributes in the action-object recognition results with the taboo behavior information within the matching range. If the actions, objects, and object attributes are all matched successfully, it is determined that the video frame includes taboo behaviors. If the action If at least one of , object and object attributes does not match successfully, it is determined that taboo behavior is not included in the video frame. Exemplary, the recognition result of the action-object is holding in hand-flowers (three), and sending even-numbered flowers The attribute of the object in the behavior is inconsistent, which is not a taboo behavior. The action-object recognition result is holding flowers (two flowers), which is consistent with the action, object, and object attributes in the behavior of sending even-numbered flowers, which is a taboo behavior.
本实施例中,通过将物体和动作分别识别,提高识别精度,同时将物体和动作分别进行匹配,降低了行为匹配难度,提高了行为匹配精确度。在任一直播视频流的动作-物体的识别结果在禁忌库中匹配成功时,向该直播视频流对应的直播端发送禁忌提示,用于提示主播停止禁忌行为并改正。In this embodiment, by separately recognizing the object and the action, the recognition accuracy is improved, and at the same time, the object and the action are matched separately, which reduces the difficulty of behavior matching and improves the accuracy of behavior matching. When the action-object recognition result of any live video stream is successfully matched in the taboo library, a taboo reminder is sent to the live broadcast terminal corresponding to the live video stream, which is used to prompt the anchor to stop the taboo behavior and correct it.
可选的,向所述直播端发送禁忌提示,包括:提取所述匹配成功的禁忌行 为信息的提示内容,将所述提示内容发送至所述直播端,以使所述直播端展示所述提示内容。在一些可选实施例中,禁忌库中可以是存储有禁忌行为信息的提示内容,该提示内容可以是包括禁忌行为的描述信息,以及禁忌行为对应的正确行为。可选的,禁忌库中可以是以{country-action(act:thing)-taboo-right}的数据结构存储,其中,taboo为禁忌行为的描述信息,right为禁忌行为对应的正确行为。示例性的,{国家D-献花(送-花)-不能双数(尤其不能是2)-必须单数},{国家B-展示花(拿/捧-花)-不能是荷花-其他花可以},{国家C-与佛像互动(互动-佛像)-不能摸佛像(尤其是头)-需要对佛像保持尊敬}。在动作-物体的识别结果与禁忌行为信息中的act项和thing项匹配成功时,提取taboo项和right项,形成禁忌提示信息,基于该直播视频流的直播标识将禁忌提示信息发送至直播端,以使直播端在接收到禁忌提示信息时,展示提示内容。Optionally, sending a taboo reminder to the live broadcast terminal includes: extracting the prompt content of the successfully matched taboo behavior information, and sending the prompt content to the live broadcast terminal, so that the live broadcast terminal displays the prompt content. In some optional embodiments, the taboo library may store prompt content containing taboo behavior information, and the prompt content may include descriptive information including taboo behavior and correct behavior corresponding to the taboo behavior. Optionally, the taboo library may be stored in a data structure of {country-action(act: thing)-taboo-right}, where taboo is the description information of the taboo action, and right is the correct action corresponding to the taboo action. Exemplary, {Country D-present flowers (send-flowers)-cannot be double (especially not 2)-must be singular}, {Country B-display flowers (take/hold-flowers)-cannot be lotus-other flowers can }, {Country C - Interact with Buddha Statue (Interaction-Buddha Statue) - Cannot touch Buddha Statue (especially the head) - Need to maintain respect for Buddha Statue}. When the recognition result of the action-object matches the act item and the thing item in the taboo behavior information, extract the taboo item and right item to form taboo prompt information, and send the taboo prompt information to the live broadcast terminal based on the live broadcast identifier of the live video stream , so that the live broadcast end will display the prompt content when it receives a taboo prompt message.
在一些可选实施例中,提示内容包括文本、图片和视频中的至少一项。其中,禁忌行为对应的正确行为可以是文本、图片和视频中至少一种形式。其中,文本提示内容可以是基于文字弹幕、文字弹窗的形式在直播端界面进行显示,图片提示内容可以是以一定屏幕比例的形式在直播端界面展示,并悬浮显示预设时长,视频提示内容可以是在直播端界面以一定屏蔽比例反复播放画中画视频N次,示例性的,可以在直播端的直播层增加一个透明蒙层,并在透明蒙层上述比例和位置播放指定视频。In some optional embodiments, the prompt content includes at least one of text, picture and video. Wherein, the correct behavior corresponding to the taboo behavior can be at least one form of text, picture and video. Among them, the content of the text prompt can be displayed on the interface of the live broadcast terminal based on text bullet screens and text pop-up windows, and the content of the picture prompt can be displayed on the interface of the live broadcast terminal in the form of a certain screen ratio, and the preset duration is displayed in suspension, and the video prompt The content can be to repeatedly play the picture-in-picture video N times with a certain shielding ratio on the interface of the live broadcast terminal. For example, a transparent mask layer can be added to the live broadcast layer of the live broadcast terminal, and the specified video can be played at the above ratio and position of the transparent mask layer.
本实施例的技术方案,通过设置禁忌库,包括各个国别的禁忌行为信息,保证了禁忌行为信息的全面性和准确性。获取直播端选择的直播展示对象在禁忌库中确定禁忌匹配范围,以保证禁忌行为匹配的针对性和精确度。对于获取的直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果,并将动作-物体的识别结果基于所述直播展示对象在禁忌库中进行针对性地禁忌匹配,以检测直播数据流中针对直播展示对象的禁忌行为,通过将动作和物体分别进行识别,降低了行为的识别和匹配难度,提高了识别精度和匹配精度。在匹配成功时向所述直播端发送禁忌提示,避免了主播不了解其他国家 的禁忌行为信息,出现直播禁忌的问题,提高了面向不同展示对象的直播规范性。The technical solution of this embodiment ensures the completeness and accuracy of the taboo behavior information by setting a taboo database, including the taboo behavior information of each country. Obtain the live display objects selected by the live broadcast terminal and determine the taboo matching range in the taboo library to ensure the pertinence and accuracy of taboo behavior matching. Perform action recognition and object recognition on the acquired live data stream, determine the action-object recognition result in the live data stream, and perform targeted taboo matching in the tabu library based on the action-object recognition result in the live broadcast display object, In order to detect the taboo behavior of the live display object in the live data stream, by recognizing the action and the object separately, the difficulty of behavior recognition and matching is reduced, and the recognition accuracy and matching accuracy are improved. When the matching is successful, a taboo reminder is sent to the live broadcast terminal, which avoids the problem that the anchor does not understand the taboo behavior information of other countries, and the problem of live taboo occurs, and improves the standardization of live broadcast for different display objects.
实施例二Embodiment two
图4是本申请实施例二提供的一种直播处理方法的流程示意图,在上述实施例的基础上进行了细化,可选的,在所述动作-物体的识别结果在所述禁忌库中匹配成功后,所述方法还包括:确定匹配成功的禁忌行为信息是否满足判断条件,在匹配成功的禁忌行为信息不满足判断条件的情况下,记录所述匹配成功的禁忌行为信息,并继续对下一视频帧进行动作识别以及物体识别;在匹配成功的禁忌行为信息满足判断条件的情况下,执行向所述直播端发送禁忌提示的步骤。参见图4,该方法包括:Fig. 4 is a schematic flow chart of a live broadcast processing method provided in Embodiment 2 of the present application, which is refined on the basis of the above embodiment. Optionally, the recognition result of the action-object is in the taboo library After the matching is successful, the method further includes: determining whether the successfully matched taboo behavior information satisfies the judgment condition, and if the successfully matched taboo behavior information does not meet the judgment condition, recording the successfully matched taboo behavior information, and continuing to Perform action recognition and object recognition in the next video frame; when the successfully matched taboo behavior information satisfies the judgment condition, execute the step of sending a taboo prompt to the live broadcast terminal. Referring to Figure 4, the method includes:
S210、获取直播端的直播展示对象以及直播数据流。S210. Obtain a live display object and a live data stream of the live broadcast terminal.
S220、对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果。S220. Perform action recognition and object recognition on the live data stream, and determine an action-object recognition result in the live data stream.
S230、将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息。S230. Perform tabu matching on the action-object recognition result and the live display object in a tabu library, wherein the tabu library includes taboo behavior information of multiple live display objects.
S240、若所述动作-物体的识别结果在所述禁忌库中匹配成功,则确定匹配成功的禁忌行为信息是否满足判断条件,在匹配成功的禁忌行为信息满足判断条件的情况下,,则执行步骤S260,在匹配成功的禁忌行为信息不满足判断条件的情况下,则执行步骤S250。S240. If the recognition result of the action-object is successfully matched in the taboo library, determine whether the successfully matched taboo behavior information satisfies the judgment condition, and if the successfully matched taboo behavior information satisfies the judgment condition, execute In step S260, if the successfully matched taboo behavior information does not satisfy the judgment condition, step S250 is executed.
S250、记录所述匹配成功的禁忌行为信息,返回执行步骤S220,对下一视频帧进行动作识别以及物体识别。S250. Record the successfully matched taboo behavior information, return to step S220, and perform action recognition and object recognition on the next video frame.
S260、向所述直播端发送禁忌提示。S260. Send a taboo prompt to the live broadcast terminal.
为了减少误判和避免重复提醒的情况,对每一个禁忌行为信息对应设置判断条件,该判断条件用于确定禁忌行为的程度,不同的禁忌行为信息可以是对应不同的判断条件,若满足判断条件,则触发直播端发送禁忌提示,以提示主 播进行改正,在不满足判断条件时,可以是记录所述匹配成功的禁忌行为信息,并对记录的禁忌行为信息进行条件累计,对下一视频帧进行禁忌行为信息的识别和匹配,直到满足判断条件。通过设置判断条件,对不满足判断条件的直播视频流不进行提示,避免频繁提示对直播造成干扰。In order to reduce misjudgments and avoid repeated reminders, a judgment condition is set for each taboo behavior information. The judgment condition is used to determine the degree of taboo behavior. Different taboo behavior information can correspond to different judgment conditions. If the judgment conditions are met , then the live broadcast terminal is triggered to send a taboo prompt to prompt the host to make corrections. When the judgment condition is not met, it can record the taboo behavior information that matches successfully, and perform conditional accumulation on the recorded taboo behavior information, and perform a conditional accumulation on the next video frame Identify and match taboo behavior information until the judgment condition is met. By setting the judgment conditions, no prompts will be given for the live video streams that do not meet the judgment conditions, so as to avoid frequent prompts causing interference to the live broadcast.
可选的,判断条件包括时长条件和频次条件中的一项或两项,例如,判断条件可以是匹配成功的禁忌行为信息的累计时长超过预设时长N,和/或,匹配成功的禁忌行为信息的累计频次超过预设频次M,其中,N为大于0的自然数,M为大于或等于1的正整数。通过对不同的禁忌行为信息设置不同的判断条件,以对不同禁忌行为信息进行针对性判断,对于诸如比中指等恶劣行为设置严格判断条件,例如次数为1,提高直播文明程度,对于诸如吃炸鸡等非恶劣行为设置宽松的判断条件,降低频繁提示对直播造成干扰。Optionally, the judging condition includes one or both of a duration condition and a frequency condition. For example, the judging condition may be that the cumulative duration of the successfully matched taboo behavior information exceeds the preset duration N, and/or, the successfully matched taboo behavior The cumulative frequency of information exceeds the preset frequency M, where N is a natural number greater than 0, and M is a positive integer greater than or equal to 1. By setting different judgment conditions for different taboo behavior information, we can make targeted judgments on different taboo behavior information, set strict judgment conditions for bad behaviors such as fingering, for example, the number of times is 1, and improve the degree of live broadcast civilization. Set loose judgment conditions for non-bad behaviors such as chickens to reduce the interference caused by frequent reminders to live broadcasts.
本实施例中,根据对进行检测的每一个视频帧的动作-物体的识别结果形成识别列表,该识别列表可以是包括时长识别列表和/或频次识别列表。其中,时长识别列表可以是List[{action-startTimestamp-ts}]的数据结构,action是禁忌行为,startTimestamp是触发开始时间,ts是行为的总时长,List是指该数据结构为一个有序数组。当一个直播首次触发禁忌内容时,初始化List数组,同时保存一条记录数据至数组中,其中ts值为0。例如action为献花,startTimestamp为unix时间16021313211,则保存一条记录为:{献花-16021313211-0}。基于预设时间间隔(例如每隔3秒)对视频帧进行检测,若禁忌匹配成功,则生成一条新的记录,并与List数组中最后一条记录进行比对,比对内容包括action和startTimestamp,例如,action不变,startTimestamp间隔为3秒(允许正负300ms误差),则覆盖startTimestamp,并增加ts数据3,例如数据变为{献花-16021313214-3},其中startTimestamp增加了3,ts也增加了3。如果再次检测时,action不变,startTimestamp间隔大于3秒,则覆盖startTimestamp,但不修改ts数据。通过覆盖startTimestamp,更新每一次触发禁忌行为的起始时间,便于识别当前次触发禁忌行为与上一次触发禁忌行为是否为连续触发,若当前次 触发禁忌行为的startTimestamp是上一当前次触发禁忌行为的startTimestamp与ts数据的和,表明为连续行为,可进行时长叠加,若当前次触发禁忌行为的startTimestamp不是上一当前次触发禁忌行为的startTimestamp与ts数据的和,表明为非连续行为。如果再次检测时,action变化,则重新建立一条记录,追加至有序列表末端。In this embodiment, a recognition list is formed according to an action-object recognition result of each detected video frame, and the recognition list may include a duration recognition list and/or a frequency recognition list. Among them, the duration recognition list can be the data structure of List[{action-startTimestamp-ts}], where action is a taboo behavior, startTimestamp is the trigger start time, ts is the total duration of the behavior, and List means that the data structure is an ordered array . When a live broadcast triggers taboo content for the first time, initialize the List array, and save a record data to the array at the same time, where the ts value is 0. For example, if the action is to present flowers, and the startTimestamp is the Unix time 16021313211, then a record will be saved as: {Dedicated Flowers-16021313211-0}. The video frame is detected based on the preset time interval (for example, every 3 seconds). If the taboo match is successful, a new record is generated and compared with the last record in the List array. The comparison content includes action and startTimestamp, For example, if the action remains unchanged and the startTimestamp interval is 3 seconds (plus or minus 300ms error is allowed), startTimestamp will be overwritten and ts data will be increased by 3. For example, the data will become {贵花-16021313214-3}, where startTimestamp increases by 3 and ts also increases Got 3. If the action remains the same and the startTimestamp interval is greater than 3 seconds when detecting again, the startTimestamp will be overwritten, but the ts data will not be modified. By covering the startTimestamp, update the start time of each taboo behavior triggered, which is convenient to identify whether the current taboo behavior trigger and the previous trigger taboo behavior are continuous triggers, if the startTimestamp of the current taboo behavior trigger is the previous trigger taboo behavior The sum of startTimestamp and ts data indicates a continuous behavior, and the duration can be superimposed. If the startTimestamp of the current taboo behavior is not the sum of the startTimestamp and ts data of the previous current taboo behavior, it indicates a discontinuous behavior. If the action changes when detecting again, a new record will be created and appended to the end of the ordered list.
频次识别列表可以是List[{action-count}]数据结构,其中count为累计次数。首次触发禁忌内容时,初始化List结构,并形成一条action-count数据,如{摸佛像-1},每次触发禁忌内容时,判断List中是否有同action数据,在List中有同action数据的情况下,其count+1,在List中没有同action数据的情况下,新增一条act-count数据。The frequency identification list can be a List[{action-count}] data structure, where count is the cumulative number of times. When taboo content is triggered for the first time, the List structure is initialized and an action-count data is formed, such as {touch Buddha statue-1}. Every time taboo content is triggered, it is judged whether there is the same action data in the List, and if there is the same action data in the List In this case, its count+1, if there is no same action data in the List, add an act-count data.
通过设置时长识别列表和频次识别列表,记录直播数据流中已触发禁忌行为的情况,便于基于判断条件进行判断。在一些可选实施例中,确定匹配成功的禁忌行为信息是否满足判断条件,包括:基于所述匹配成功的禁忌行为信息,更新所述禁忌行为信息的时长信息和/或频次信息;确定更新后的时长信息和/或频次信息进行是否满足所述禁忌行为信息对应的判断条件。示例性的,可以是基于匹配成功的禁忌行为信息更新时长识别列表和/或频次识别列表,确定当前的时长信息和/或频次信息。从禁忌库中提取匹配成功的禁忌行为信息对应的判断条件,将当前的时长信息和/或频次信息与提取的判断条件进行比对。示例性的,判断条件为累计时长大于15s,若当前的时长信息小于15s,则不满足判断条件,若当前的时长信息大于15s,则满足判断条件。By setting the duration recognition list and the frequency recognition list, record the situation that the taboo behavior has been triggered in the live data stream, so as to facilitate the judgment based on the judgment conditions. In some optional embodiments, determining whether the successfully matched taboo behavior information satisfies the judgment condition includes: updating the duration information and/or frequency information of the taboo behavior information based on the successfully matched taboo behavior information; Whether the duration information and/or frequency information of the taboo behavior information satisfies the judgment condition corresponding to the taboo behavior information. Exemplarily, the duration identification list and/or the frequency identification list may be updated based on the successfully matched taboo behavior information, and the current duration information and/or frequency information may be determined. Judgment conditions corresponding to successfully matched taboo behavior information are extracted from the taboo database, and current duration information and/or frequency information are compared with the extracted judgment conditions. Exemplarily, the judgment condition is that the accumulated duration is greater than 15s. If the current duration information is less than 15s, the judgment condition is not satisfied, and if the current duration information is greater than 15s, the judgment condition is satisfied.
本实施例提供的技术方案,通过在动作-物体的识别结果在禁忌库中匹配成功后,确定匹配成功的禁忌行为信息是否满足判断条件,并在满足判断条件时触发对直播端的提示,对不满足判断条件的直播视频流不进行提示,避免频繁提示对直播造成干扰。In the technical solution provided in this embodiment, after the action-object recognition result is successfully matched in the taboo library, it is determined whether the successfully matched taboo behavior information meets the judgment condition, and when the judgment condition is met, the prompt to the live broadcast terminal is triggered. Live video streams that meet the judgment conditions will not be prompted to avoid frequent reminders causing interference to the live broadcast.
实施例三Embodiment three
图4为本申请实施例三提供的一种直播处理装置的结构示意图,该直播处理装置可以是配置在主播平台或者直播服务器中,该装置包括:FIG. 4 is a schematic structural diagram of a live broadcast processing device provided in Embodiment 3 of the present application. The live broadcast processing device may be configured in a host platform or a live broadcast server. The device includes:
直播数据流获取模块310,设置为获取直播端的直播展示对象以及直播数据流;The live data stream obtaining module 310 is configured to obtain the live display object and the live data stream at the live end;
视频帧识别模块320,设置为对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果;The video frame recognition module 320 is configured to perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
禁忌匹配模块330,设置为将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息;The taboo matching module 330 is configured to perform taboo matching on the action-object recognition result and the live display object in a tabu library, wherein the taboo library includes taboo behavior information of multiple live display objects;
禁忌提示模块340,设置为若所述动作-物体的识别结果在所述禁忌库中匹配成功,则向所述直播端发送禁忌提示。The taboo prompting module 340 is configured to send a taboo prompt to the live broadcast terminal if the recognition result of the action-object is successfully matched in the tabu library.
在上述实施例的基础上,视频帧识别模块320设置为:On the basis of the foregoing embodiments, the video frame identification module 320 is set to:
基于预设时间间隔确定所述直播数据流中进行检测的视频帧;Determining video frames for detection in the live data stream based on a preset time interval;
对所述视频帧进行动作识别,得到动作识别结果;Perform motion recognition on the video frame to obtain a motion recognition result;
对所述视频帧进行物体识别,所述物体识别结果,其中,所述物体识别结果包括物体类型和物体属性;Performing object recognition on the video frame, the object recognition result, wherein the object recognition result includes object type and object attribute;
基于所述视频帧的动作识别结果和物体识别结果得到所述直播数据流在当前的动作-物体的识别结果。The current action-object recognition result of the live data stream is obtained based on the action recognition result and the object recognition result of the video frame.
在上述实施例的基础上,禁忌匹配模块330设置为:On the basis of the foregoing embodiments, the contraindication matching module 330 is set to:
基于所述直播展示对象在所述禁忌库中确定匹配范围,其中,所述匹配范围包括所述直播展示对象的禁忌行为信息;Determining a matching range in the taboo library based on the live broadcast display object, wherein the matching range includes taboo behavior information of the live broadcast display object;
将所述动作-物体的识别结果在所述直播展示对象的匹配范围内进行匹配。Matching the recognition result of the action-object within the matching range of the live display object.
在上述实施例的基础上,该装置还包括:On the basis of the foregoing embodiments, the device also includes:
禁忌判断模块,设置为在所述动作-物体的识别结果在所述禁忌库中匹配成功后,确定匹配成功的禁忌行为信息是否满足判断条件,响应于匹配成功的禁忌行为信息不满足判断条件,记录所述匹配成功的禁忌行为信息,并继续对下 一视频帧进行动作识别以及物体识别;响应于匹配成功的禁忌行为信息满足判断条件,执行向所述直播端发送禁忌提示的步骤。The taboo judgment module is configured to determine whether the successfully matched taboo behavior information satisfies the judgment condition after the recognition result of the action-object is successfully matched in the taboo library, and in response to the successfully matched taboo behavior information not meeting the judgment condition, Record the successfully matched taboo behavior information, and continue to perform action recognition and object recognition on the next video frame; in response to the successfully matched taboo behavior information meeting the judgment condition, execute the step of sending a taboo prompt to the live broadcast terminal.
在上述实施例的基础上,所述判断条件包括时长条件和频次条件中的一项或两项;On the basis of the above embodiments, the judgment conditions include one or two of duration conditions and frequency conditions;
禁忌判断模块设置为:The taboo judgment module is set to:
所述确定匹配成功的禁忌行为信息是否满足判断条件,包括:The determining whether the matching taboo behavior information satisfies the judging conditions includes:
基于所述匹配成功的禁忌行为信息,更新所述禁忌行为信息的时长信息和/或频次信息;Based on the successfully matched taboo behavior information, update the duration information and/or frequency information of the taboo behavior information;
确定更新后的时长信息和/或频次信息是否满足所述禁忌行为信息对应的判断条件。It is determined whether the updated duration information and/or frequency information satisfies the judging condition corresponding to the taboo behavior information.
在上述实施例的基础上,禁忌提示模块340设置为:On the basis of the foregoing embodiments, the contraindication prompting module 340 is set to:
提取所述匹配成功的禁忌行为信息的提示内容,将所述提示内容发送至所述直播端,以使所述直播端展示所述提示内容。Extracting the prompt content of the successfully matched taboo behavior information, and sending the prompt content to the live broadcast terminal, so that the live broadcast terminal displays the prompt content.
在上述实施例的基础上,所述提示内容包括文本、图片和视频的至少一项。Based on the above embodiments, the prompt content includes at least one of text, picture and video.
本申请实施例所提供的直播处理装置可执行本申请任意实施例所提供的直播处理方法,具备执行方法相应的功能模块。The live broadcast processing device provided in the embodiment of the present application can execute the live broadcast processing method provided in any embodiment of the present application, and has corresponding functional modules for executing the method.
实施例四Embodiment Four
图6为本申请实施例四提供的一种电子设备的结构示意图。图6示出了适于用来实现本申请实施方式的电子设备12的框图。图6显示的电子设备12仅仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。设备12典型的是承担图像分类功能的电子设备。FIG. 6 is a schematic structural diagram of an electronic device provided in Embodiment 4 of the present application. FIG. 6 shows a block diagram of an electronic device 12 suitable for implementing embodiments of the present application. The electronic device 12 shown in FIG. 6 is only an example, and should not limit the functions and scope of use of the embodiment of the present application. Device 12 is typically an electronic device undertaking image classification functions.
如图6所示,电子设备12以通用计算设备的形式表现。电子设备12的组件可以包括但不限于:至少一个处理器16,存储装置28,连接不同系统组件(包括存储装置28和处理器16)的总线18。As shown in FIG. 6, electronic device 12 takes the form of a general-purpose computing device. Components of the electronic device 12 may include, but are not limited to: at least one processor 16, a storage device 28, and a bus 18 connecting various system components (including the storage device 28 and the processor 16).
总线18表示几类总线结构中的一种或多种,包括存储器总线或者存储器控 制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(Industry Standard Architecture,ISA)总线,微通道体系结构(Micro Channel Architecture,MCA)总线,增强型ISA总线、视频电子标准协会(Video Electronics Standards Association,VESA)局域总线以及外围组件互连(Peripheral Component Interconnect,PCI)总线。 Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or a local bus using any of a variety of bus structures. For example, these architectures include but are not limited to Industry Standard Architecture (Industry Standard Architecture, ISA) bus, Micro Channel Architecture (Micro Channel Architecture, MCA) bus, Enhanced ISA bus, Video Electronics Standards Association (Video Electronics Standards Association, VESA) local bus and peripheral component interconnect (Peripheral Component Interconnect, PCI) bus.
电子设备12典型地包括多种计算机系统可读介质。这些介质可以是任何能够被电子设备12访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。 Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12 and include both volatile and nonvolatile media, removable and non-removable media.
存储装置28可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(Random Access Memory,RAM)30和/或高速缓存存储器32。电子设备12可以进一步包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统34可以用于读写不可移动的、非易失性磁介质(图6未显示,通常称为“硬盘驱动器”)。尽管图6中未示出,可以提供用于对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性光盘(例如只读光盘(Compact Disc-Read Only Memory,CD-ROM)、数字视盘(Digital Video Disc-Read Only Memory,DVD-ROM)或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过至少一个数据介质接口与总线18相连。存储装置28可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本申请各实施例的功能。 Storage device 28 may include computer system readable media in the form of volatile memory, such as random access memory (Random Access Memory, RAM) 30 and/or cache memory 32 . The electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in FIG. 6, commonly referred to as a "hard drive"). Although not shown in FIG. 6, a disk drive for reading and writing to a removable nonvolatile disk (such as a "floppy disk") may be provided, as well as a removable nonvolatile disk (such as a Compact Disc- Read Only Memory, CD-ROM), Digital Video Disc (Digital Video Disc-Read Only Memory, DVD-ROM) or other optical media) CD-ROM drive. In these cases, each drive may be connected to bus 18 via at least one data medium interface. The storage device 28 may include at least one program product having a set (for example, at least one) of program modules configured to perform the functions of the various embodiments of the present application.
具有一组(至少一个)程序模块26的程序36,可以存储在例如存储装置28中,这样的程序模块26包括但不限于操作系统、至少一个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网关环境的实现。程序模块26通常执行本申请所描述的实施例中的功能和/或方法。A program 36 having a set (at least one) of program modules 26, such as but not limited to an operating system, at least one application program, other program modules, and program data, may be stored, for example, in storage device 28, in which case Each or some combination of these may include implementations of gateway environments. Program modules 26 generally perform functions and/or methods in the embodiments described herein.
电子设备12也可以与至少一个外部设备14(例如键盘、指向设备、摄像头、 显示器24等)通信,还可与至少一个使得用户能与该电子设备12交互的设备通信,和/或与使得该电子设备12能与至少一个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口22进行。并且,电子设备12还可以通过网关适配器20与至少一个网关(例如局域网(Local Area Network,LAN),广域网Wide Area Network,WAN)和/或公共网关,例如因特网)通信。如图所示,网关适配器20通过总线18与电子设备12的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备12使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、磁盘阵列(Redundant Arrays of Independent Disks,RAID)系统、磁带驱动器以及数据备份存储系统等。The electronic device 12 can also communicate with at least one external device 14 (such as a keyboard, a pointing device, a camera, a display 24, etc.), and also communicate with at least one device that enables a user to interact with the electronic device 12, and/or communicate with a device that enables the user to interact with the electronic device 12. Electronic device 12 is capable of communicating with any device (eg, network card, modem, etc.) that communicates with at least one other computing device. Such communication may occur through input/output (I/O) interface 22 . Moreover, the electronic device 12 can also communicate with at least one gateway (such as a local area network (Local Area Network, LAN), wide area network, Wide Area Network, WAN) and/or a public gateway, such as the Internet, through the gateway adapter 20. As shown, gateway adapter 20 communicates with other modules of electronic device 12 via bus 18 . It should be appreciated that although not shown, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, disk arrays (Redundant Arrays) of Independent Disks, RAID) systems, tape drives, and data backup storage systems.
处理器16通过运行存储在存储装置28中的程序,从而执行各种功能应用以及数据处理,例如实现本申请上述实施例所提供的直播处理方法。The processor 16 executes various functional applications and data processing by running the programs stored in the storage device 28 , for example, realizing the live broadcast processing method provided by the above-mentioned embodiments of the present application.
实施例五Embodiment five
本申请实施例五提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请实施例所提供的直播处理方法。Embodiment 5 of the present application provides a computer-readable storage medium, on which a computer program is stored. When the program is executed by a processor, the live broadcast processing method provided in the embodiment of the present application is implemented.
当然,本申请实施例所提供的一种计算机可读存储介质,其上存储的计算机程序不限于如上所述的方法操作,还可以执行本申请任意实施例所提供的直播处理方法。Of course, the computer-readable storage medium provided by the embodiment of the present application, the computer program stored thereon is not limited to the method operation described above, and can also execute the live broadcast processing method provided by any embodiment of the present application.
本申请实施例的计算机存储介质,可以采用至少一个计算机可读的介质的任意组合。计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:具有至少一个导线的电连接、便携式计算机磁盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器((Erasable Programmable Read-Only Memory,EPROM)或闪 存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本文件中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。The computer storage medium in the embodiments of the present application may use any combination of at least one computer-readable medium. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples (non-exhaustive list) of computer-readable storage media include: electrical connections having at least one lead, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable Read-Only Memory ((Erasable Programmable Read-Only Memory, EPROM) or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above . In this document, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的源代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。A computer readable signal medium may include a data signal carrying computer readable source code in baseband or as part of a carrier wave traveling as a data signal. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can send, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device. .
计算机可读介质上包含的源代码可以用任何适当的介质传输,包括——但不限于无线、电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。Source code embodied on a computer readable medium may be transmitted using any appropriate medium, including - but not limited to wireless, wire, optical cable, radio frequency (RF), etc., or any suitable combination of the foregoing.
可以以一种或多种程序设计语言或其组合来编写用于执行本申请操作的计算机源代码,所述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。源代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网关——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer source code for carrying out the operations of this application may be written in one or more programming languages, or combinations thereof, including object-oriented programming languages—such as Java, Smalltalk, C++, and conventional Procedural Programming Language - such as "C" or a similar programming language. The Source Code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In cases involving a remote computer, the remote computer may be connected to the user computer through any kind of gateway, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (such as through an Internet Service Provider). Internet connection).

Claims (10)

  1. 一种直播处理方法,包括:A live broadcast processing method, comprising:
    获取直播端的直播展示对象以及直播数据流;Obtain the live display object and live data stream of the live broadcast terminal;
    对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果;Perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
    将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息;Performing tabu matching in the taboo library with the recognition result of the action-object and the live display object, wherein the taboo library includes taboo behavior information of a plurality of live display objects;
    响应于所述动作-物体的识别结果在所述禁忌库中匹配成功,向所述直播端发送禁忌提示。In response to the recognition result of the action-object being successfully matched in the taboo database, a taboo prompt is sent to the live broadcast terminal.
  2. 根据权利要求1所述的方法,其中,所述对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果,包括:The method according to claim 1, wherein the performing action recognition and object recognition on the live data stream, and determining the action-object recognition result in the live data stream include:
    基于预设时间间隔确定所述直播数据流中进行检测的视频帧;Determining video frames for detection in the live data stream based on a preset time interval;
    对所述视频帧进行动作识别,得到动作识别结果;Perform motion recognition on the video frame to obtain a motion recognition result;
    对所述视频帧进行物体识别,所述物体识别结果,其中,所述物体识别结果包括物体类型和物体属性;Performing object recognition on the video frame, the object recognition result, wherein the object recognition result includes object type and object attribute;
    基于所述视频帧的动作识别结果和物体识别结果得到所述直播数据流在当前的动作-物体的识别结果。The current action-object recognition result of the live data stream is obtained based on the action recognition result and the object recognition result of the video frame.
  3. 根据权利要求1所述的方法,其中,所述将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,包括:The method according to claim 1, wherein, performing tabu matching in the tabu library of the recognition result of the action-object and the live display object includes:
    基于所述直播展示对象在所述禁忌库中确定匹配范围,其中,所述匹配范围包括所述直播展示对象的禁忌行为信息;Determining a matching range in the taboo library based on the live broadcast display object, wherein the matching range includes taboo behavior information of the live broadcast display object;
    将所述动作-物体的识别结果在所述直播展示对象的匹配范围内进行匹配。Matching the recognition result of the action-object within the matching range of the live display object.
  4. 根据权利要求1所述的方法,在所述动作-物体的识别结果在所述禁忌库中匹配成功后,所述方法还包括:According to the method according to claim 1, after the recognition result of the action-object is successfully matched in the tabu library, the method further comprises:
    确定匹配成功的禁忌行为信息是否满足判断条件,响应于匹配成功的禁忌行为信息不满足判断条件,记录所述匹配成功的禁忌行为信息,并继续对下一视频帧进行动作识别以及物体识别;Determine whether the successfully matched taboo behavior information satisfies the judgment condition, and record the successfully matched taboo behavior information in response to the successfully matched taboo behavior information not meeting the judgment condition, and continue to perform action recognition and object recognition on the next video frame;
    响应于匹配成功的禁忌行为信息满足判断条件,向所述直播端发送禁忌提示。In response to the successfully matched taboo behavior information meeting the judgment condition, a taboo reminder is sent to the live broadcast terminal.
  5. 根据权利要求4所述的方法,其中,所述判断条件包括时长条件和频次条件中的一项或两项;The method according to claim 4, wherein the judgment condition includes one or both of a duration condition and a frequency condition;
    所述确定匹配成功的禁忌行为信息是否满足判断条件,包括:The determining whether the matching taboo behavior information satisfies the judging conditions includes:
    基于所述匹配成功的禁忌行为信息,更新所述禁忌行为信息的时长信息和频次信息中的至少之一;Updating at least one of duration information and frequency information of the taboo behavior information based on the successfully matched taboo behavior information;
    确定更新后的时长信息和频次信息中的至少之一是否满足所述禁忌行为信息对应的判断条件。It is determined whether at least one of the updated duration information and frequency information satisfies the judging condition corresponding to the taboo behavior information.
  6. 根据权利要求1所述的方法,其中,所述向所述直播端发送禁忌提示,包括:The method according to claim 1, wherein the sending a contraindication prompt to the live broadcast terminal comprises:
    提取所述匹配成功的禁忌行为信息的提示内容,将所述提示内容发送至所述直播端,以使所述直播端展示所述提示内容。Extracting the prompt content of the successfully matched taboo behavior information, and sending the prompt content to the live broadcast terminal, so that the live broadcast terminal displays the prompt content.
  7. 根据权利要求6所述的方法,其中,所述提示内容包括文本、图片和视频中的至少一项。The method according to claim 6, wherein the prompt content includes at least one of text, picture and video.
  8. 一种直播处理装置,包括:A live broadcast processing device, comprising:
    直播数据流获取模块,设置为获取直播端的直播展示对象以及直播数据流;The live data stream acquisition module is configured to acquire the live display objects and live data streams at the live end;
    视频帧识别模块,设置为对所述直播数据流进行动作识别以及物体识别,确定直播数据流中动作-物体的识别结果;The video frame recognition module is configured to perform action recognition and object recognition on the live data stream, and determine the action-object recognition result in the live data stream;
    禁忌匹配模块,设置为将所述动作-物体的识别结果以及所述直播展示对象,在禁忌库中进行禁忌匹配,其中,所述禁忌库中包括多个直播展示对象的禁忌行为信息;The taboo matching module is configured to perform taboo matching on the recognition result of the action-object and the live display object in a tabu library, wherein the taboo library includes taboo behavior information of multiple live display objects;
    禁忌提示模块,设置为若所述动作-物体的识别结果在所述禁忌库中匹配成功,则向所述直播端发送禁忌提示。The taboo prompting module is configured to send a taboo prompt to the live broadcast terminal if the recognition result of the action-object is successfully matched in the tabu database.
  9. 一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如权利要求1-7中任 一所述的直播处理方法。An electronic device, comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, when the processor executes the computer program, the live broadcast according to any one of claims 1-7 is realized Approach.
  10. 一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1-7中任一所述的直播处理方法。A computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the live broadcast processing method according to any one of claims 1-7 is implemented.
PCT/CN2022/098645 2021-06-15 2022-06-14 Live streaming processing method and apparatus, storage medium, and electronic device WO2022262719A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110662255.4 2021-06-15
CN202110662255.4A CN113329260B (en) 2021-06-15 2021-06-15 Live broadcast processing method and device, storage medium and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022262719A1 true WO2022262719A1 (en) 2022-12-22

Family

ID=77420847

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/098645 WO2022262719A1 (en) 2021-06-15 2022-06-14 Live streaming processing method and apparatus, storage medium, and electronic device

Country Status (2)

Country Link
CN (1) CN113329260B (en)
WO (1) WO2022262719A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113329260B (en) * 2021-06-15 2024-04-09 北京沃东天骏信息技术有限公司 Live broadcast processing method and device, storage medium and electronic equipment
CN115546824B (en) * 2022-04-18 2023-11-28 荣耀终端有限公司 Taboo picture identification method, apparatus and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106060653A (en) * 2016-06-08 2016-10-26 武汉鸿瑞达信息技术有限公司 Method for filtering improper action in live internet video
CN107864401A (en) * 2017-11-08 2018-03-30 北京密境和风科技有限公司 It is a kind of based on live monitoring method, device, system and terminal device
CN110059661A (en) * 2019-04-26 2019-07-26 腾讯科技(深圳)有限公司 Action identification method, man-machine interaction method, device and storage medium
US10893329B1 (en) * 2019-09-03 2021-01-12 International Business Machines Corporation Dynamic occlusion of livestreaming
CN113329260A (en) * 2021-06-15 2021-08-31 北京沃东天骏信息技术有限公司 Live broadcast processing method and device, storage medium and electronic equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9767259B2 (en) * 2012-05-07 2017-09-19 Google Inc. Detection of unauthorized content in live multiuser composite streams
WO2014172624A2 (en) * 2013-04-19 2014-10-23 James Carey Video identification and analytical recognition system
CN107682719A (en) * 2017-09-05 2018-02-09 广州数沃信息科技有限公司 A kind of monitoring and assessing method and device of live content health degree
JP7035395B2 (en) * 2017-09-13 2022-03-15 沖電気工業株式会社 Anomaly detection system, information processing device, and anomaly detection method
WO2020090697A1 (en) * 2018-10-28 2020-05-07 株式会社Dapリアライズ Remote live video amusement facility and method for billing user using said remote live video amusement facility
CN111147880A (en) * 2019-12-30 2020-05-12 广州华多网络科技有限公司 Interaction method, device and system for live video, electronic equipment and storage medium
CN111222450B (en) * 2020-01-02 2023-06-20 广州虎牙科技有限公司 Model training and live broadcast processing method, device, equipment and storage medium
CN112511855A (en) * 2020-11-28 2021-03-16 南京雄雉电子商务有限公司 Cloud computing-based electronic commerce platform live video intelligent management system
CN112217841A (en) * 2020-12-09 2021-01-12 平安国际智慧城市科技股份有限公司 Live broadcast room management method and device, computer equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106060653A (en) * 2016-06-08 2016-10-26 武汉鸿瑞达信息技术有限公司 Method for filtering improper action in live internet video
CN107864401A (en) * 2017-11-08 2018-03-30 北京密境和风科技有限公司 It is a kind of based on live monitoring method, device, system and terminal device
CN110059661A (en) * 2019-04-26 2019-07-26 腾讯科技(深圳)有限公司 Action identification method, man-machine interaction method, device and storage medium
US10893329B1 (en) * 2019-09-03 2021-01-12 International Business Machines Corporation Dynamic occlusion of livestreaming
CN113329260A (en) * 2021-06-15 2021-08-31 北京沃东天骏信息技术有限公司 Live broadcast processing method and device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN113329260A (en) 2021-08-31
CN113329260B (en) 2024-04-09

Similar Documents

Publication Publication Date Title
CN109547819B (en) Live list display method and device and electronic equipment
WO2022262719A1 (en) Live streaming processing method and apparatus, storage medium, and electronic device
CN108683937B (en) Voice interaction feedback method and system for smart television and computer readable medium
US9860593B2 (en) Devices, systems, methods, and media for detecting, indexing, and comparing video signals from a video display in a background scene using a camera-enabled device
CN109408672B (en) Article generation method, article generation device, server and storage medium
CN109729429B (en) Video playing method, device, equipment and medium
US11100368B2 (en) Accelerated training of an image classifier
US10769247B2 (en) System and method for interacting with information posted in the media
CN111372116B (en) Video playing prompt information processing method and device, electronic equipment and storage medium
CN109286848B (en) Terminal video information interaction method and device and storage medium
US20240062581A1 (en) Obtaining artist imagery from video content using facial recognition
WO2014100936A1 (en) Method, platform, and system for manufacturing associated information library of video and for playing video
CN111754254A (en) System and method for dynamically creating and inserting immersive promotional content in multimedia
US20120150990A1 (en) System and method for synchronizing with multimedia broadcast program and computer program product thereof
US20140012792A1 (en) Systems and methods for building a virtual social network
US10936878B2 (en) Method and device for determining inter-cut time range in media item
US9148708B2 (en) Automated statutory warning system to display disclaimers
CN111741333B (en) Live broadcast data acquisition method and device, computer equipment and storage medium
KR20190119240A (en) Apparatus and method for providing meta-data
CN113361486A (en) Multi-pose face recognition method and device, storage medium and electronic equipment
CN113784058A (en) Image generation method and device, storage medium and electronic equipment
CN109819279B (en) Monitoring method, device, equipment and storage medium for media information delivery
US9084011B2 (en) Method for advertising based on audio/video content and method for creating an audio/video playback application
CN112165626A (en) Image processing method, resource acquisition method, related device and medium
CN114071184A (en) Subtitle positioning method, electronic equipment and medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22824198

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE