WO2016192501A1 - Video search method and apparatus - Google Patents

Video search method and apparatus

Info

Publication number
WO2016192501A1
WO2016192501A1 PCT/CN2016/080770 CN2016080770W WO2016192501A1 WO 2016192501 A1 WO2016192501 A1 WO 2016192501A1 CN 2016080770 W CN2016080770 W CN 2016080770W WO 2016192501 A1 WO2016192501 A1 WO 2016192501A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
keyword
key frame
information
picture
Prior art date
Application number
PCT/CN2016/080770
Other languages
French (fr)
Chinese (zh)
Inventor
周茂林
张衎
付贤会
Original Assignee
中兴通讯股份有限公司
Priority date
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2016192501A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data

Definitions

  • the invention relates to, but is not limited to, the field of video technology.
  • picture recognition technology on mobile phones has many mature applications. For example, a mobile phone may hold a large number of photos, and organizing them one by one is troublesome. Some applications can automatically scan the photo album and find the desired photos based on keywords, which brings convenience to the user.
  • the embodiment of the invention provides a video search method and device, and a video playing method and device, which enable a user to search for a video conveniently and quickly.
  • An embodiment of the present invention provides a video search method, which is applied to a server, and includes the following steps:
  • the keyword sent by the terminal is received, and the key frame of the corresponding video is searched according to the keyword sent by the terminal and the first correspondence.
  • the method further includes:
  • before receiving the keyword sent by the terminal, the method further includes: acquiring a second correspondence between the video information and a key frame of the video;
  • before sending, to the terminal, the video information corresponding to the key frame found according to the keyword sent by the terminal and the first correspondence, the method further includes:
  • the received keyword includes: a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture including a video picture.
  • the first correspondence between the key frame of the acquired video and the keyword includes:
  • the keyword of the key frame further includes: at least one of text in the key frame, main content in the key frame, and the proportion of the key frame occupied by the main content.
  • before receiving the picture keyword sent by the terminal, the method further includes:
  • the method further includes:
  • the found time information is sent to the terminal.
  • the video information includes: video content information or video resource location information.
  • the embodiment of the invention further provides a video search method, which is applied to the terminal, and includes the following steps:
  • the keyword is sent to a server for the server to find a key frame of the video according to the keyword.
  • the step of acquiring a keyword includes:
  • the method further includes:
  • the video is played according to the video information.
  • the method further includes: receiving time information sent by the server;
  • the step of performing video playback according to the video information includes:
  • the video is played according to the video information and the time information.
  • the method further includes:
  • the method further includes:
  • the embodiment of the invention further provides a video search device, which is applied to a server, and includes:
  • the first acquiring module is configured to acquire a first correspondence between a key frame of the video and the keyword
  • the first search module is configured to receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the video search device further includes: a first sending module
  • the first sending module is configured to send video information corresponding to the found key frame to the terminal.
  • the embodiment of the invention further provides a video search device, which is applied to the terminal, and includes:
  • the second obtaining module is configured to acquire a keyword
  • the second sending module is configured to send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the acquiring, by the second acquiring module, the keyword includes: acquiring a picture that includes a video image;
  • the embodiment of the present invention provides a video search method and device.
  • the video search method of the embodiment of the present invention includes: acquiring a first correspondence between a key frame of a video and a keyword; and receiving a keyword sent by the terminal, and searching for the key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence. With this method, the user terminal only needs to send the keyword to the server, and the server automatically finds the key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence. Since the key frame of a video can represent the video, the corresponding video can be found. The user only needs to acquire the keyword and send it to the server, so the operation is simple.
  • the solution of the embodiment of the invention is fast, reduces the difficulty of the video search, and improves the user experience.
  • the video search method of the embodiment is based on image recognition technology: the user can quickly obtain the corresponding video information simply by acquiring a picture that includes the video picture, so the operation is simple and fast.
  • the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.
  • FIG. 1 is a schematic flowchart of a first video search method according to Embodiment 1 of the present invention
  • FIG. 2 is a schematic flowchart of a second video search method according to Embodiment 1 of the present invention.
  • FIG. 3 is a schematic flowchart of a third video search method according to Embodiment 1 of the present invention.
  • FIG. 4 is a schematic flowchart of a fourth video search method according to Embodiment 1 of the present invention.
  • FIG. 5 is a schematic flowchart of a video search method according to Embodiment 2 of the present invention.
  • FIG. 6 is a schematic flowchart of a video playing method according to Embodiment 2 of the present invention.
  • FIG. 7 is a schematic flowchart diagram of another video playing method according to Embodiment 2 of the present invention.
  • FIG. 8 is a schematic flowchart of video search and playback according to Embodiment 3 of the present invention.
  • FIG. 9 is a schematic flowchart of another video search and play according to Embodiment 3 of the present invention.
  • FIG. 10 is a schematic structural diagram of a first video search apparatus according to Embodiment 4 of the present invention.
  • FIG. 11 is a schematic structural diagram of a second video search apparatus according to Embodiment 4 of the present invention.
  • FIG. 12 is a schematic structural diagram of a third video search apparatus according to Embodiment 4 of the present invention.
  • Embodiment 1:
  • in view of the problem in the related art, this embodiment provides a video search method, which is applied to the server side and, as shown in FIG. 1, includes the following steps:
  • Step 101 Acquire a first correspondence between a key frame of the video and a keyword.
  • the manner in which the first correspondence is obtained may be multiple.
  • the correspondence between the key frames of the video and the keywords may be established by another device and then obtained by the server from that device, or it may be established by the server itself.
  • the key frame of the video is a frame picture; for example, it may be an independent and complete frame picture within a GOP (Group of Pictures).
  • the key frame of the video can represent the video, so finding the key frame of the video identifies the corresponding video.
  • the method in this embodiment may separately acquire key frames of the video for the video, and then perform image recognition on all key frames to obtain keywords of each key frame and save the keywords, and finally establish a correspondence between the key frames and the keywords.
  • the image recognition of the key frame is based on the knowledge base content and the image recognition mode, and different knowledge base contents and image recognition methods may obtain different keywords.
  • the knowledge base is a structured, easy-to-operate, easy-to-use, comprehensive, and organized knowledge cluster in knowledge engineering. It is a collection of interrelated pieces of knowledge that are stored, organized, managed, and used in computer memory, using one or more knowledge representation methods, to meet the problem-solving needs of one or more fields.
  • the keyword of the key frame in this embodiment may include at least one of text in the key frame, main content in the key frame, and the proportion of the key frame occupied by the main content.
  • that is, the keyword of the key frame in this embodiment may be the text information in the key frame of the video, the main content, and the proportion of the image occupied by the main content. This set of key information may be used to identify a picture.
  • the form of the first correspondence between the key frame of the video and the keyword in the embodiment may include: a keyword index of the key frame, the index value is a keyword, and the index object is a key frame.
  • the keyword sent by the terminal may be a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture that includes the video picture. For example, the user terminal may itself perform image recognition on a picture that includes the video picture (such as a screenshot of the video picture or a photograph of the video picture) to obtain the keyword.
  • alternatively, another device may perform the image recognition on such a picture to obtain the keyword, after which the terminal obtains the keyword from that device and forwards it to the server.
  • the image recognition method used on the terminal side must be consistent with the one the server side uses on the key frames. Otherwise, the recognized keyword content differs and the video cannot be matched accurately.
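  • The keyword index described above (index value: keyword; index object: key frame) can be sketched as an inverted index. This is only an illustrative sketch: the frame identifiers, the `text:`/`subject:` keyword strings, and the ranking by number of matched keywords are assumptions, not details given in the source.

```python
from collections import defaultdict


def build_keyword_index(frame_keywords):
    """Build the first correspondence: keyword -> set of key frame ids."""
    index = defaultdict(set)
    for frame_id, keywords in frame_keywords.items():
        for kw in keywords:
            index[kw].add(frame_id)
    return index


def find_key_frames(index, query_keywords):
    """Return key frame ids, best match first (most query keywords matched)."""
    hits = defaultdict(int)
    for kw in query_keywords:
        for frame_id in index.get(kw, ()):
            hits[frame_id] += 1
    # Sort by descending hit count, then by id so the order is deterministic.
    return sorted(hits, key=lambda f: (-hits[f], f))


# Hypothetical keywords, as image recognition might produce for three key frames.
frame_keywords = {
    "v1_f0": {"text:NEWS", "subject:anchor"},
    "v1_f1": {"text:WEATHER", "subject:map"},
    "v2_f0": {"text:NEWS", "subject:stadium"},
}
index = build_keyword_index(frame_keywords)
print(find_key_frames(index, ["text:NEWS", "subject:anchor"]))
```

  • Because the terminal and the server must recognize pictures consistently, a sketch like this only works if both sides emit keywords in the same vocabulary.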
  • Step 102 Receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the picture keyword sent by the terminal is received, and the video corresponding to the picture keyword is searched according to the picture keyword and the first correspondence; the picture keyword is a keyword related to the video picture that the terminal acquires by performing image recognition on a picture that includes the video picture.
  • because the key frame of the video can characterize the video, finding the key frame of the video identifies the corresponding video.
  • one or more key frames corresponding to the keyword may be found; the key frames found may be key frames in one video or key frames in a group of videos, and the group of videos may be the set of videos most strongly associated with the keyword.
  • the user terminal only needs to obtain a picture including a video picture (for example, by photographing the video picture or taking a screenshot), perform image recognition on the picture to obtain a keyword related to the video picture, and send the keyword to the server.
  • the server automatically finds the key frame of the corresponding video according to the keyword sent by the terminal and the stored correspondence. Since the key frame can represent the video, finding the key frame of the video finds the video. The user only needs to obtain a picture that includes the video picture to quickly obtain the corresponding video information.
  • the operation is simple and fast.
  • with the method of the embodiment, the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.
  • the embodiment further provides a video search method, which is applied to the server side, and includes:
  • Step 201 Establish a first correspondence between a key frame of the video and the keyword.
  • a keyword index of a key frame is established, the index value is a keyword, and the index object is a key frame of the video.
  • Step 202 Receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the keyword sent by the terminal includes a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture that includes the video picture.
  • the key frame of the corresponding video is retrieved in the keyword index of the key frame according to the picture keyword.
  • Step 203 Send video information corresponding to the found key frame to the terminal.
  • the video information of this step may include: video content information, or video resource location information.
  • the video content information corresponding to the key frame may be sent to the terminal for the terminal to play directly, or for the terminal to send to another playback device for playing.
  • alternatively, the video resource location information corresponding to the key frame, for example a URI (Uniform Resource Identifier), may be sent to the terminal, so that the terminal acquires the corresponding video content according to the identifier information, or sends the identifier information to another playback device.
  • the other playback device then acquires the corresponding video content according to the identifier information for playing.
  • the video information corresponding to the key frames may also be one or more items.
  • the found video may be one video or a group of videos; accordingly, the method of this embodiment sends the information of the one video to the terminal, or sends the information of each video in the group to the terminal.
  • the method in this embodiment may obtain a second correspondence between the video information and the key frame of the video before step 202.
  • in this case, step 203 may include: finding the corresponding video information according to the found key frame and the second correspondence; and sending the found video information to the terminal.
  • the process of obtaining the second correspondence between the video information and the key frame of the video may be established by the user terminal by itself; the second correspondence may be established by other devices, and then the user terminal obtains the second correspondence.
  • in the second correspondence, there is a one-to-one correspondence between video information and video.
  • the second correspondence may be a key frame index of the video information, where the index value is a key frame and the index object is video information; after the key frame of the video is found, the key frame index of the video information is searched to match the corresponding video information.
  • the user terminal only needs to obtain a picture including a video picture (for example, capturing a video picture or taking a screen shot, etc.), and then performing image recognition on the picture to obtain a keyword related to the video picture and sending the keyword to the server.
  • the server automatically finds the corresponding video according to the keyword sent by the terminal and the stored correspondence relationship, and feeds back the search result to the terminal.
  • the video search method in the embodiment of the present invention is based on image recognition technology; the user only needs to obtain a picture that includes the video picture to quickly obtain the corresponding video information, and the operation is simple and fast.
  • the method of the embodiment is used, and the user does not need to memorize the keyword information of the search video, thereby reducing the difficulty of the video search and improving the user experience.
  • this embodiment further provides another video search method, which is applied to the server side, and includes the following steps:
  • Step 300 Acquire a key frame of the video; perform the image recognition on the key frame to obtain a keyword of the key frame.
  • Step 301 Establish a first correspondence between the video key frame and the keyword and a second correspondence between the video information and the video key frame.
  • the correspondences in this embodiment may be established as indexes, for example, first establishing a key frame index of the video (i.e., the second correspondence), and then establishing a keyword index of the key frame (i.e., the first correspondence). In the key frame index of the video, the index value is a key frame and the index object is video information (including video content information or resource location information); in the keyword index of the key frame, the index value is a keyword and the index object is a key frame. After the terminal sends a keyword, the keyword index of the key frame is searched first to match the corresponding key frame, and then the key frame index of the video is searched to match the corresponding video information.
  • Step 302 Receive a picture keyword sent by the terminal, and search for a video key frame corresponding to the picture keyword according to the picture keyword and the first correspondence.
  • the key frame corresponding to the picture keyword is matched in the keyword index of the key frame by using the picture keyword.
  • Step 303 Search for video information corresponding to the video key frame according to the found video key frame and the second correspondence.
  • the key frame is used to match the video corresponding to the key frame in the index of the key frame of the video.
  • Step 304 Send the found video information to the terminal.
  • the media content of the video may be sent to the terminal for playing, or the URI may be sent to the terminal for the terminal to acquire the corresponding video content for playing.
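  • Steps 300 to 304 chain two lookups: keyword to key frame (first correspondence), then key frame to video information (second correspondence). The sketch below illustrates that chain under stated assumptions: the class name `VideoSearchIndex`, the `video://example/...` URIs, and the rule that a key frame must carry every picture keyword are all hypothetical choices, not taken from the source.

```python
from collections import defaultdict


class VideoSearchIndex:
    """Holds the first and second correspondences as in-memory indexes."""

    def __init__(self):
        self.keyword_to_frames = defaultdict(set)  # first correspondence
        self.frame_to_video = {}                   # second correspondence

    def add_key_frame(self, frame_id, keywords, video_info):
        """Steps 300-301: register a recognized key frame and its video."""
        self.frame_to_video[frame_id] = video_info
        for kw in keywords:
            self.keyword_to_frames[kw].add(frame_id)

    def search(self, picture_keywords):
        """Steps 302-303: keywords -> key frames -> video information."""
        candidate_sets = [
            self.keyword_to_frames.get(kw, set()) for kw in picture_keywords
        ]
        # Assumption: a frame matches only if it carries every keyword.
        frames = set.intersection(*candidate_sets) if candidate_sets else set()
        videos = []
        for frame_id in sorted(frames):
            info = self.frame_to_video[frame_id]
            if info not in videos:  # several frames may belong to one video
                videos.append(info)
        return videos  # Step 304 would send this list to the terminal


idx = VideoSearchIndex()
idx.add_key_frame("v1_f0", {"text:NEWS", "subject:anchor"}, {"uri": "video://example/v1"})
idx.add_key_frame("v1_f1", {"text:NEWS", "subject:map"}, {"uri": "video://example/v1"})
idx.add_key_frame("v2_f0", {"text:NEWS", "subject:stadium"}, {"uri": "video://example/v2"})
print(idx.search(["text:NEWS", "subject:anchor"]))
```

  • Searching with only `text:NEWS` returns both videos, which corresponds to the case in which a group of videos is sent to the terminal.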
  • this embodiment also provides a solution in which the server sends related time information to the terminal, so that the user can continue watching the video from the time point at which viewing stopped, which improves the user experience.
  • before step 302, the method of the embodiment further includes: acquiring time information of the key frame of the video in the video; and establishing a third correspondence between the video key frame and the time information.
  • the method of the embodiment further includes: searching for the corresponding time information according to the found key frame and the third correspondence; and sending the found time information to the terminal.
  • the time information corresponding to the key frame can be sent to the terminal, so that the user can continue watching the video from the time point of the previous viewing when the video is played, thereby improving the user experience.
  • this embodiment further provides another video search method, which is applied to the server side, and includes the following steps:
  • Step 400 Acquire a key frame of the video, perform the image recognition on the key frame to obtain a keyword of the key frame, and obtain time information of the key frame in the video.
  • Step 401 Establish a first correspondence between the video key frame and the keyword, a second correspondence between the video information and the video key frame, and a third correspondence between the video key frame and the time information, and store the first correspondence, the second correspondence, and the third correspondence.
  • the correspondence between the video and the keywords of its key frames is composed of the first correspondence and the second correspondence.
  • the correspondences in this embodiment may be established as indexes, for example, establishing a key frame index of the video (i.e., the second correspondence) and a keyword index of the video key frame (i.e., the first correspondence).
  • the third correspondence may also be established as an index, for example, a key frame index of the time information, where the index value is a key frame and the index object is time information; after the key frame is found, the corresponding time information may be matched in the key frame index of the time information according to the key frame.
  • the time information in this embodiment may be time point information.
  • Step 402 Receive a picture keyword sent by the terminal, and search for a video key frame corresponding to the picture keyword according to the picture keyword and the first correspondence.
  • the key frame corresponding to the picture keyword is matched in the keyword index of the key frame by using the picture keyword.
  • Step 403 Search for video information corresponding to the video key frame according to the found video key frame and the second correspondence, and search for time information corresponding to the video key frame according to the found video key frame and the third correspondence.
  • the key frame is used to match the video corresponding to the key frame in the index of the key frame of the video.
  • Step 404 Send the found video information (such as content information or resource location information) and time information to the terminal.
  • with this method, the user can conveniently take a video screenshot or photograph and have it matched to the corresponding video and video time point, which makes watching videos more convenient for the user.
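  • Steps 403 and 404 can be illustrated as two dictionary lookups keyed by the matched key frame, one into the second correspondence and one into the third. The function name, the field names `video_info` and `time_s`, and the use of seconds as the time unit are assumptions made for this sketch only.

```python
def build_search_result(matched_frames, frame_to_video, frame_to_time):
    """For each matched key frame, pair its video information with its time point."""
    results = []
    for frame_id in matched_frames:
        results.append({
            "video_info": frame_to_video[frame_id],  # second correspondence
            "time_s": frame_to_time[frame_id],       # third correspondence
        })
    return results


# Hypothetical indexes: one key frame located 754 seconds into its video.
frame_to_video = {"v1_f3": {"uri": "video://example/v1"}}
frame_to_time = {"v1_f3": 754.0}

print(build_search_result(["v1_f3"], frame_to_video, frame_to_time))
```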
  • Embodiment 2:
  • This embodiment provides a video search method, which is applied to the terminal side, as shown in FIG. 5, and includes the following steps:
  • Step 501 Acquire keywords.
  • the manner of obtaining the keyword in this embodiment may include multiple types.
  • the keyword may be generated by the terminal itself, or the keyword may be generated by another device, from which the terminal acquires it.
  • the keyword may be a picture keyword, that is, a keyword related to the video picture obtained by performing image recognition on a picture that includes the video picture.
  • the process of the terminal acquiring the picture keyword may include:
  • acquiring a picture that includes a video picture, for example, taking a screenshot to obtain a screenshot picture, or photographing the video picture (such as photographing a display that is playing a video);
  • performing image recognition on the picture to obtain keywords corresponding to the picture.
  • the terminal may perform image recognition on the picture by using a specific image recognition application to obtain a picture keyword related to the video picture; the application scans the photo that includes the video picture to obtain the keyword.
  • a picture that includes the video picture takes one of two forms. In the first, the video picture fills the entire picture, that is, the picture is the video picture, for example a screenshot of the video picture; in this case image recognition is simply performed on the entire picture.
  • in the second, the video picture fills only part of the picture, and the captured picture also contains other content. In this case, image recognition is performed on the video picture, and the content outside the video picture is discarded.
  • the keywords related to the video picture identified in this embodiment may include at least one of text in the video picture, main content in the video picture, and the proportion of the video picture occupied by the main content.
  • the image recognition process on the terminal side is consistent with the image recognition process on the server side.
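  • Two of the terminal-side operations described above are simple geometry: discarding the content outside the video picture, and computing the proportion of the picture occupied by the main content. The sketch below shows both on a picture held as a list of pixel rows. The bounding boxes are assumed to come from some recognition step that the source does not specify, and the function names are hypothetical.

```python
def crop_to_video_picture(image, box):
    """Keep only the video picture region; box is (left, top, right, bottom),
    with right and bottom exclusive. Content outside the box is discarded."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def main_content_ratio(content_box, picture_width, picture_height):
    """Proportion of the picture area occupied by the main content box."""
    left, top, right, bottom = content_box
    return ((right - left) * (bottom - top)) / (picture_width * picture_height)


# A 4x4 "photo" whose central 2x2 region is the video picture (pixel values are placeholders).
photo = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
    [12, 13, 14, 15],
]
video_picture = crop_to_video_picture(photo, (1, 1, 3, 3))
print(video_picture)
print(main_content_ratio((0, 0, 2, 2), 4, 4))
```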
  • Step 502 Send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the video search method of the embodiment can send the keyword to the server, and the server automatically finds the key frame of the corresponding video, thereby finding the video, which is convenient and simple, and improves the user experience.
  • the method in this embodiment may further include, after step 502: receiving the video information sent by the server; and playing the video according to the video information.
  • this embodiment provides a video playing method, including the following steps:
  • Step 601 Acquire a picture containing a video picture.
  • Step 602 Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
  • Step 603 Receive video information sent by the server.
  • After obtaining the picture keyword, the terminal sends it to the server; the server searches for the corresponding video according to the picture keyword and the stored correspondence between videos and the keywords of video key frames, and then sends the information of the found video to the terminal.
  • the video found in this embodiment may be one video, or may be a group of videos, for example, the group most strongly associated with the picture keyword. Therefore, the video information received by the terminal in this embodiment may be one item of video information or multiple items (for example, a group of video information).
  • the information of the video in this embodiment may include content information of the video or identification information (for example, a URI) of the video.
  • Step 604 Perform video playback according to the video information.
  • When the information of the received video is the content information of the video, the terminal directly plays the content information of the video;
  • When the information of the received video is the location information (for example, a URI) of the video resource, the terminal acquires the corresponding video content according to the location information, and then plays the obtained video content.
  • When the terminal receives a group of video information, the user also needs to select the desired video information for playback.
  • the video playing method of this embodiment can enable the user to search for the desired video conveniently and quickly and play it.
  • the playing method of the embodiment may further include: receiving the time information sent by the server after step 602; in this case, step 604 includes: playing the video according to the time information and the video information.
  • by receiving the time information, the terminal learns the time point in the video at which the user acquired the picture (that is, the time at which the user interrupted watching the video), and can start playing from that time point rather than from the beginning, which improves the user experience.
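  • The branching in step 604 (content information versus resource location information, with an optional start time) might be expressed as a small decision function. This is a sketch under assumptions: the dictionary keys `content` and `uri`, the mode names, and the seconds-based `start_s` field are illustrative and do not come from the source, and no actual player is invoked.

```python
def plan_playback(video_info, time_s=None):
    """Decide how the terminal should play, based on the received video information."""
    if "content" in video_info:
        # Content information was received: play it directly.
        action = {"mode": "play_content", "source": video_info["content"]}
    elif "uri" in video_info:
        # Resource location information was received: fetch the content, then play it.
        action = {"mode": "fetch_then_play", "source": video_info["uri"]}
    else:
        raise ValueError("video_info needs a 'content' or 'uri' entry")
    # Without time information, playback starts from the beginning.
    action["start_s"] = 0.0 if time_s is None else float(time_s)
    return action


print(plan_playback({"uri": "video://example/v1"}, time_s=754))
```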
  • the embodiment further provides another video playing method, including the following steps:
  • Step 701 Acquire a picture containing a video picture.
  • Step 702 Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
  • the terminal may perform image recognition on the picture by using a specific image recognition application to obtain a picture keyword related to the video picture; the application scans the photo that includes the video picture to obtain the keyword.
  • a picture that includes the video picture takes one of two forms. In the first, the video picture fills the entire picture, that is, the picture is the video picture, for example a screenshot of the video picture; in this case image recognition is simply performed on the entire picture.
  • in the second, the video picture fills only part of the picture, and the captured picture also contains other content.
  • in this case, image recognition is performed on the video picture, and the content outside the video picture is discarded. For example, when a picture taken of a television screen is recognized, only the content of the television screen is recognized, and the portion that does not belong to the content of the television screen is discarded.
  • Step 703 Receive video information sent by the server.
  • For details, refer to the description of step 603 above.
  • Step 704 Send the video information to a playback device, where the playback device performs video playback according to the video information.
  • the terminal directly plays the video, but converts the information of the video sent by the server to a playback device (such as a television or a set top box) for playing.
  • a playback device such as a television or a set top box
  • the terminal sends the content information of the video to the playing device, and the playing device directly plays the video after receiving the content information of the video;
  • the terminal When receiving the video information as the video resource location information, the terminal sends the video resource location information to the playback device, and the playback device acquires the corresponding video content according to the received video resource location information for playing.
  • When the server further sends time information to the terminal, the method further includes: receiving the time information sent by the server; and sending the time information to the playback device, so that the playback device performs video playback according to the time information and the video information.
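The two cases above (video content information versus video resource location information, with optional time information) can be sketched as a small dispatch on the playback device. This is only an illustrative sketch: the `VideoInfo` structure, the `plan_playback` function, and the choice to start from the beginning when no time information is supplied are assumptions, not part of the described method.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VideoInfo:
    """Video information forwarded by the terminal (hypothetical structure)."""
    content: Optional[bytes] = None   # video content information, if sent directly
    location: Optional[str] = None    # video resource location information, e.g. a URL


def plan_playback(info: VideoInfo, time_s: Optional[float] = None) -> dict:
    """Decide how the playback device obtains the video and where it starts."""
    if info.content is not None:
        source = "play received content"
    elif info.location is not None:
        source = f"fetch from {info.location}"
    else:
        raise ValueError("video information carries neither content nor location")
    # Assumption: without time information, playback starts from the beginning.
    return {"source": source, "start_at": time_s if time_s is not None else 0.0}
```

For example, `plan_playback(VideoInfo(location="http://example.invalid/v1"), 12.5)` describes fetching the video from the given location and starting playback at 12.5 seconds.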
  • Embodiment 3:
  • the server establishes the key frame index of the video, the keyword index of the key frame, and the key frame index of the time point.
  • the flow is as follows:
  • A key frame is an independent, complete frame; within a GOP (group of pictures), the subsequent video frames depend on the key frame.
  • The image recognition algorithm and the knowledge base content determine the content of the keywords, and therefore also the accuracy with which the video and the position within it are found.
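The three indexes the server establishes can be pictured as three lookup tables. The sketch below assumes the key frames have already been extracted and recognized; the `KEY_FRAMES` data, the identifiers, and the `build_indexes` function are hypothetical and only illustrate the shape of the indexes.

```python
from collections import defaultdict

# Hypothetical pre-recognized key frames:
# (key frame id, video id, time in seconds, keywords)
KEY_FRAMES = [
    ("kf1", "video_a", 0.0, {"beach", "dog"}),
    ("kf2", "video_a", 4.0, {"dog", "ball"}),
    ("kf3", "video_b", 2.0, {"city", "night"}),
]


def build_indexes(key_frames):
    """Build the keyword, video, and time-point indexes for the key frames."""
    keyword_index = defaultdict(set)   # keyword -> key frame ids
    video_index = {}                   # key frame id -> video id
    time_index = {}                    # key frame id -> time point in the video
    for frame_id, video_id, time_s, keywords in key_frames:
        for keyword in keywords:
            keyword_index[keyword].add(frame_id)
        video_index[frame_id] = video_id
        time_index[frame_id] = time_s
    return dict(keyword_index), video_index, time_index
```

With these tables, a keyword leads to candidate key frames, and each key frame leads to its video and its time point.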
  • Step 801 The terminal scans the screen capture picture to obtain keyword information for the picture.
  • Step 802 The terminal sends the keyword information to the server.
  • Step 803 The server receives the keyword information sent by the terminal, and searches the keyword index of the key frames according to the keyword information to match the corresponding key frame.
  • Because the screen capture is not necessarily taken at the position of a key frame, it may not exactly match any existing key frame, in which case the one or more closest video frames need to be matched.
  • Step 804 According to the matched key frame, the server searches the key frame index of the time points and the key frame index of the videos to find the matching video and time point.
  • The matching result in this step may be a single video, in which case the server sends that video or its identification information to the terminal.
  • The matching result may also be a group of videos, in which case a set of videos or of identification information is sent to the terminal.
  • Step 805 The server sends the video information corresponding to the matched video and the time point to the terminal.
  • the video information may include: identification information corresponding to the matched video or video content of the matched video.
  • Step 806 The terminal receives the time point and the video sent by the server, or the time point and the identification information; and then plays the corresponding video according to the received information.
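Steps 803 and 804 can be sketched as a nearest-match search followed by a lookup of the video and time point. The text does not say how "closest" is measured; the sketch below assumes Jaccard similarity between keyword sets purely for illustration, and the `FRAMES` data and function names are hypothetical.

```python
def jaccard(a: set, b: set) -> float:
    """Overlap between two keyword sets, from 0.0 (disjoint) to 1.0 (identical)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def find_matches(query: set, frames: dict, top_n: int = 1):
    """Return up to top_n (video id, time point, score) tuples closest to the query.

    frames maps key frame id -> (video id, time in seconds, keyword set).
    """
    scored = [
        (jaccard(query, keywords), frame_id)
        for frame_id, (_, _, keywords) in frames.items()
    ]
    # Highest score first; ties are broken by frame id for a stable result.
    scored.sort(key=lambda item: (-item[0], item[1]))
    results = []
    for score, frame_id in scored[:top_n]:
        if score == 0.0:
            break  # nothing in common with the query
        video_id, time_s, _ = frames[frame_id]
        results.append((video_id, time_s, score))
    return results


FRAMES = {
    "kf1": ("video_a", 0.0, {"beach", "dog"}),
    "kf2": ("video_a", 4.0, {"dog", "ball"}),
    "kf3": ("video_b", 2.0, {"city", "night"}),
}
```

Raising `top_n` returns a group of candidates rather than a single one, which corresponds to the case where a set of videos or identification information is sent back to the terminal.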
  • the process of video search and playback includes the following steps:
  • Step 901 The mobile phone starts a specific identification application to scan a photo, and obtains keyword information related to the video content.
  • Because the area of the photograph may be larger than the area of the video, only the content of the television screen is recognized, and the portion of the picture that does not belong to the television screen content is discarded.
  • The keyword information identified in this embodiment may be the subject information of the photo and the percentages of its various colors.
  • Step 902 The specific identification application sends the keyword information to the server where the video is located through the network.
  • Step 903 After receiving the keyword information, the server searches the keyword index of the key frames according to the keyword information to match the corresponding key frame.
  • the result of the matching may be a key frame in a video, or a key frame in a corresponding group of videos with the strongest correlation.
  • Step 904 According to the matched key frame, the server searches the key frame index of the time points and the key frame index of the videos to find all matching videos and the corresponding time point information.
  • The matching result may be a single video or a group of videos, with a single piece of time point information or a set of time point information.
  • Step 905 The server sends the video information corresponding to the matched video and the time point information to the terminal.
  • the video information may include: identification information corresponding to the matched video or video content of the matched video.
  • Because the received information may be a set of video information, the mobile application or the mobile phone user needs to perform screening; for example, the user selects the video identifier of the desired video from a set of video identification information.
  • Step 906 The mobile phone pushes the video information and the time point information to the television or the set top box.
  • the push mode can be AirPlay mode, or DLNA.
  • Step 907 The television or the set top box starts the corresponding program play according to the video information and the time point information.
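Step 901 mentions that the keyword information may consist of the subject of the photo and the percentage of various colors. A minimal sketch of that keyword information follows; it assumes the pixels have already been reduced to named colors and the subject has already been recognized, and the function names and output layout are hypothetical.

```python
from collections import Counter


def color_percentages(pixels, ndigits=1):
    """Return each color's share of the pixels as a percentage."""
    if not pixels:
        return {}
    counts = Counter(pixels)
    total = len(pixels)
    return {color: round(100.0 * n / total, ndigits) for color, n in counts.items()}


def keyword_info(subject, pixels):
    """Combine the recognized subject with the color percentages."""
    return {"subject": subject, "colors": color_percentages(pixels)}
```

The same computation would have to be applied to the key frames on the server side for the terminal's keyword information to be comparable with the index.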
  • Embodiment 4:
  • the embodiment provides a video search device, which is applied to a server, and includes: a first acquiring module and a first searching module;
  • the first acquiring module is configured to acquire a first correspondence between a key frame of the video and the keyword
  • the first search module is configured to receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the video search apparatus of this embodiment further includes: a first sending module
  • the first sending module is configured to send video information corresponding to the found key frame to the terminal.
  • the first acquiring module is further configured to acquire a second correspondence between video information and a key frame of the video;
  • the first search module is further configured to search for video information corresponding to the key frame according to the found key frame and the second correspondence.
  • the received keyword includes: a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture including a video picture.
  • the first acquiring module includes:
  • a key frame acquisition unit configured to acquire a key frame of the video
  • a keyword acquiring unit configured to perform the image recognition on a key frame of the video to acquire a keyword of the key frame
  • the first correspondence establishing unit is configured to establish a first correspondence between the key frame of the video and the keyword.
  • the keyword of the key frame further includes at least one of: text in the key frame, subject content in the key frame, and the proportion of the key frame occupied by the subject content.
  • the first acquiring module further includes:
  • a time information obtaining unit configured to acquire time information of a key frame of the video in the video
  • a third correspondence establishing unit is configured to establish a third correspondence between the key frame and the time information of the video
  • the first sending module is further configured to search for corresponding time information according to the found key frame and the third correspondence, and send the found time information to the terminal.
  • the video information includes: video content information or video resource location information.
  • the embodiment further provides a video search device, which is applied to a terminal, and includes: a second acquiring module and a second sending module;
  • the second acquiring module is configured to acquire a keyword
  • the second sending module is configured to send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the second acquiring module acquiring the keyword includes: acquiring a picture that contains a video picture; and performing image recognition on the picture to obtain a keyword corresponding to the picture.
  • the video search device further includes: a second receiving module, configured to: receive video information sent by the server.
  • the embodiment of the present invention further provides a video playing device, where the video playing device includes any video searching device provided by the embodiment of the present invention.
  • the video playing device further includes: a playing module, configured to perform video playing according to the video information.
  • the second receiving module is further configured to receive time information sent by the server;
  • the playing module performing video playback according to the video information includes:
  • playing the video according to the video information and the time information.
  • the video search device further includes: a third sending module, configured to: after the video information sent by the server is received, send the video information to the playback device, so that the playback device performs video playback according to the video information.
  • the second receiving module is further configured to: receive time information sent by the server;
  • the third sending module is further configured to: send the time information to the playing device, so that the playing device performs video playing according to the time information and the video information.
  • The user terminal only needs to acquire a picture containing a video picture (for example, by photographing the video picture or taking a screenshot), perform image recognition on the picture to obtain a keyword related to the video picture, and send the keyword to the server.
  • The server automatically finds the corresponding video according to the keyword sent by the terminal and the stored correspondence, and feeds the search result back to the terminal.
  • Because the video search method in the embodiment of the present invention is based on image recognition technology, the user can quickly obtain the corresponding video information simply by acquiring a picture containing the video picture, and the operation is simple and fast.
  • In addition, the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.
  • the embodiment of the invention further provides a computer readable storage medium storing computer executable instructions for performing the above method.
  • All or part of the steps of the above embodiments may also be implemented by using integrated circuits: the steps may be fabricated into individual integrated circuit modules, or multiple modules or steps may be fabricated into a single integrated circuit module.
  • the devices/function modules/functional units in the above embodiments may be implemented by a general-purpose computing device, which may be centralized on a single computing device or distributed over a network of multiple computing devices.
  • When the devices/function modules/functional units in the above embodiments are implemented in the form of software function modules and sold or used as stand-alone products, they can be stored in a computer readable storage medium.
  • the above mentioned computer readable storage medium may be a read only memory, a magnetic disk or an optical disk or the like.
  • The user can quickly obtain the corresponding video information simply by acquiring a picture containing the video picture, and the operation is simple and fast.
  • In addition, with the method of the present invention the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.

Abstract

A video search method and apparatus, which are applied to a server. The method comprises: acquiring a first correspondence between keyframes of a video and keywords; and receiving a keyword sent by a terminal, and searching for a keyframe of a corresponding video according to the keyword sent by the terminal and the first correspondence.

Description

Video search method and device
Technical field
The present invention relates to, but is not limited to, the field of video technology.
Background
In the related art, picture recognition technology on mobile phones already has many mature applications. For example, when a phone holds many photos and sorting them one by one is troublesome, some applications can automatically scan the photo album and find the desired photos by keyword, which makes life more convenient.
In the video field, however, there is still no method that lets users search for a desired video conveniently and quickly, which causes trouble in everyday life. For example, a user who is watching a favorite movie on a mobile phone or tablet may have to close it because of something unexpected, and on returning must go through complicated operations on the phone or tablet to search again for the movie being watched. As another example, a user who sees a poster made from a screenshot of an appealing movie and wants to watch that movie must go through complicated operations on a mobile phone or computer to search for it, so finding the movie is difficult.
In these scenarios, searching for a video is troublesome and difficult, which leads to a poor user experience; how to let users search for videos conveniently and quickly has therefore become a pressing problem in the video field.
Summary of the invention
The following is an overview of the subject matter described in detail herein. This overview is not intended to limit the scope of protection of the claims.
An embodiment of the present invention provides a video search method and device, and a video playing method and device, which enable a user to search for a video conveniently and quickly.
An embodiment of the present invention provides a video search method, applied to a server, including the following steps:
acquiring a first correspondence between key frames of a video and keywords;
receiving a keyword sent by a terminal, and searching for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
Optionally, after the key frame is found, the method further includes:
sending video information corresponding to the found key frame to the terminal.
Optionally, before receiving the keyword sent by the terminal, the method further includes: acquiring a second correspondence between video information and key frames of the video;
after searching for the key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence, and before sending the video information corresponding to the found key frame to the terminal, the method further includes:
searching for the video information corresponding to the key frame according to the found key frame and the second correspondence.
Optionally, the received keyword includes a picture keyword, the picture keyword being a keyword related to a video picture and obtained by performing image recognition on a picture containing the video picture.
Optionally, acquiring the first correspondence between key frames of the video and keywords includes:
acquiring the key frames of the video;
performing the image recognition on the key frames of the video to obtain keywords of the key frames;
establishing the first correspondence between the key frames of the video and the keywords.
Optionally, the keywords of a key frame further include at least one of: text in the key frame, subject content in the key frame, and the proportion of the key frame occupied by the subject content.
Optionally, before receiving the picture keyword sent by the terminal, the method further includes:
acquiring time information of the key frames of the video within the video;
establishing a third correspondence between the key frames of the video and the time information;
after searching for the key frame of the corresponding video, the method further includes:
searching for the corresponding time information according to the found key frame and the third correspondence;
sending the found time information to the terminal.
Optionally, the video information includes video content information or video resource location information.
An embodiment of the present invention further provides a video search method, applied to a terminal, including the following steps:
acquiring a keyword;
sending the keyword to a server, so that the server searches for a key frame of a video according to the keyword.
Optionally, the step of acquiring a keyword includes:
acquiring a picture containing a video picture;
performing image recognition on the picture to obtain a keyword corresponding to the picture.
Optionally, after the keyword is sent to the server, the method further includes:
receiving video information sent by the server;
performing video playback according to the video information.
Optionally, after the keyword is sent to the server, the method further includes: receiving time information sent by the server;
the step of performing video playback according to the video information includes:
performing video playback according to the video information and the time information.
Optionally, after the keyword is sent to the server, the method further includes:
receiving video information sent by the server;
sending the video information to a playback device, so that the playback device performs video playback according to the video information.
Optionally, after the keyword is sent to the server, the method further includes:
receiving time information sent by the server;
sending the time information to the playback device, so that the playback device performs video playback according to the time information and the video information.
An embodiment of the present invention further provides a video search device, applied to a server, including:
a first acquiring module and a first search module;
the first acquiring module is configured to acquire a first correspondence between key frames of a video and keywords;
the first search module is configured to receive a keyword sent by a terminal, and to search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
Optionally, the video search device further includes a first sending module;
the first sending module is configured to send video information corresponding to the found key frame to the terminal.
An embodiment of the present invention further provides a video search device, applied to a terminal, including:
a second acquiring module and a second sending module;
the second acquiring module is configured to acquire a keyword;
the second sending module is configured to send the keyword to a server, so that the server searches for a key frame of a video according to the keyword.
Optionally, the second acquiring module acquiring the keyword includes: acquiring a picture containing a video picture;
performing image recognition on the picture to obtain a keyword corresponding to the picture.
An embodiment of the present invention provides a video search method and device. The video search method includes: acquiring a first correspondence between key frames of a video and keywords; and receiving a keyword sent by a terminal and searching for a key frame of the corresponding video according to that keyword and the first correspondence. With this method, the user terminal only needs to send a keyword to the server, and the server automatically finds the key frame of the corresponding video according to the keyword and the first correspondence; because a key frame of a video can represent the video, the corresponding video can then be found. The user only needs to obtain a keyword and send it to the server, so the operation is simple and fast; applying the solution of the embodiment reduces the difficulty of the video search and improves the user experience.
In an optional implementation of the video search solution, which is based on image recognition technology, the user can quickly obtain the corresponding video information simply by acquiring a picture containing the video picture, so the operation is simple and fast. In addition, with this optional implementation the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.
Other aspects will become apparent upon reading and understanding the drawings and the detailed description.
Brief description of the drawings
FIG. 1 is a schematic flowchart of a first video search method according to Embodiment 1 of the present invention;
FIG. 2 is a schematic flowchart of a second video search method according to Embodiment 1 of the present invention;
FIG. 3 is a schematic flowchart of a third video search method according to Embodiment 1 of the present invention;
FIG. 4 is a schematic flowchart of a fourth video search method according to Embodiment 1 of the present invention;
FIG. 5 is a schematic flowchart of a video search method according to Embodiment 2 of the present invention;
FIG. 6 is a schematic flowchart of a video playing method according to Embodiment 2 of the present invention;
FIG. 7 is a schematic flowchart of another video playing method according to Embodiment 2 of the present invention;
FIG. 8 is a schematic flowchart of video search and playback according to Embodiment 3 of the present invention;
FIG. 9 is a schematic flowchart of another video search and playback according to Embodiment 3 of the present invention;
FIG. 10 is a schematic structural diagram of a first video search device according to Embodiment 4 of the present invention;
FIG. 11 is a schematic structural diagram of a second video search device according to Embodiment 4 of the present invention;
FIG. 12 is a schematic structural diagram of a third video search device according to Embodiment 4 of the present invention.
Other aspects will become apparent upon reading and understanding the drawings and the detailed description.
Preferred embodiments of the invention
The embodiments of the present invention are described below with reference to the accompanying drawings. It should be noted that, provided there is no conflict, the embodiments in this application and the features in the embodiments may be combined with one another arbitrarily.
The embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Embodiment 1:
Considering the problem in the related art of how to let users search for videos conveniently and quickly, this embodiment provides a video search method applied to the server side which, as shown in FIG. 1, includes the following steps:
Step 101: Acquire a first correspondence between key frames of a video and keywords.
In this embodiment the first correspondence can be acquired in several ways. For example, another device may establish the correspondence between key frames of the video and keywords, and the server then acquires it from that device; alternatively, the server may establish the correspondence between key frames of the video and keywords directly.
In this embodiment a key frame of a video is one frame of picture, for example an independent, complete frame; within a GOP (Group of Pictures), the subsequent video frames all depend on the key frame. A key frame can therefore represent the video, and finding a key frame of a video identifies the corresponding video.
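To make the role of key frames within a GOP concrete, the sketch below picks out the key frames of a frame sequence and computes the time of each one, which is the kind of time information a server could later associate with a key frame. The one-letter frame-type string (with "I" marking an independently decodable frame) and the `key_frame_times` function are hypothetical; a real system would obtain frame types from a video decoder.

```python
def key_frame_times(frame_types, fps):
    """Return (frame index, time in seconds) for every key frame.

    frame_types is a sequence such as "IPPBIPPB", where "I" marks an
    independently decodable frame; this string encoding is hypothetical.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return [(i, i / fps) for i, kind in enumerate(frame_types) if kind == "I"]
```

For a sequence of two four-frame GOPs at 4 frames per second, `key_frame_times("IPPBIPPB", 4)` yields the key frames at indexes 0 and 4, one second apart.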
In this embodiment, the process by which the server establishes the first correspondence between the video and the keywords of its key frames may include:
acquiring the key frames of the video;
performing the image recognition on the key frames of the video to obtain keywords of the key frames;
establishing the first correspondence between the key frames of the video and the keywords.
The method of this embodiment may, for each video, acquire the key frames of the video, then perform image recognition on all the key frames to obtain and store the keywords of each key frame, and finally establish the correspondence between the key frames and the keywords.
In this embodiment, image recognition of the key frames is based on the knowledge base content and the image recognition method; different knowledge base contents and image recognition methods may yield different keywords. A knowledge base, a central element of knowledge engineering, is a structured, easy-to-operate, easy-to-use, comprehensive and organized cluster of knowledge: a collection of interrelated knowledge pieces that are stored, organized, managed and used in computer memory using one or more knowledge representation methods, in order to solve problems in one or more domains.
Optionally, in this embodiment the keywords of a key frame may include at least one of: text in the key frame, subject content in the key frame, and the proportion of the key frame occupied by the subject content. For example, the keywords of a key frame may be the text information in the key frame, the subject content, and the proportion of the picture occupied by the subject content; this set of key information can be used to identify a picture.
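One way to picture this set of key information is as a small record that is flattened into comparable keyword strings. The sketch below is an assumption-laden illustration: the `FrameDescription` record, the `to_keywords` function, the `text:`/`subject:`/`ratio:` prefixes, and the idea of rounding the ratio into coarse buckets (so that slightly different recognition results can still match) are all hypothetical and not stated in the source.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrameDescription:
    text: str             # text recognized in the key frame
    subject: str          # subject content of the key frame
    subject_ratio: float  # fraction of the frame occupied by the subject, 0.0 to 1.0


def to_keywords(desc: FrameDescription, bucket: float = 0.1) -> set:
    """Flatten a description into a set of comparable keyword strings."""
    keywords = {f"subject:{desc.subject.lower()}"}
    keywords.update(f"text:{word.lower()}" for word in desc.text.split())
    # Quantize the ratio so that small recognition differences still match.
    ratio_bucket = round(desc.subject_ratio / bucket)
    keywords.add(f"ratio:{ratio_bucket}")
    return keywords
```

If the terminal and the server both produce keywords this way, a screenshot and the key frame it came from yield overlapping sets even when the measured ratio differs slightly.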
In this embodiment, the first correspondence between key frames of the video and keywords may take the form of a keyword index of the key frames, in which the index value is the keyword and the indexed object is the key frame.
In this embodiment the keyword sent by the terminal may be a picture keyword, that is, a keyword related to a video picture and obtained by performing image recognition on a picture containing the video picture. For example, the keyword may be obtained by the user terminal itself performing image recognition on a picture containing the video picture (such as a screenshot of the video picture or a photograph taken of it); alternatively, another device may perform the image recognition on such a picture, and the terminal then obtains the keyword from that device and forwards it to the server.
Optionally, in this embodiment the image recognition method used on the terminal side needs to be consistent with the one the server side uses for key frames; otherwise the recognized keyword content differs and the video cannot be matched accurately.
Step 102: Receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
Optionally, a picture keyword sent by the terminal is received, and the video corresponding to the picture keyword is searched for according to the picture keyword and the first correspondence; the picture keyword is a keyword related to a video picture and obtained by the terminal performing image recognition on a picture containing the video picture.
Because a key frame of a video can represent the video, finding the key frame identifies the corresponding video.
In this embodiment, one or more key frames corresponding to the keyword may be found; the found key frames may be a single key frame in one video or key frames in a group of videos, where the group may be the set of videos with the strongest correlation.
With the video search method of this embodiment, the user terminal only needs to acquire a picture containing a video picture (for example, by photographing the video picture or taking a screenshot), perform image recognition on the picture to obtain a keyword related to the video picture, and send the keyword to the server. The server automatically finds the key frame of the corresponding video according to the keyword sent by the terminal and the stored correspondence; because the key frame can represent the video, finding the key frame amounts to finding the video. The user can thus quickly obtain the corresponding video information simply by acquiring a picture containing the video picture, so the operation is simple and fast. In addition, with the method of this embodiment the user does not need to memorize keyword information for searching the video, which reduces the difficulty of the video search and improves the user experience.
To enable the terminal to play the video that is found, as shown in FIG. 2, this embodiment further provides a video search method, applied to the server side, including:
Step 201: Establish a first correspondence between key frames of videos and keywords.
For the process of establishing the first correspondence in this step, refer to the related description above. For example, a keyword index of key frames is established, in which the index value is a keyword and the index object is a key frame of a video.
Step 202: Receive a keyword sent by the terminal, and search for the key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
Optionally, the keyword sent by the terminal includes a picture keyword, which is a keyword related to the video picture obtained by performing image recognition on a picture containing the video picture, for example by the sending terminal itself.
After the picture keyword is received, the key frame of the corresponding video is retrieved from the keyword index of key frames according to the picture keyword.
Step 203: Send the video information corresponding to the found key frame to the terminal.
In this step, the video information may include video content information or video resource location information.
After the corresponding key frame is found, this embodiment may send the video content information corresponding to the key frame to the terminal, so that the terminal can play it directly or send it to another playback device for playback.
Alternatively, the video resource location information corresponding to the key frame, for example a URI (Uniform Resource Identifier), is sent to the terminal, so that the terminal can obtain the corresponding video content according to this identifier information and play it, or send the identifier information to another playback device so that the playback device obtains the corresponding video content according to it and plays it.
In this embodiment, because one or more key frames may be found, there may also be one or more pieces of video information corresponding to the key frames. For example, the result may be a single video or a group of videos, so the method of this embodiment sends the information of the single video to the terminal, or sends the information of each video in the group to the terminal.
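The keyword index described for steps 201 and 202 can be pictured as an inverted index. The following is a minimal sketch only, not the implementation of the embodiment: the key frame identifiers, the keyword strings, and the rule that a key frame must carry every keyword sent by the terminal are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical key frame identifier: (video id, frame number).
KeyFrame = tuple[str, int]


def build_keyword_index(frame_keywords: dict[KeyFrame, set[str]]) -> dict[str, set[KeyFrame]]:
    """Step 201: map each keyword (index value) to its key frames (index objects)."""
    index: dict[str, set[KeyFrame]] = defaultdict(set)
    for frame, keywords in frame_keywords.items():
        for keyword in keywords:
            index[keyword].add(frame)
    return dict(index)


def find_key_frames(index: dict[str, set[KeyFrame]], keywords: set[str]) -> set[KeyFrame]:
    """Step 202: return the key frames that carry every keyword sent by the terminal."""
    if not keywords:
        return set()
    matches = [index.get(keyword, set()) for keyword in keywords]
    return set.intersection(*matches)


# Illustrative data only.
frames = {
    ("video-a", 0): {"text:news", "subject:anchor"},
    ("video-a", 250): {"text:news", "subject:map"},
    ("video-b", 0): {"subject:anchor"},
}
keyword_index = build_keyword_index(frames)
result = find_key_frames(keyword_index, {"text:news", "subject:anchor"})
```

A query with a single keyword can return key frames from several videos, which corresponds to the case above in which a group of videos is found.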
Optionally, before step 202, the method of this embodiment may obtain a second correspondence between video information and key frames of videos. In this case, step 203 may include: finding the corresponding video information according to the found key frame and the second correspondence; and sending the found video information to the terminal.
In this embodiment, the second correspondence between video information and key frames of videos may be established by the user terminal itself, or it may be established by another device, from which the user terminal then obtains it. Video information and videos are in one-to-one correspondence.
In this embodiment, the process of establishing the second correspondence between video information and key frames of videos may include:
obtaining, for each video, the key frames of the video and the video information (for example, the video content or the video resource location information);
and then establishing the second correspondence between the key frames of the video and the video information.
In this embodiment, the second correspondence may be a key frame index of video information, in which the index value is a key frame and the index object is video information. After a key frame of a video is found, the key frame index of video information is searched to match the corresponding video information.
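As a rough illustration of the second correspondence, the sketch below maps key frame identifiers to video information that holds either content or a resource location. The class name `VideoInfo`, its fields, the frame identifiers, and the `example.com` URI are hypothetical and are not taken from the embodiment.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VideoInfo:
    """Video information: either the content itself or where to fetch it."""
    video_id: str
    content: Optional[bytes] = None  # video content information
    uri: Optional[str] = None        # video resource location information


def build_key_frame_index(videos: dict[VideoInfo, list[str]]) -> dict[str, VideoInfo]:
    """Map each key frame id (index value) to its video information (index object)."""
    return {frame_id: info for info, frame_ids in videos.items() for frame_id in frame_ids}


def find_video_info(index: dict[str, VideoInfo], frame_ids: list[str]) -> list[VideoInfo]:
    """Resolve matched key frames to distinct video information, keeping match order."""
    seen: dict[VideoInfo, None] = {}
    for frame_id in frame_ids:
        info = index.get(frame_id)
        if info is not None:
            seen.setdefault(info, None)
    return list(seen)


# Illustrative data only; the bytes value is a placeholder, not real video content.
movie = VideoInfo("movie-1", uri="https://example.com/videos/movie-1")
clip = VideoInfo("clip-7", content=b"\x00\x01")
key_frame_index = build_key_frame_index({movie: ["m1-k0", "m1-k1"], clip: ["c7-k0"]})
infos = find_video_info(key_frame_index, ["m1-k1", "c7-k0", "m1-k0", "unknown"])
```

Because several key frames of one video can match, the lookup removes duplicates so that each video is reported to the terminal once.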
With the video search method of this embodiment, the user terminal only needs to obtain a picture containing a video picture (for example, by photographing the video picture or taking a screenshot), and then send the keywords related to the video picture, obtained by performing image recognition on that picture, to the server. The server automatically finds the corresponding video according to the keywords sent by the terminal and the stored correspondences, and returns the search result to the terminal. The video search method of this embodiment of the invention is thus based on image recognition technology: the user only needs to obtain a picture containing the video picture to quickly obtain the corresponding video information, which is simple and fast. In addition, the user does not need to remember keyword information for searching the video, which reduces the difficulty of video search and improves the user experience.
Based on the above description, as shown in FIG. 3, this embodiment further provides another video search method, applied to the server side, including the following steps:
Step 300: Obtain key frames of a video; perform the image recognition on the key frames to obtain the keywords of the key frames.
Step 301: Establish a first correspondence between video key frames and keywords, and a second correspondence between video information and video key frames.
In this embodiment, the correspondences may be established by building indexes. For example, a key frame index of videos (the second correspondence) is built first, and then a keyword index of key frames (the first correspondence) is built. In the key frame index of videos, the index value is a key frame and the index object is video information (video content information or resource location information); in the keyword index of key frames, the index value is a keyword and the index object is a key frame. After a keyword sent by the terminal is received, the keyword index of key frames is searched first to match the corresponding key frame, and the key frame index of videos is then searched to match the corresponding video information.
Step 302: Receive a picture keyword sent by the terminal, and search for the video key frame corresponding to the picture keyword according to the picture keyword and the first correspondence.
For example, after the picture keyword is received, the picture keyword is used to match the corresponding key frame in the keyword index of key frames.
Step 303: Search for the video information corresponding to the video key frame according to the found video key frame and the second correspondence.
For example, after the corresponding key frame is matched, the key frame is used to match the corresponding video in the key frame index of videos.
Step 304: Send the found video information to the terminal.
For example, the media content of the video may be sent to the terminal for playback, or the URI may be sent to the terminal so that the terminal obtains the corresponding video content and plays it.
After obtaining the video information, the user would otherwise play the previously watched video from the beginning, and would have to re-watch content already seen or fast-forward, which degrades the user experience. To address this, this embodiment provides a solution in which the server also sends the related time information to the terminal, so that when playing the video the user can continue watching from the previously watched time point, which improves the user experience.
Optionally, in this embodiment, before step 302, the method further includes: obtaining time information of the key frames of the video within the video; and establishing a third correspondence between video key frames and time information.
After step 302, the method of this embodiment further includes: finding the corresponding time information according to the found key frame and the third correspondence; and sending the found time information to the terminal.
The method of this embodiment can send the time information corresponding to the key frame to the terminal, so that when playing the video the user can continue watching from the previously watched time point, which improves the user experience.
As shown in FIG. 4, this embodiment further provides another video search method, applied to the server side, including the following steps:
Step 400: Obtain key frames of a video, perform the image recognition on the key frames to obtain the keywords of the key frames, and obtain time information of the key frames within the video.
Step 401: Establish a first correspondence between video key frames and keywords, a second correspondence between video information and video key frames, and a third correspondence between video key frames and time information, and store the first, second, and third correspondences.
In this step, the first correspondence and the second correspondence together form the correspondence between videos and the keywords of video key frames.
In this embodiment, the correspondences may be established by building indexes, for example by building a key frame index of videos (the second correspondence) and a keyword index of video key frames (the first correspondence).
In this embodiment, the third correspondence may also be established by building an index, for example a key frame index of time information, in which the index value is a key frame and the index object is time information. After a key frame is found, the corresponding time information can be matched in the key frame index of time information according to the found key frame. In this embodiment, the time information may be time point information.
Step 402: Receive a picture keyword sent by the terminal, and search for the video key frame corresponding to the picture keyword according to the picture keyword and the first correspondence.
For example, after the picture keyword is received, the picture keyword is used to match the corresponding key frame in the keyword index of key frames.
Step 403: Search for the video information corresponding to the video key frame according to the found video key frame and the second correspondence, and search for the time information corresponding to the video key frame according to the found video key frame and the third correspondence.
For example, after the corresponding key frame is matched, the key frame is used to match the corresponding video in the key frame index of videos.
Step 404: Send the found video information (for example, content information or resource location information) and the time information to the terminal.
With the method of this embodiment, the corresponding video and the time point within it can be conveniently matched from a video screenshot or a photograph, which makes watching videos more convenient for the user.
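Steps 402 to 404 chain the three correspondences together. The sketch below shows one possible way to do this with plain dictionaries. The function name `search`, the response layout, the use of seconds for time points, and the sample identifiers and URIs are assumptions for illustration, not details given by the embodiment.

```python
def search(
    keywords: list[str],
    keyword_index: dict[str, set[str]],  # first correspondence: keyword -> key frame ids
    video_index: dict[str, str],         # second correspondence: key frame id -> video info
    time_index: dict[str, float],        # third correspondence: key frame id -> time point
) -> list[dict]:
    """Steps 402-404: keywords -> key frames -> (video information, time point) pairs."""
    frames = None
    for keyword in keywords:
        hits = keyword_index.get(keyword, set())
        # Keep only key frames that match every keyword seen so far.
        frames = set(hits) if frames is None else frames & hits
    results = []
    for frame in sorted(frames or ()):
        results.append({"video": video_index[frame], "time_seconds": time_index[frame]})
    return results


# Illustrative data only; here the video information is a resource location (URI).
keyword_index = {"text:final": {"kf-12", "kf-40"}, "subject:stadium": {"kf-12"}}
video_index = {
    "kf-12": "https://example.com/v/match.mp4",
    "kf-40": "https://example.com/v/recap.mp4",
}
time_index = {"kf-12": 754.0, "kf-40": 12.5}
response = search(["text:final", "subject:stadium"], keyword_index, video_index, time_index)
```

Keeping the three indexes separate mirrors the description: the first lookup yields key frames, and each key frame then resolves independently to its video information and its time point.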
Embodiment 2:
This embodiment provides a video search method, applied to the terminal side, which, as shown in FIG. 5, includes the following steps:
Step 501: Obtain a keyword.
In this embodiment, the keyword may be obtained in various ways. For example, the terminal may generate the keyword itself, or another device may generate the keyword and the terminal obtains it from that device.
Optionally, in this embodiment the keyword may be a picture keyword, that is, a keyword related to a video picture obtained by performing image recognition on a picture containing the video picture.
The process by which the terminal obtains the picture keyword may include:
First, obtain a picture containing a video picture;
In this embodiment, there are various ways to obtain a picture containing a video picture, for example taking a screenshot of the video picture, or photographing the video picture (such as photographing a display that is playing the video).
Second, perform image recognition on the picture to obtain the keyword corresponding to the picture.
When the keyword is a picture keyword, performing image recognition on the picture to obtain the keyword corresponding to the picture includes:
performing image recognition on the picture to obtain a picture keyword related to the video picture.
Optionally, the terminal may use a specific image recognition application to perform image recognition on the picture and obtain the picture keyword related to the video picture; the application scans the photo containing the video picture to obtain the keyword.
In this embodiment, a picture containing a video picture takes one of two forms. In the first, the video picture fills the entire picture, so the picture is the video picture, as with a screenshot of the video picture; in this case image recognition is simply performed on the whole photo. In the second, the video picture fills only part of the picture, for example when the photographed area is larger than the video area and the photo therefore contains other content; in this case image recognition is performed on the video picture only, and content that is not part of the video picture is discarded.
In this embodiment, the recognized keywords related to the video picture may include at least one of: text in the video picture, the subject content in the video picture, and the proportion of the video picture occupied by the subject content. In this embodiment, the image recognition process on the terminal side is consistent with that on the server side.
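Because the terminal and the server must arrive at the same keywords, one option is for both sides to reduce a recognition result to canonical keyword strings in the same way. The sketch below is only an illustration of that idea: the `RecognitionResult` type, the `text:`/`subject:`/`ratio:` prefixes, and the bucketing of the proportion into steps of 0.1 are assumptions and are not specified by the embodiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognitionResult:
    """Hypothetical output of an image recognizer for one picture or key frame."""
    texts: tuple[str, ...]  # text found in the video picture
    subject: str            # subject content of the video picture
    subject_ratio: float    # share of the picture occupied by the subject, 0..1


def to_keywords(result: RecognitionResult, ratio_step: float = 0.1) -> frozenset[str]:
    """Turn a recognition result into canonical keyword strings."""
    keywords = {f"text:{t.strip().lower()}" for t in result.texts if t.strip()}
    if result.subject:
        keywords.add(f"subject:{result.subject.strip().lower()}")
        # Bucket the proportion so that small measurement differences still agree.
        bucket = round(result.subject_ratio / ratio_step)
        keywords.add(f"ratio:{bucket}")
    return frozenset(keywords)


# The same scene as seen by the terminal (photo) and by the server (key frame).
terminal_side = RecognitionResult(("Breaking News ",), "Anchor", 0.42)
server_side = RecognitionResult(("breaking news",), "anchor", 0.38)
```

With this normalization both sides produce the same keyword set for the sample data, which is the property the embodiment requires of the two recognition processes.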
Step 502: Send the keyword to the server, so that the server searches for a key frame of a video according to the keyword.
With the video search method of this embodiment, the keyword is sent to the server and the server automatically finds the key frame of the corresponding video, thereby finding the video, which is convenient and simple and improves the user experience.
Considering that, after the server side finds the video key frame, it also sends the video information corresponding to the video key frame to the terminal for video playback, the method of this embodiment may further include, after step 502: receiving the video information sent by the server; and playing the video according to the video information.
As shown in FIG. 6, this embodiment provides a video playback method, including the following steps:
Step 601: Obtain a picture containing a video picture.
Step 602: Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
Step 603: Receive the video information sent by the server.
After obtaining the picture keyword, the terminal sends it to the server. The server finds the corresponding video according to the picture keyword and the stored correspondence between videos and the keywords of video key frames, and then sends the information of the found video to the terminal.
In this embodiment, the result may be a single video or multiple videos, for example the group of videos most strongly associated with the picture keyword. Accordingly, the video information received by the terminal may be a single piece of video information or multiple pieces (for example, a group of video information).
In this embodiment, the video information may include content information of the video or identifier information of the video (for example, a URI).
Step 604: Play the video according to the video information.
When the received video information is content information of the video, the terminal plays the content directly;
when the received video information is location information of the video resource (for example, a URI), the terminal obtains the corresponding video content according to the location information and then plays the obtained video content.
When the terminal receives a group of video information, the user also needs to select the desired video information for playback.
The video playback method of this embodiment enables the user to search for the desired video conveniently and quickly and play it.
Where the server also needs to send time information, the playback method of this embodiment may further include, after step 602: receiving the time information sent by the server. In this case, step 604 includes: playing the video according to the time information and the video information.
Because the terminal in this embodiment can also receive the time information, the terminal knows the time at which the user previously obtained the picture (that is, the time at which the user stopped watching the video) and can start playback from that time instead of from the beginning, which improves the user experience.
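The terminal-side decision in step 604 can be sketched as follows. This is an assumption-laden illustration: `SearchResponse`, `plan_playback`, and the returned dictionary are hypothetical names, and actual fetching and rendering of video are left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SearchResponse:
    """Hypothetical shape of what the server returns for one matched video."""
    content: Optional[bytes] = None        # video content information
    uri: Optional[str] = None              # video resource location information
    time_seconds: Optional[float] = None   # time point of the matched key frame


def plan_playback(response: SearchResponse) -> dict:
    """Decide what to play and where to start, as described for step 604."""
    if response.content is not None:
        kind, source = "content", response.content  # play the received content directly
    elif response.uri is not None:
        kind, source = "uri", response.uri          # fetch the content first, then play
    else:
        raise ValueError("response carries neither content nor a resource location")
    # Without time information, playback starts from the beginning.
    start = response.time_seconds if response.time_seconds is not None else 0.0
    return {"source_kind": kind, "source": source, "start_seconds": start}


plan = plan_playback(SearchResponse(uri="https://example.com/v/1", time_seconds=95.0))
```

When a group of video information is received, a plan of this kind would be computed for the entry the user selects.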
The above describes the case in which the terminal plays the video directly. The following describes the case in which another playback device plays the video. As shown in FIG. 7, this embodiment further provides another video playback method, including the following steps:
Step 701: Obtain a picture containing a video picture.
For example, a television screen that is playing a video is photographed to obtain a picture containing the video picture.
Step 702: Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
Optionally, the terminal may use a specific image recognition application to perform image recognition on the picture and obtain the picture keyword related to the video picture; the application scans the photo containing the video picture to obtain the keyword.
In this embodiment, a picture containing a video picture takes one of two forms. In the first, the video picture fills the entire picture, so the picture is the video picture, as with a screenshot of the video picture; in this case image recognition is simply performed on the whole photo. In the second, the video picture fills only part of the picture, for example when the photographed area is larger than the video area and the photo therefore contains other content; in this case image recognition is performed on the video picture only, and content that is not part of the video picture is discarded. For example, when recognizing a picture obtained by photographing a television screen, only the content of the television screen is recognized, and the parts of the picture that do not belong to the television screen content are discarded.
Step 703: Receive the video information sent by the server.
For details, refer to the description of step 603 above.
Step 704: Send the video information to a playback device, so that the playback device plays the video according to the video information.
In this embodiment, the terminal does not play the video directly; instead, it forwards the video information sent by the server to a playback device (for example, a television or a set-top box) for playback.
Optionally, when the received video information is content information of the video, the terminal sends the content information to the playback device, and the playback device plays the video directly after receiving it;
when the received video information is video resource location information, the terminal sends the video resource location information to the playback device, and the playback device obtains the corresponding video content according to the received location information and plays it.
Where the server also sends time information to the terminal, the method shown in FIG. 7 further includes, after step 702: receiving the time information sent by the server; and sending the time information to the playback device, so that the playback device plays the video according to the time information and the video information.
Embodiment 3:
Based on the descriptions of Embodiment 1 and Embodiment 2, this embodiment describes an application of the methods of Embodiment 1 and Embodiment 2:
First, the server builds the key frame index of videos, the keyword index of key frames, and the key frame index of time points. The procedure is as follows:
1. Process all videos to obtain the key frames of each video, and build the key frame index of videos.
A key frame is an independent, complete picture; within a GOP (group of pictures), the subsequent video frames depend on the key frame.
2. Obtain the time point information of each key frame within its video, perform image recognition on all key frames to obtain the keyword information of each key frame, and save it.
In this embodiment, the image recognition algorithm and the content of the knowledge base determine the content of the keywords, and thus also the accuracy of video search and positioning.
Many current applications can fairly accurately recognize the text in a picture, its subject content, and the proportion of the picture occupied by the subject content; such a group of keyword information can be used to identify a picture. This group of keywords is what is referred to as the keywords in this document.
3. Build the keyword index of key frames and the key frame index of time points.
The following describes the video search and playback procedure, taking direct playback by the terminal as an example:
After the terminal obtains a picture of a video screenshot, by photographing with its camera or in another way, the procedure, as shown in FIG. 8, includes the following steps:
Step 801: The terminal scans the screenshot picture and obtains the keyword information of the screenshot picture.
Step 802: The terminal sends the keyword information to the server.
Step 803: The server receives the keyword information sent by the terminal, and searches the keyword index of key frames according to the keyword information to match the corresponding key frame.
Because the screenshot is not necessarily taken exactly at a key frame position, the screenshot may not match any key frame exactly, and the one or more closest video frames need to be matched.
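The embodiment does not say how the closest frames are chosen. One simple possibility, shown here purely as an assumption, is to rank key frames by the Jaccard similarity between the keyword set of the screenshot and the keyword set of each key frame. The function names, the `top_n` and `min_score` parameters, and the sample data are hypothetical.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Share of keywords common to both sets, from 0.0 (disjoint) to 1.0 (identical)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def closest_key_frames(
    query: set[str],
    frame_keywords: dict[str, set[str]],
    top_n: int = 3,
    min_score: float = 0.0,
) -> list[tuple[str, float]]:
    """Return up to top_n key frames whose keywords overlap most with the query."""
    scored = [(jaccard(query, kws), frame) for frame, kws in frame_keywords.items()]
    # Best score first; ties are broken by key frame id for a stable order.
    ranked = sorted((s for s in scored if s[0] > min_score), key=lambda s: (-s[0], s[1]))
    return [(frame, score) for score, frame in ranked[:top_n]]


# Illustrative data only: no key frame matches the screenshot keywords exactly.
frame_keywords = {
    "kf-1": {"subject:car", "text:sale", "ratio:3"},
    "kf-2": {"subject:car", "ratio:5"},
    "kf-3": {"subject:dog"},
}
query = {"subject:car", "text:sale", "ratio:4"}
ranked = closest_key_frames(query, frame_keywords)
```

In this example `kf-1` shares two of four distinct keywords with the query and ranks first, while `kf-3` shares none and is dropped.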
Step 804: According to the matched key frame, the server searches the key frame index of time points and the key frame index of videos to match the corresponding video and time point.
The matching result in this step may be a single video, in which case the server sends one video or one piece of identifier information to the terminal.
The matching result in this step may be a group of videos, in which case a group of videos or identifier information is sent to the terminal.
Step 805: The server sends the video information corresponding to the matched video, together with the time point, to the terminal.
The video information may include the identifier information corresponding to the matched video or the video content of the matched video.
Step 806: The terminal receives the time point and the video, or the time point and the identifier information, sent by the server, and then plays the corresponding video according to the received information.
The following describes the video search and playback flow, taking video played on another playback device (a television) as an example:
On the premise that the mobile phone has already photographed the video played on the television to obtain a photo containing the video content, the video search and playback process, as shown in FIG. 9, includes the following steps:
Step 901: The mobile phone launches a specific recognition application to scan the photo and obtain keyword information related to the video content.
Because the photographed area may be larger than the video area, the television screen content needs to be identified, and the parts of the photo that do not belong to the television screen content are discarded.
In this embodiment, the recognized keyword information may be the subject information of the photo and the percentages of its various colors.
Step 902: The specific recognition application sends the keyword information over the network to the server where the video is located.
Step 903: After receiving the keyword information, the server uses it to search the keyword index of the key frames, matching the corresponding key frame.
The matching result may be a single key frame in one video, or the most strongly correlated key frames across a group of videos.
Step 904: Using the matched key frame, the server searches the index from key frames to time points and the index from key frames to videos, and matches the corresponding video and time point information.
Here the matching result may be one video or a group of videos, with one piece of time point information or a group of time point information.
Step 905: The server sends the video information corresponding to the matched video, together with the time point information, to the terminal.
The video information may include the identification information corresponding to the matched video, or the video content of the matched video.
Because a group of video information may be received, the mobile phone application or the mobile phone user needs to filter it in that case. For example, the user selects the video identifier of the desired video from a group of video identification information.
Step 906: The mobile phone pushes the video information and the time point information to the television or set-top box.
The push may use AirPlay, DLNA, or a similar mechanism.
Step 907: The television or set-top box starts playing the corresponding program according to the video information and the time point information.
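Step 901 mentions discarding the part of the photo outside the television screen and using color percentages as keyword information. A minimal sketch of that idea is shown below. It assumes the screen bounding box has already been detected by some other means, represents the photo as nested lists of RGB tuples, and uses a deliberately crude color bucketing; `crop`, `coarse_color` and `color_percentages` are hypothetical names, not part of the described method.

```python
from collections import Counter


def crop(pixels, left, top, right, bottom):
    """Keep only the pixels inside the (assumed already detected) screen box."""
    return [row[left:right] for row in pixels[top:bottom]]


def coarse_color(rgb):
    """Map an (r, g, b) pixel to a coarse color name (illustrative buckets)."""
    r, g, b = rgb
    if max(rgb) < 64:
        return "black"
    if min(rgb) > 192:
        return "white"
    if r >= g and r >= b:
        return "red"
    if g >= b:
        return "green"
    return "blue"


def color_percentages(pixels):
    """Percentage of pixels falling into each coarse color bucket."""
    counts = Counter(coarse_color(p) for row in pixels for p in row)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {name: 100.0 * n / total for name, n in counts.items()}


W, R, B = (255, 255, 255), (200, 30, 30), (20, 20, 220)
photo = [
    [W, W, W, W],   # wall around the television
    [W, R, R, W],
    [W, R, B, W],
    [W, W, W, W],
]
screen = crop(photo, left=1, top=1, right=3, bottom=3)
percentages = color_percentages(screen)
print(percentages)
```

In this toy photo the white border stands in for the surroundings of the television; after cropping, three of the four remaining pixels are red and one is blue.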
Embodiment 4:
As shown in FIG. 10, this embodiment provides a video search apparatus, applied to a server, comprising a first acquiring module and a first search module.
The first acquiring module is configured to acquire a first correspondence between key frames of a video and keywords.
The first search module is configured to receive a keyword sent by a terminal and, according to that keyword and the first correspondence, find the corresponding key frame of the video.
As shown in FIG. 11, the video search apparatus of this embodiment further comprises a first sending module.
The first sending module is configured to send video information corresponding to the found key frame to the terminal.
Optionally, the first acquiring module is further configured to acquire a second correspondence between video information and key frames of the video.
The first search module is further configured to find the video information corresponding to the key frame according to the found key frame and the second correspondence.
Optionally, the received keyword includes a picture keyword, the picture keyword being a keyword related to a video picture, obtained by performing image recognition on a picture containing that video picture.
Optionally, the first acquiring module comprises:
a key frame acquiring unit, configured to acquire key frames of the video;
a keyword acquiring unit, configured to perform the image recognition on the key frames of the video to obtain the keywords of the key frames; and
a first correspondence establishing unit, configured to establish the first correspondence between the key frames of the video and the keywords.
Optionally, the keywords of the key frame further include at least one of: text in the key frame, the main subject content in the key frame, and the proportion of the key frame occupied by the main subject content.
Optionally, the first acquiring module further comprises:
a time information acquiring unit, configured to acquire time information of the key frames of the video within the video; and
a third correspondence establishing unit, configured to establish a third correspondence between the key frames of the video and the time information.
The first sending module is further configured to find the corresponding time information according to the found key frame and the third correspondence, and to send the found time information to the terminal.
Optionally, the video information includes video content information or video resource location information.
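The three correspondences handled by the server-side modules can be pictured as plain mappings. The sketch below is an assumption-laden illustration rather than the apparatus itself: key frames are taken as already extracted and recognized, the lookup requires a key frame to carry every received keyword (the text does not fix the matching rule), and all function names and record fields are hypothetical.

```python
def build_correspondences(records):
    """Build the three mappings from pre-extracted key-frame records.

    Each record is (frame_id, keywords, video_info, time_info).
    """
    first = {}    # keyword -> set of key-frame ids     (first correspondence)
    second = {}   # key-frame id -> video information   (second correspondence)
    third = {}    # key-frame id -> time information    (third correspondence)
    for frame_id, keywords, video_info, time_info in records:
        for keyword in keywords:
            first.setdefault(keyword, set()).add(frame_id)
        second[frame_id] = video_info
        third[frame_id] = time_info
    return first, second, third


def find_key_frames(first, keywords):
    """Role of the first search module: key frames carrying every keyword."""
    candidate_sets = [first.get(keyword, set()) for keyword in keywords]
    return set.intersection(*candidate_sets) if candidate_sets else set()


def lookup(correspondences, keywords):
    """Role of the first sending module: (video info, time info) per match."""
    first, second, third = correspondences
    frames = find_key_frames(first, keywords)
    return sorted((second[f], third[f]) for f in frames)


records = [
    ("kf1", {"goal", "stadium"}, "video-A", 754.0),
    ("kf2", {"interview", "stadium"}, "video-A", 3120.0),
    ("kf3", {"goal", "beach"}, "video-B", 61.5),
]
correspondences = build_correspondences(records)
print(lookup(correspondences, ["goal", "stadium"]))
```

Keeping the second and third correspondences separate from the keyword index mirrors the optional structure in the text: a deployment that only returns video information would not need the time mapping.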
As shown in FIG. 12, this embodiment further provides a video search apparatus, applied to a terminal, comprising a second acquiring module and a second sending module.
The second acquiring module is configured to acquire a keyword.
The second sending module is configured to send the keyword to a server, so that the server finds a key frame of a video according to the keyword.
The second acquiring module acquiring a keyword includes: acquiring a picture containing a video picture; and
performing image recognition on the picture to obtain a keyword corresponding to the picture.
Optionally, the video search apparatus further comprises a second receiving module, configured to receive video information sent by the server.
An embodiment of the present invention further provides a video playing apparatus, which comprises any of the video search apparatuses provided by the embodiments of the present invention, and further comprises a playing module, configured to play video according to the video information.
Optionally, the second sending module is further configured to receive time information sent by the server.
The step in which the playing module plays video according to the video information includes:
playing video according to the video information and the time information.
The video search apparatus further comprises a third sending module, configured to:
after the video information sent by the server is received, send the video information to a playback device, so that the playback device plays video according to the video information.
Optionally, the second receiving module is further configured to receive time information sent by the server.
The third sending module is further configured to send the time information to the playback device, so that the playback device plays video according to the time information and the video information.
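The terminal-side modules can be sketched as a small class whose collaborators are injected as callables. Everything here is hypothetical scaffolding: `Terminal`, `recognize`, `query_server` and `play` are illustrative names, the image recognizer and the network call are stand-ins, and `play` may represent either the local playing module or a push to a separate playback device.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Terminal:
    # Stand-in for image recognition on a picture containing a video picture.
    recognize: Callable[[bytes], list]
    # Stand-in for sending keywords to the server and receiving its answer.
    query_server: Callable[[list], Optional[Tuple[str, float]]]
    # Local playing module, or a push to a playback device.
    play: Callable[[str, float], None]

    def search_and_play(self, picture: bytes) -> bool:
        keywords = self.recognize(picture)      # second acquiring module
        result = self.query_server(keywords)    # second sending / receiving modules
        if result is None:                      # the server found no match
            return False
        video_info, time_info = result
        self.play(video_info, time_info)        # playing / third sending module
        return True


played = []
terminal = Terminal(
    recognize=lambda picture: ["goal", "stadium"],
    query_server=lambda kws: ("video-A", 754.0) if "goal" in kws else None,
    play=lambda video, t: played.append((video, t)),
)
ok = terminal.search_and_play(b"fake-image-bytes")
print(ok, played)
```

Injecting the collaborators keeps the sketch independent of any particular recognition library or push protocol, neither of which the text specifies.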
With the video search apparatus of this embodiment, the user terminal only needs to obtain a picture containing a video picture (for example, by photographing the video picture or taking a screenshot), perform image recognition on that picture to obtain keywords related to the video picture, and send the keywords to the server. The server then automatically finds the corresponding video according to the keywords sent by the terminal and the stored correspondences, and returns the search result to the terminal. The video search method of the embodiments of the present invention is thus based on image recognition technology: the user only needs to obtain a picture containing a video picture to quickly obtain the corresponding video information, which is simple and fast. In addition, with the apparatus of this embodiment the user does not need to remember keyword information for searching for the video, which reduces the difficulty of video search and improves the user experience.
An embodiment of the present invention further provides a computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being used to perform the above method.
A person of ordinary skill in the art will understand that all or some of the steps of the above embodiments may be implemented as a computer program flow. The computer program may be stored in a computer-readable storage medium and executed on a corresponding hardware platform (such as a system, equipment, apparatus, or device); when executed, it performs one of the steps of the method embodiments or a combination of them.
Optionally, all or some of the steps of the above embodiments may also be implemented with integrated circuits. The steps may each be fabricated as a separate integrated circuit module, or several of the modules or steps may be fabricated as a single integrated circuit module.
The apparatuses, functional modules, and functional units in the above embodiments may be implemented with general-purpose computing devices. They may be concentrated on a single computing device or distributed over a network formed by multiple computing devices.
When the apparatuses, functional modules, and functional units in the above embodiments are implemented in the form of software functional modules and sold or used as independent products, they may be stored in a computer-readable storage medium. The computer-readable storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.
Industrial applicability
With the solution of the embodiments of the present invention, which is based on image recognition technology, the user only needs to obtain a picture containing a video picture to quickly obtain the corresponding video information, which is simple and fast. In addition, with the method of the present invention the user does not need to remember keyword information for searching for the video, which reduces the difficulty of video search and improves the user experience.

Claims (18)

  1. A video search method, applied to a server, comprising the following steps:
    acquiring a first correspondence between key frames of a video and keywords;
    receiving a keyword sent by a terminal, and finding the corresponding key frame of the video according to the keyword sent by the terminal and the first correspondence.
  2. The video search method according to claim 1, wherein after the key frame is found, the method further comprises:
    sending video information corresponding to the found key frame to the terminal.
  3. The video search method according to claim 2, wherein before receiving the keyword sent by the terminal, the method further comprises: acquiring a second correspondence between video information and key frames of the video;
    after finding the corresponding key frame of the video according to the keyword sent by the terminal and the first correspondence, and before sending the video information corresponding to the found key frame to the terminal, the method further comprises:
    finding the video information corresponding to the key frame according to the found key frame and the second correspondence.
  4. The video search method according to any one of claims 1 to 3, wherein the received keyword includes a picture keyword, the picture keyword being a keyword related to a video picture, obtained by performing image recognition on a picture containing the video picture.
  5. The video search method according to claim 4, wherein acquiring the first correspondence between key frames of the video and keywords comprises:
    acquiring key frames of the video;
    performing the image recognition on the key frames of the video to obtain the keywords of the key frames;
    establishing the first correspondence between the key frames of the video and the keywords.
  6. The video search method according to claim 5, wherein the keywords of the key frame further include at least one of: text in the key frame, the main subject content in the key frame, and the proportion of the key frame occupied by the main subject content.
  7. The video search method according to claim 5, wherein before receiving the picture keyword sent by the terminal, the method further comprises:
    acquiring time information of the key frames of the video within the video;
    establishing a third correspondence between the key frames of the video and the time information;
    after finding the corresponding key frame of the video, the method further comprises:
    finding the corresponding time information according to the found key frame and the third correspondence;
    sending the found time information to the terminal.
  8. The video search method according to claim 2, wherein the video information includes video content information or video resource location information.
  9. A video search method, applied to a terminal, comprising the following steps:
    acquiring a keyword;
    sending the keyword to a server, so that the server finds a key frame of a video according to the keyword.
  10. The video search method according to claim 9, wherein the step of acquiring a keyword comprises:
    acquiring a picture containing a video picture;
    performing image recognition on the picture to obtain a keyword corresponding to the picture.
  11. The video search method according to claim 9 or 10, wherein after sending the keyword to the server, the method further comprises:
    receiving video information sent by the server;
    playing video according to the video information.
  12. The video search method according to claim 11, wherein after sending the keyword to the server, the method further comprises: receiving time information sent by the server;
    the step of playing video according to the video information comprises:
    playing video according to the video information and the time information.
  13. The video search method according to claim 9 or 10, wherein after sending the keyword to the server, the method further comprises:
    receiving video information sent by the server;
    sending the video information to a playback device, so that the playback device plays video according to the video information.
  14. The video search method according to claim 13, wherein after sending the keyword to the server, the method further comprises:
    receiving time information sent by the server;
    sending the time information to the playback device, so that the playback device plays video according to the time information and the video information.
  15. A video search apparatus, applied to a server, comprising a first acquiring module and a first search module;
    the first acquiring module being configured to acquire a first correspondence between key frames of a video and keywords;
    the first search module being configured to receive a keyword sent by a terminal, and to find the corresponding key frame of the video according to the keyword sent by the terminal and the first correspondence.
  16. The video search apparatus according to claim 15, further comprising a first sending module;
    the first sending module being configured to send video information corresponding to the found key frame to the terminal.
  17. A video search apparatus, applied to a terminal, comprising a second acquiring module and a second sending module;
    the second acquiring module being configured to acquire a keyword;
    the second sending module being configured to send the keyword to a server, so that the server finds a key frame of a video according to the keyword.
  18. The video search apparatus according to claim 17, wherein the second acquiring module acquiring a keyword comprises: acquiring a picture containing a video picture;
    performing image recognition on the picture to obtain a keyword corresponding to the picture.
PCT/CN2016/080770 2015-05-29 2016-04-29 Video search method and apparatus WO2016192501A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510287451.2A CN106294454A (en) 2015-05-29 2015-05-29 Video retrieval method and device
CN201510287451.2 2015-05-29

Publications (1)

Publication Number Publication Date
WO2016192501A1 true WO2016192501A1 (en) 2016-12-08

Family

ID=57440260

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/080770 WO2016192501A1 (en) 2015-05-29 2016-04-29 Video search method and apparatus

Country Status (2)

Country Link
CN (1) CN106294454A (en)
WO (1) WO2016192501A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025275B (en) * 2017-03-21 2019-11-15 腾讯科技(深圳)有限公司 Video searching method and device
CN107862003A (en) * 2017-10-24 2018-03-30 珠海市魅族科技有限公司 Video contents search method, apparatus, terminal and readable storage medium storing program for executing
CN107992627A (en) * 2017-12-25 2018-05-04 浙江宇视科技有限公司 Demand video real-time searching method and apparatus
CN110019933A (en) * 2018-01-02 2019-07-16 阿里巴巴集团控股有限公司 Video data handling procedure, device, electronic equipment and storage medium
CN108259974A (en) * 2018-03-07 2018-07-06 优酷网络技术(北京)有限公司 Video matching method and device
CN109146789A (en) * 2018-08-23 2019-01-04 北京优酷科技有限公司 Picture splicing method and device
CN111666453B (en) * 2019-03-07 2024-01-02 杭州海康威视数字技术股份有限公司 Video management and retrieval method and device, electronic equipment and storage medium
CN112019789B (en) * 2019-05-31 2022-05-31 杭州海康威视数字技术股份有限公司 Video playback method and device
CN110415569B (en) * 2019-06-29 2021-08-03 嘉兴梦兰电子科技有限公司 Campus classroom sharing education method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1430159A (en) * 2001-12-29 2003-07-16 Lg电子株式会社 Multimedia data searching and browsing system
CN101620629A (en) * 2009-06-09 2010-01-06 中兴通讯股份有限公司 Method and device for extracting video index and video downloading system
CN101917329A (en) * 2009-12-17 2010-12-15 新奥特(北京)视频技术有限公司 Network player and server for providing search service
CN103761345A (en) * 2014-02-27 2014-04-30 苏州千视通信科技有限公司 Video retrieval method based on OCR character recognition technology
US20140193048A1 (en) * 2011-09-27 2014-07-10 Tong Zhang Retrieving Visual Media
CN104639993A (en) * 2013-11-06 2015-05-20 株式会社Ntt都科摩 Video program recommending method and server thereof

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102387310A (en) * 2010-08-31 2012-03-21 腾讯科技(深圳)有限公司 Method and device for positioning video segments
CN102207966B (en) * 2011-06-01 2013-07-10 华南理工大学 Video content quick retrieving method based on object tag
CN102595191A (en) * 2012-02-24 2012-07-18 央视国际网络有限公司 Method and device for searching sport events in sport event videos
CN103593363B (en) * 2012-08-15 2016-12-21 中国科学院声学研究所 The method for building up of video content index structure, video retrieval method and device
TW201421994A (en) * 2012-11-21 2014-06-01 Hon Hai Prec Ind Co Ltd Video searching system and method
CN103559196B (en) * 2013-09-23 2017-02-22 浙江大学 Video retrieval method based on multi-core canonical correlation analysis
CN103942337B (en) * 2014-05-08 2017-08-18 北京航空航天大学 It is a kind of based on image recognition and the video searching system that matches
CN104036018A (en) * 2014-06-25 2014-09-10 百度在线网络技术(北京)有限公司 Video acquiring method and video acquiring device

Also Published As

Publication number Publication date
CN106294454A (en) 2017-01-04

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16802424

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16802424

Country of ref document: EP

Kind code of ref document: A1