WO2022068494A1 - 搜索目标内容的方法、装置、电子设备及存储介质 - Google Patents

搜索目标内容的方法、装置、电子设备及存储介质 Download PDF

Info

Publication number
WO2022068494A1
WO2022068494A1 PCT/CN2021/115261 CN2021115261W WO2022068494A1 WO 2022068494 A1 WO2022068494 A1 WO 2022068494A1 CN 2021115261 W CN2021115261 W CN 2021115261W WO 2022068494 A1 WO2022068494 A1 WO 2022068494A1
Authority
WO
WIPO (PCT)
Prior art keywords
content
target
search
processed
search content
Prior art date
Application number
PCT/CN2021/115261
Other languages
English (en)
French (fr)
Inventor
杨晶生
陈可蓉
钱程
熊梦园
郑翔
Original Assignee
北京字跳网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字跳网络技术有限公司 filed Critical 北京字跳网络技术有限公司
Priority to JP2023507572A priority Critical patent/JP2023536330A/ja
Publication of WO2022068494A1 publication Critical patent/WO2022068494A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/483Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/489Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using time information

Definitions

  • the present disclosure relates to the field of computer technology, for example, to a method, apparatus, electronic device, and storage medium for searching for target content.
  • Most of the target content screened by the above method is the same content as the search condition, and the content related to the search condition cannot be searched.
  • the present disclosure provides a method, device, electronic device and storage medium for searching for target content, so as to optimize search conditions, and further improve the comprehensiveness of searched content when searching for corresponding content based on the optimized search conditions.
  • the present disclosure provides a method for searching for target content, the method comprising:
  • Target content matching each target search content is searched from the subtitle information.
  • the present disclosure also provides an apparatus for searching for target content, the apparatus comprising:
  • the search content acquisition module is set to acquire the pending search content in the search content editing control
  • a target search content determination module configured to determine at least one associated to-be-searched content corresponding to the to-be-processed search content, and to use both the at least one associated to-be-searched content and the to-be-processed search content as the target search content;
  • the target content matching module is configured to search the subtitle information for target content that matches each target search content.
  • the present disclosure also provides an electronic device, the electronic device comprising:
  • processors one or more processors
  • storage means arranged to store one or more programs
  • the one or more processors When the one or more programs are executed by the one or more processors, the one or more processors implement the above-mentioned method for searching for target content.
  • the present disclosure also provides a storage medium containing computer-executable instructions, which, when executed by a computer processor, are used to perform the above-described method of searching for target content.
  • FIG. 1 is a schematic flowchart of a method for searching for target content according to Embodiment 1 of the present disclosure
  • FIG. 2 is a schematic flowchart of a method for searching for target content according to Embodiment 2 of the present disclosure
  • FIG. 3 is a schematic diagram of corresponding display of target content and a marker on a time axis according to Embodiment 2 of the present disclosure
  • FIG. 4 is a schematic diagram of highlighting a corresponding mark on a time axis after triggering target content according to Embodiment 2 of the present disclosure
  • FIG. 5 is a schematic structural diagram of an apparatus for searching for target content according to Embodiment 3 of the present disclosure
  • FIG. 6 is a schematic structural diagram of an electronic device according to Embodiment 4 of the present disclosure.
  • method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
  • the term “including” and variations thereof are open-ended inclusions, ie, "including but not limited to”.
  • the term “based on” is “based at least in part on.”
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • FIG. 1 is a schematic flowchart of a method for searching for target content according to Embodiment 1 of the present disclosure.
  • the embodiment of the present disclosure is applicable to the situation where content matching the target search content is searched from subtitle information.
  • the content of the device is executed, and the device can be implemented in the form of software and/or hardware.
  • the method of this embodiment includes:
  • the search content editing control may be a control displayed on the target page for editing the search content.
  • Subtitle information may also be included on the target page.
  • the subtitle information may include the text to be searched, and the method for generating the subtitle information is not limited in this embodiment.
  • the server may obtain the search content edited in the search content editing control, and use the obtained search content as the pending search content.
  • the search content edited in the search content editing control is "find", and the pending search content obtained by the server The search content is: find.
  • the obtaining the pending search content in the search content editing control includes: if it is detected that the control that starts the search is triggered, obtaining the pending search content edited in the search content editing control; or, If it is detected that the search content editing control is triggered, the to-be-processed search content edited in the search content editing control is acquired.
  • acquiring the search content to be processed may be implemented in at least two ways as follows.
  • the target page can include search content editing controls and controls that initiate searches.
  • the control that initiates the search may be the control that "confirms" the search.
  • the user can edit the corresponding content in the search content editing control. After the content editing is completed, the user can trigger the control to start the search, that is, click the "OK" control, and the server can obtain the pending search content in the search content editing control.
  • S120 Determine at least one associated to-be-searched content corresponding to the to-be-processed search content, and use both the at least one associated to-be-searched content and the to-be-processed search content as target search content.
  • the associated to-be-searched content is determined based on the to-be-processed search content.
  • the associated to-be-searched content may be content obtained based on at least one conversion form corresponding to the to-be-processed search content, for example, the to-be-processed search content is: one, two, three, and the associated to-be-searched content may be based on at least one of "one, two, three"
  • the content obtained in the transformed form is optional, and the associated content to be searched can be "123", "one two three” and so on. Both at least one associated to-be-searched content and the to-be-processed search content may be used as target search content.
  • At least one associated to-be-searched content associated with the to-be-processed search content may be determined, and both the at least one associated to-be-searched content and the to-be-processed search content are used as target search content.
  • the target search content may include multiple pieces, and the content obtained from the subtitle information search that is the same as any target search content is the target content.
  • the to-be-processed search content is "123", and at least one related to-be-searched content in the target search content may be "one two three” and “one two three”. You can take each string of "123”, “one two three” and “one two three” as a whole, and filter out the content that is completely consistent with any of the above three from the subtitle information, that is, the subtitle information. All "123”, “One Two Three”, and “One Two Three” in the text are regarded as the target content.
  • the same content as the target search content is matched from the subtitle information, and the matched content is used as the target content.
  • the advantage of adopting this method is that related content associated with the search content to be processed can be searched from the subtitle information, which improves the comprehensiveness of the determined target content.
  • the related to-be-searched content associated with the to-be-processed search content is determined, that is, the search conditions are optimized, and the corresponding search conditions are searched from the subtitle information based on the optimized search conditions.
  • the comprehensiveness and accuracy of the determined target content are improved.
  • the target content can be displayed in the subtitle information differently.
  • the target content When the target content is determined from the subtitle information, the target content itself is also one or more elements in the subtitle information. When displayed, it can be displayed differently from other elements, so as to highlight the filtered target content and allow users to be more intuitive and easily discover targeted content. Differential display may be displayed in a display format such as color, font, and background pattern.
  • the generation of subtitle information may be: collecting voice information based on a multimedia data stream; The collected voice information is subjected to voice recognition to obtain subtitle information.
  • the generated voice information can be generated according to the voice information, the original language type corresponding to the voice information, and the target translation language type.
  • the subtitle information corresponding to the target translation language type is displayed on the target page.
  • the multimedia data stream may be video stream data corresponding to the real-time interactive interface, or video stream data in a screen-recording video obtained after screen-recording the real-time interactive interface.
  • the real-time interactive interface is any interactive interface in the real-time interactive application scenario. Real-time interactive application scenarios can be implemented through the Internet and computer means, for example, interactive applications implemented through native programs or World Wide Web (web) programs.
  • the language type used by each user can be the same or different.
  • the language type used by other speakers is quite different from the language used by this user, There may be cases where the user cannot know the speech information of other speaking users.
  • the voice information of the speaking user can be collected and converted into corresponding subtitle information.
  • the user can trigger the language type selection control on the target page and select the translation language type to translate the voice information of other speaking users into subtitle information corresponding to the selected translation language type.
  • the original language type refers to the language type used by the users participating in the real-time interaction when speaking.
  • the target translation language type is the language type set by the user on the target page for displaying subtitle information.
  • the subtitle information is translation data corresponding to the voice information.
  • the subtitle information may display the speaking user identity and the speaking time stamp of each piece of translation data.
  • the voice data that is, voice information, of multiple users participating in the interaction can be collected from the multimedia data stream corresponding to the interactive interface, and the original language type corresponding to the voice information can be recorded.
  • the voice information can be translated from the original language type to the target translation language type to obtain translation data corresponding to the voice information.
  • the translation data, the speaking user ID corresponding to the translation data, and the speaking time stamp are used as subtitle information displayed on the target page.
  • the method of determining the language type of the target translation may include at least one of the following: acquiring the language type preset on the target client as the target translation language type; acquiring the login address of the target client, based on the login address Determine the target language translation type corresponding to the geographic location where the target client is located.
  • the first way can be: when it is detected that the user triggers the language type selection control on the target page, that is, when it is detected that the user selects which language type the subtitle information is displayed in, the language type set by the user can be determined, and the set language type can be determined.
  • the language type is used as the target translation language type.
  • a language type selection list may pop up on the target page for the user to select. The user can select any language type. For example, if the user triggers the Chinese language type in the language type selection list and clicks the confirmation button, the server or client can determine that the target translation language type is the Chinese language type.
  • the voice information in the multimedia data stream can be converted into Chinese subtitle information and displayed on the target interface.
  • the second way may be: when it is detected that the user triggers the language type selection control, the login address of the user's client, that is, the client's Internet Protocol (IP) address, can be obtained, so as to determine the client's login address according to the login address. region, and then use the language type used in the region as the target translation language type. For example, when the user triggers the language type selection control, the login address of the user's client is obtained. If it is determined based on the login address that the region to which the client belongs is China, the target translation language type is the Chinese language type.
  • IP Internet Protocol
  • the subtitle information is more in line with the user's reading habits, so that the user can quickly understand the content corresponding to the multimedia data stream, thereby Improve the efficiency of interaction.
  • a time stamp synchronization association relationship between the subtitle information and the multimedia data stream can also be established, and the subtitle information and the multimedia data stream are displayed on the target page, so that the When it is detected that a target content is triggered, the multimedia data stream is jumped to the video playing time corresponding to the target content.
  • the time stamp synchronization association relationship can be understood as the linkage between the multimedia data stream and the subtitle information based on time synchronization.
  • the current time stamp corresponding to the piece of translation data can be determined, and based on the pre-established time stamp synchronization relationship, jump to the multimedia data stream corresponding to the current time stamp, for example , the multimedia data stream is obtained based on the screen recording video, then jumps to the audio and video frame corresponding to the current timestamp in the screen recording video.
  • the current timestamp corresponding to the audio and video frame can be obtained.
  • the pre-established timestamp synchronization relationship it can be determined that the audio and video frame is in the subtitle information.
  • the translation data can be displayed separately, and optionally, highlighted.
  • the current time stamp of the target content can be obtained, and based on the pre-established time stamp synchronization relationship, the multimedia data stream can be jumped to the playback position corresponding to the current time stamp, so that the user can understand the speech.
  • the tone and state of the user when publishing the voice information including the target content thereby improving the efficiency of interaction.
  • the technical solution of this embodiment by establishing a timestamp synchronization relationship between the multimedia data stream and the subtitle information, realizes the synchronous linkage between the subtitle information and the multimedia data stream, so that it is convenient for the user to quickly find the corresponding subtitle information in the multimedia data stream. location, so that it is convenient to understand the voice information of the speaking user in combination with the context before and after, and the efficiency of information interaction is improved.
  • FIG. 2 is a schematic flowchart of a method for searching for target content according to Embodiment 2 of the present disclosure.
  • the content type of the to-be-processed search content may be acquired, and then the corresponding related to-be-searched content is determined based on the content type.
  • the technical terms that are the same as or corresponding to the above embodiments are not repeated in this embodiment.
  • the method includes:
  • S220 Determine the content type of the to-be-processed search content.
  • the content type may include a numeric type and a foreign language type. Thereby, it can be determined which of the above types the content type of the search content to be processed is.
  • the ways of determining the associated to-be-searched content corresponding to the to-be-processed search content are also different. Therefore, before determining the associated to-be-searched content, the content type of the to-be-processed search content may be determined first.
  • S230 Determine at least one associated to-be-searched content corresponding to the to-be-processed search content according to the content type and the content to be converted corresponding to the content type.
  • the associated to-be-searched content is determined based on the to-be-processed search content and the content type of the to-be-processed search content.
  • the content type includes a digital type, and according to the content type and the content to be converted corresponding to the content type, at least one associated to-be-searched content corresponding to the to-be-processed search content is determined, including:
  • the to-be-processed search content may only include numbers, or may include both numbers and other content.
  • the content type corresponding to the to-be-processed search content may be a number type.
  • the number in the to-be-processed search content may be the to-be-converted content of the to-be-processed search content.
  • the conversion form can be understood as at least one deformation form of the content to be converted.
  • the deformation form can be converting numbers into corresponding Chinese characters, converting numbers into corresponding upper and lower case, and converting numbers into corresponding English, etc.
  • the conversion form may be preset, for example, the content to be converted is converted into English, Japanese, and/or French, and the like.
  • the content to be replaced is determined based on at least one conversion form of the content to be converted.
  • an associated content to be searched corresponding to the content to be processed is generated. If only numbers are included in the to-be-processed search content, the to-be-converted content is the same as the to-be-processed search content.
  • At least one conversion form corresponding to the digital type can be determined, and then the associated content to be searched can be obtained.
  • searching for content from subtitle information based on the associated content to be searched the comprehensiveness of the searched target content is improved.
  • the number of associated content to be searched is determined by the preset conversion form or convertible form.
  • the search content to be processed only includes numbers.
  • the preset conversion form is 5, and the obtained content to be replaced is 5
  • the number of associated contents to be searched consists of five contents to be replaced and one contents to be converted.
  • the content type includes a foreign language type.
  • the content of the "foreign language type" may be content expressed in a language different from the preset language type, for example, the foreign language type may be at least one preset language type that is different from the language type currently set in the user's client Or, the foreign language type can also be other language type different from the language type of the voice information.
  • the determining, according to the content type and the content to be converted corresponding to the content type, at least one associated to-be-searched content corresponding to the to-be-processed search content includes: acquiring the foreign-language type in the to-be-processed search content.
  • the corresponding content to be converted determine the content to be replaced corresponding to the content to be converted, and the content to be replaced includes the root and/or extension word corresponding to the content to be converted; based on the content to be replaced and the content to be replaced
  • For the to-be-processed search content at least one associated to-be-searched content corresponding to the to-be-processed search content is determined.
  • the to-be-processed search content may include only foreign languages, or may include both foreign languages and other contents.
  • the other contents may be Chinese characters, numbers, symbols, and the like.
  • the foreign language can be English, Japanese, French and other preset languages.
  • the content type corresponding to the to-be-processed search content may be a foreign language type.
  • the foreign language in the to-be-processed search content may be the to-be-converted content in the to-be-processed search content. Since foreign words have active, passive, tense and possessive situations in different contexts, when it is detected that the search content to be processed includes search content of foreign language type, in order to facilitate the search for the corresponding content from the subtitle information , at least one conversion form corresponding to the vocabulary can be determined.
  • the different tenses of the vocabulary, active and passive, root or derivative words, etc., the content to be converted is converted based on at least one conversion form. Content to be replaced.
  • the associated to-be-searched content corresponding to the to-be-processed search content is generated. If the to-be-processed search content only includes foreign languages, the to-be-converted content is the same as the to-be-processed search content.
  • the above method can quickly find the relevant to-be-searched content associated with the to-be-processed search content, and then find the corresponding target content from the subtitle information based on the associated to-be-searched content, which improves the comprehensiveness of the target content determined from the subtitle information.
  • the determining at least one associated to-be-searched content corresponding to the to-be-processed search content based on the to-be-replaced content and the to-be-processed search content includes: adding the to-be-processed search content The to-be-converted content is replaced with the to-be-replaced content, and at least one associated to-be-searched content corresponding to the to-be-processed search content is obtained.
  • the content to be replaced corresponding to the number can be directly used as the associated to-be-searched content; correspondingly, if the to-be-processed search content only includes foreign languages, the to-be-replaced content corresponding to the foreign language can be The replacement content is directly used as the related content to be searched.
  • the content to be converted in the search content to be processed can be replaced with a content to be replaced, and the replacement
  • the subsequent to-be-processed search content is regarded as an associated to-be-searched content. If the number of contents to be replaced is 5, the contents to be converted may be sequentially replaced with contents to be replaced to obtain the associated contents to be searched, that is, 5 associated contents to be searched may be obtained.
  • the content to be converted in the search content to be processed can be replaced by the content to be converted.
  • the replaced to-be-processed search content is used as one of the associated to-be-searched content. If the number of contents to be replaced is 5, the contents to be converted may be sequentially replaced with contents to be replaced to obtain the associated contents to be searched, that is, 5 associated contents to be searched may be obtained.
  • the to-be-processed search content includes both foreign language, numbers and other content
  • the content to be replaced corresponding to the foreign language type and the content to be replaced corresponding to the numeric type can be determined respectively, and the obtained content to be replaced can be replaced in the to-be-replaced content.
  • the corresponding positions in the search content are processed, and finally a plurality of associated contents to be searched are obtained.
  • S240 Use both the at least one associated content to be searched and the search content to be processed as target search content.
  • S250 Search for target content matching each target search content from the subtitle information.
  • the technical solution of the embodiment of the present disclosure improves the richness of search conditions by acquiring the content type in the search content to be processed, determining the corresponding associated content to be searched based on the content type, and then determining the target search content based on the associated content to be searched.
  • searching based on the target search content the comprehensiveness of the found target content is improved.
  • the method further includes: determining a time stamp of each target content in the time axis of the multimedia data stream, and marking the position corresponding to the time stamp on the time axis.
  • the time axis is the time axis corresponding to the multimedia data stream.
  • the total duration corresponding to the multimedia data stream is 50 minutes, and the time axis corresponding to the multimedia data stream is also 50 minutes.
  • the timestamp corresponding to the target content can be determined according to the sentence to which the target content belongs.
  • the position of the time stamp on the time axis can be determined and marked at the position, for example, it can be marked with a circle or a triangle below the position of the time axis, see FIG. 3 .
  • the search content edited by the user in the search content editing control is "algorithm", and the target content that is the same as the “algorithm” can be searched from the subtitle information and displayed differently, such as highlighted, and the target content can be determined.
  • the timestamp of the sentence to which the content belongs and is marked on the time axis corresponding to the multimedia data stream based on the timestamp, for example, marked with a dot.
  • the user can set the color and size of the mark according to actual needs, which is not limited here.
  • the number of target content can be displayed, for example, the total number displayed in the search content editing control is 12, see FIG. 3 .
  • the advantage of marking the audio and video frames corresponding to the target content on the time axis is that the user can clearly determine the position of the target content in the multimedia data stream according to the mark on the time axis, thereby improving the search efficiency. Convenience of the corresponding target content.
  • the number of target contents may be more than one, and correspondingly, the number of markers on the time axis may also be more than one. Referring to FIG. 3 , the number of target contents is 12, and the number of markers on the time axis is also 12.
  • the search content editing control In order to facilitate the user to determine the number of the currently triggered target content among all the target contents, the search content editing control also displays the sequence corresponding to the currently triggered target content.
  • the method further includes: when detecting that the target content is triggered, determining a target time stamp corresponding to the target content; distinguishing and displaying the mark corresponding to the target time stamp .
  • the user can trigger any target content.
  • the timestamp (target timestamp) corresponding to the user-triggered target content can be determined, and the target mark corresponding to the target timestamp on the time axis can be determined.
  • the target time stamp corresponding to the target content corresponding to mark 1 can be determined, and the corresponding mark on the time axis can be determined according to the target time stamp as The mark corresponding to mark 2 can be highlighted.
  • the advantage of displaying the marks corresponding to the target content differently on the time axis is that the user can know the position of the triggered target content in the multimedia data stream, which improves the user experience.
  • the accuracy of the audio and video frames corresponding to the determined target content is that the user can know the position of the triggered target content in the multimedia data stream, which improves the user experience.
  • FIG. 5 is a schematic structural diagram of an apparatus for searching for target content according to Embodiment 3 of the present disclosure. As shown in FIG. 5 , the apparatus includes: a search content acquisition module 310 , a target search content determination module 320 and a target content matching module 330 .
  • the search content obtaining module 310 is configured to obtain the to-be-processed search content in the search content editing control; the target search content determination module 320 is configured to determine at least one associated to-be-searched content corresponding to the to-be-processed search content, At least one associated to-be-searched content and the to-be-processed search content are generated as target search content; the target content matching module 330 is configured to search for target content matching each target search content from the subtitle information.
  • the technical solution of the embodiment of the present disclosure is to determine the related to-be-searched content associated with the to-be-processed search content when acquiring the to-be-processed search content, and then filter out the same content as the related to-be-searched content from the subtitle information, which improves the determination of the content to be searched.
  • the comprehensiveness and accuracy of the target content is to determine the related to-be-searched content associated with the to-be-processed search content when acquiring the to-be-processed search content, and then filter out the same content as the related to-be-searched content from the subtitle information, which improves the determination of the content to be searched.
  • the search content obtaining module 310 is configured to obtain the pending search content edited in the search content editing control if it is detected that the control for starting the search is triggered; or, if the search content is detected When the content editing control is triggered, the to-be-processed search content edited in the search content editing control is acquired.
  • the target search content determination module 320 is configured to determine at least one associated to-be-searched content corresponding to the to-be-processed search content by: determining the content type of the to-be-processed search content; At least one associated to-be-searched content corresponding to the to-be-processed search content is determined according to the content type and the content to be converted corresponding to the content type.
  • the content type includes a digital type
  • the target search content determination module 320 is configured to determine the content type to be processed according to the content type and the content to be converted corresponding to the content type in the following manner At least one associated content to be searched corresponding to the search content: obtain the content to be converted corresponding to the digital type in the search content to be processed; determine at least one conversion form corresponding to the content to be converted, and based on the content to be converted The at least one conversion form determines the content to be replaced corresponding to the content to be converted; based on the content to be replaced and the search content to be processed, determine at least one association to be searched corresponding to the search content to be processed content.
  • the content type includes a foreign language type
  • the target search content determination module 320 is configured to determine the content type to be processed according to the content type and the content to be converted corresponding to the content type in the following manner At least one associated content to be searched corresponding to the search content: obtain the content to be converted corresponding to the foreign language type in the search content to be processed; determine the content to be replaced corresponding to the content to be converted, and the content to be replaced includes The root and/or extension word corresponding to the content to be converted; based on the content to be replaced and the search content to be processed, at least one associated content to be searched corresponding to the search content to be processed is determined.
  • the target search content determination module 320 is configured to determine at least one associated to-be-searched content corresponding to the to-be-processed search content based on the to-be-replaced content and the to-be-processed search content in the following manner : Replace the to-be-converted content in the to-be-processed search content with the to-be-replaced content to obtain at least one associated to-be-searched content corresponding to the to-be-processed search content.
  • the target content matching module 330 is configured to match the same content as each target search content from the subtitle information, and use the matched content as the target content.
  • the target content is displayed in the subtitle information differently.
  • the method before acquiring the to-be-processed search content in the search content editing control, the method further includes: determining voice information based on a multimedia data stream; The language type and the target translation language type are used to generate subtitle information corresponding to the target translation language type and displayed on the target page.
  • the method further includes: establishing a time stamp synchronization association relationship between the subtitle information and the multimedia data stream, and combining the subtitle information with the multimedia data stream It is displayed on the target page to jump the multimedia data stream to the video playing time corresponding to the target content when it is detected that a target content is triggered.
  • the apparatus further includes a marking module, configured to determine a time stamp in the time axis of the multimedia data stream corresponding to each target content, and match the time stamp on the time axis with the time stamp. Mark the corresponding location.
  • the device further includes a highlighting module configured to determine a target timestamp corresponding to the target content when it is detected that a target content is triggered;
  • the corresponding target marks are displayed differently on the time axis.
  • the apparatus for searching target content provided by the embodiment of the present disclosure can execute the method for searching target content provided by any embodiment of the present disclosure, and has functional modules and effects corresponding to the execution method.
  • FIG. 6 it shows a schematic structural diagram of an electronic device (eg, a terminal device or a server in FIG. 6 ) 400 suitable for implementing an embodiment of the present disclosure.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (Personal Digital Assistants, PDAs), tablet computers (PADs), and portable multimedia players (Portable Media Players). , PMP), in-vehicle terminals (eg, in-vehicle navigation terminals), etc., and stationary terminals such as digital (Television, TV), desktop computers, and the like.
  • PMP Personal Digital Assistants
  • PDAs Personal Digital Assistants
  • PADs tablet computers
  • PMP portable multimedia players
  • in-vehicle terminals eg, in-vehicle navigation terminals
  • stationary terminals such as digital (Television, TV), desktop computers, and the like.
  • the electronic device shown in FIG. 6 is only an example, and
  • the electronic device 400 may include a processing device (such as a central processing unit, a graphics processor, etc.) 401, which may be stored in a read-only memory (Read-Only Memory, ROM) 402 according to a program or from a storage device 408 programs loaded into Random Access Memory (RAM) 403 to perform various appropriate actions and processes.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • various programs and data necessary for the operation of the electronic device 400 are also stored.
  • the processing device 401 , the ROM 402 , and the RAM 403 are connected to each other through a bus 404 .
  • An Input/Output (I/O) interface 405 is also connected to the bus 404 .
  • the following devices can be connected to the I/O interface 405: input devices 406 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a Liquid Crystal Display (LCD) output device 407 , a speaker, a vibrator, etc.; a storage device 408 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 409 .
  • Communication means 409 may allow electronic device 400 to communicate wirelessly or by wire with other devices to exchange data.
  • FIG. 6 shows electronic device 400 having various means, it is not required to implement or have all of the illustrated means. More or fewer devices may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 409 , or from the storage device 408 , or from the ROM 402 .
  • the processing apparatus 401 executes the above-mentioned functions defined in the methods of the embodiments of the present disclosure.
  • the electronic device provided by the embodiment of the present disclosure and the method for searching target content provided by the above-mentioned embodiment belong to the same concept.
  • the technical details not described in detail in this embodiment please refer to the above-mentioned embodiment, and this embodiment has the same characteristics as the above-mentioned embodiment. Effect.
  • Embodiments of the present disclosure provide a computer storage medium on which a computer program is stored, and when the program is executed by a processor, implements the method for searching for target content provided by the foregoing embodiments.
  • the computer-readable medium described above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • Examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, RAM, ROM, Erasable Programmable Read-Only Memory (EPROM) or flash memory), optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • the program code embodied on the computer-readable medium may be transmitted by any suitable medium, including but not limited to: electric wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
  • clients and servers can communicate using any currently known or future developed network protocols, such as HyperText Transfer Protocol (HTTP), and can communicate with digital data in any form or medium.
  • Communication eg, a communication network
  • Examples of communication networks include Local Area Networks (LANs), Wide Area Networks (WANs), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently Known or future developed networks.
  • LANs Local Area Networks
  • WANs Wide Area Networks
  • the Internet eg, the Internet
  • peer-to-peer networks eg, ad hoc peer-to-peer networks
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device:
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user computer through any kind of network, including a LAN or WAN, or may be connected to an external computer (eg, using an Internet service provider to connect through the Internet).
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner.
  • the name of the unit/module does not constitute a limitation of the unit itself in one case, for example, the target content determination module may also be described as a "content determination module”.
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (Application Specific Standard Products) Standard Parts, ASSP), system on chip (System on Chip, SOC), complex programmable logic device (Complex Programmable Logic Device, CPLD) and so on.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSP Application Specific Standard Products
  • SOC System on Chip
  • complex programmable logic device Complex Programmable Logic Device, CPLD
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. Examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, RAM, ROM, EPROM or flash memory, optical fibers, CD-ROMs, optical storage devices, magnetic storage devices, or Any suitable combination of the above.
  • Example 1 provides a method for searching for target content, the method comprising:
  • Example 2 provides a method for searching for target content, further comprising:
  • the obtaining the to-be-processed search content in the search content editing control includes:
  • Example 3 provides a method for searching for target content, further comprising:
  • the determining at least one associated to-be-searched content corresponding to the to-be-processed search content includes:
  • determining the content type of the to-be-processed search content determining at least one associated to-be-searched content corresponding to the to-be-processed search content according to the content type and the to-be-converted content corresponding to the content type.
  • Example 4 provides a method for searching for target content, further comprising:
  • the content type includes a digital type
  • the at least one associated to-be-searched content corresponding to the to-be-processed search content is determined according to the content type and the to-be-converted content corresponding to the content type, including: :
  • Example 5 provides a method for searching for target content, further comprising:
  • the content type includes a preset language type
  • the at least one associated to-be-searched content corresponding to the to-be-processed search content is determined according to the content type and the to-be-converted content corresponding to the content type.
  • Example 6 provides a method for searching for target content, further comprising:
  • determining at least one associated to-be-searched content corresponding to the to-be-processed search content based on the to-be-replaced content and the to-be-processed search content includes:
  • the to-be-converted content in the to-be-processed search content is replaced with the to-be-replaced content to obtain at least one associated to-be-searched content corresponding to the to-be-processed search content.
  • Example 7 provides a method for searching for target content, further comprising:
  • the searching for target content that matches each target search content from the subtitle information includes:
  • the same content as each target search content is matched from the subtitle information, and the matched content is used as the target content.
  • Example 8 provides a method for searching for target content, further comprising:
  • the target content is displayed differently in the subtitle information.
  • Example 9 provides a method for searching for target content, further comprising:
  • the method before the acquiring the to-be-processed search content in the search content editing control, the method further includes:
  • the voice information is determined; according to the voice information, the original language type corresponding to the voice information, and the target translation language type, subtitle information corresponding to the target translation language type displayed on the target page is generated.
  • Example 10 provides a method for searching for target content, further comprising:
  • the method further includes:
  • Example 11 provides a method for searching for target content, further comprising:
  • jumping the multimedia data stream to a video playback time corresponding to the target content including
  • the current timestamp of the target content is determined; based on the pre-established synchronization relationship of the timestamps and the current timestamp, the multimedia data stream is skipped Go to the video frame corresponding to the current timestamp.
  • Example 12 provides a method for searching for target content, further comprising:
  • the timestamp of each target content in the time axis of the multimedia data stream is determined, and a position corresponding to the timestamp on the time axis is marked.
  • Example 13 provides a method for searching for target content, further comprising:
  • a target time stamp corresponding to the target content is determined; and a target mark corresponding to the target time stamp is displayed on the time axis in a differentiated manner.
  • Example 14 provides an apparatus for searching for target content, including:
  • the search content acquisition module is configured to acquire the to-be-processed search content in the search content editing control;
  • the target search content determination module is configured to determine at least one associated to-be-searched content corresponding to the to-be-processed search content, and the at least one The associated to-be-searched content and the to-be-processed search content are both used as target search content;
  • the target content matching module is configured to search for target content matching each target search content from the subtitle information.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

一种搜索目标内容的方法、装置、电子设备及存储介质,该搜索目标内容的方法包括:获取搜索内容编辑控件中的待处理搜索内容(S110);确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容(S120);从字幕信息中搜索与每个目标搜索内容相匹配的目标内容(S130)。

Description

搜索目标内容的方法、装置、电子设备及存储介质
本申请要求在2020年09月29日提交中国专利局、申请号为202011052041.7的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。
技术领域
本公开涉及计算机技术领域,例如涉及一种搜索目标内容的方法、装置、电子设备及存储介质。
背景技术
在从文档或文本中筛选目标内容时,多是依据用户输入的搜索条件,直接从文档中来筛选,进而得到目标内容。
采用上述方式筛选得到的目标内容,多是与搜索条件相同的内容,无法搜索到与搜索条件相关联的内容。
发明内容
本公开提供了一种搜索目标内容的方法、装置、电子设备及存储介质,以实现优化搜索条件,进而在基于优化后的搜索条件搜索相应的内容时,提高查找内容的全面性。
本公开提供了一种搜索目标内容的方法,该方法包括:
获取搜索内容编辑控件中的待处理搜索内容;
确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;
从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
本公开还提供了一种搜索目标内容的装置,该装置包括:
搜索内容获取模块,设置为获取搜索内容编辑控件中的待处理搜索内容;
目标搜索内容确定模块,设置为确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;
目标内容匹配模块,设置为从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
本公开还提供了一种电子设备,所述电子设备包括:
一个或多个处理器;
存储装置,设置为存储一个或多个程序;
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现上述的搜索目标内容的方法。
本公开还提供了一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行上述的搜索目标内容的方法。
附图说明
图1为本公开实施例一所提供的一种搜索目标内容的方法的流程示意图;
图2为本公开实施例二所提供的一种搜索目标内容的方法的流程示意图;
图3为本公开实施例二所提供的一种目标内容与时间轴上标记对应显示的示意图;
图4为本公开实施例二所提供的一种触发目标内容后,时间轴上对应标记突出显示的示意图;
图5为本公开实施例三所提供的一种搜索目标内容的装置的结构示意图;
图6为本公开实施例四所提供的一种电子设备的结构示意图。
具体实施方式
下面将参照附图描述本公开的实施例。虽然附图中显示了本公开的一些实施例,然而本公开可以通过多种形式来实现,而且不应该被解释为限于这里阐述的实施例,提供这些实施例是为了理解本公开。
本公开的方法实施方式中记载的多个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。
本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。
本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,除非在上下文另有指出,否则应该理解为“一个或多个”。
实施例一
图1为本公开实施例一所提供的一种搜索目标内容的方法的流程示意图,本公开实施例适用于从字幕信息中搜索出与目标搜索内容相匹配内容的情形,该方法可以由搜索目标内容的装置来执行,该装置可以通过软件和/或硬件的形式实现。
如图1,本实施例的方法包括:
S110、获取搜索内容编辑控件中的待处理搜索内容。
搜索内容编辑控件可以是显示在目标页面上,用于编辑搜索内容的控件。目标页面上还可以包括字幕信息。在这里,字幕信息可以包括待搜索的文本,并且,本实施例对字幕信息的生成方式不做限定。服务器可以获取搜索内容编辑控件中编辑的搜索内容,并将获取到的搜索内容作为待处理搜索内容,可选的,搜索内容编辑控件中编辑的搜索内容为“查找”,服务器获取到的待处理搜索内容为:查找。
在本实施例中,所述获取搜索内容编辑控件中的待处理搜索内容,包括:若检测到启动搜索的控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容;或,若检测到搜索内容编辑控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容。
示例性的,获取待处理搜索内容可以采用如下至少两种方式来实现。目标页面上可以包括搜索内容编辑控件和启动搜索的控件。可选的,启动搜索的控件可以是“确认”搜索的控件。用户可以在搜索内容编辑控件中编辑相应的内容,在内容编辑完成后,用户可以触发启动搜索的控件,即点击“确认”控件,服务器可以获取搜索内容编辑控件中的待处理搜索内容。或者,也可以是:在检测到用户触发搜索内容编辑控件时,开始获取搜索内容编辑控件中编辑的搜索内容,在预设时长内,可选的,在30S内,若未检测到用户编辑新的搜索内容,则将获取到的搜索内容作为待处理搜索内容。
S120、确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容。
关联待搜索内容是基于待处理搜索内容来确定的。关联待搜索内容可以是基于待处理搜索内容对应的至少一种转换形式得到的内容,如,待处理搜索内容为:一二三,关联待搜索内容可以是基于“一二三”的至少一种变换形式得到的内容,可选的,关联待搜索内容可以是“123”、“壹贰叁”等。可以将至少一个关 联待搜索内容以及待处理搜索内容均作为目标搜索内容。
在获取到待处理搜索内容后,可以确定与待处理搜索内容相关联的至少一个关联待搜索内容,并将至少一个关联待搜索内容以及待处理搜索内容均作为目标搜索内容。
S130、从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
从字幕信息中获取与至少一个关联待搜索内容以及待处理搜索内容相同的内容,并将获取到的内容作为目标内容。目标搜索内容可包括多个,从字幕信息搜索得到的、与任一目标搜索内容相同的内容即为目标内容。
示例性的,待处理搜索内容为“123”,目标搜索内容中的至少一个关联待搜索内容可以是“一二三”、“壹贰叁”。可以将“123”、“一二三”、“壹贰叁”三者中的每一个字符串作为一个整体,从字幕信息中筛选出与上述三者中任意一个完全一致的内容,即将字幕信息中全部的“123”、“一二三”、“壹贰叁”均作为目标内容。
可选的,从字幕信息中匹配到与所述目标搜索内容相同的内容,并将匹配到的内容作为目标内容。采用此种方式的好处在于,可以从字幕信息中搜索到与待处理搜索内容相关联的关联内容,提高了确定的目标内容的全面性。
本公开实施例的技术方案,在获取到待处理搜索内容时,确定与待处理搜索内容相关联的关联待搜索内容,即优化了搜索条件,在基于优化后的搜索条件从字幕信息中查找相应的内容时,提高了确定的目标内容的全面性以及准确性。
在上述技术方案的基础上,在得到目标内容后,可以将目标内容在所述字幕信息中区别显示。
当从字幕信息中确定出目标内容时,目标内容本身也是字幕信息中的一个或多个元素,在显示时,可以与其他元素区别显示,从而突出筛选后的目标内容,让用户可以更直观、便捷地发现目标内容。区别显示可以是以颜色、字体、背景图案等显示格式来区别显示。
在上述方案的基础上,在获取搜索内容编辑控件中的待处理搜索内容之前,还需要生成字幕信息,在本实施例中,生成字幕信息可以是:基于多媒体数据流,采集语音信息;对所采集的语音信息进行语音识别,得到字幕信息。
若语音信息表征了不同的语种类型,在对所采集的语音信息进行语音识别进而得到字幕信息时,可以根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的字幕信息。
多媒体数据流可以是与实时互动界面对应的视频流数据,或者是对实时互动界面进行录屏后得到的录屏视频中的视频流数据。实时互动界面为实时互动应用场景中的任意交互界面。实时互动应用场景可通过互联网和计算机手段实现,例如,通过原生程序或全球广域网(World Wide Web,web)程序等实现的交互应用程序。
实时互动或者录屏视频中的用户可以包括多个,每个用户发言时所使用的语种类型可以相同也可以不同,当其它发言用户所用的语种类型与本用户所使用的语种差异较大时,可能存在本用户无法了解其他发言用户的发言信息的情况。
为了解决这一问题,可以采集发言用户的语音信息,并将其转换为相应的字幕信息。为了提高阅读的便捷性,用户可以触发目标页面上的语种类型选择控件并选择翻译语种类型,以将其它发言用户的语音信息翻译为所选择的翻译语种类型对应的字幕信息。
原始语种类型指的是参与实时互动的用户在发言时所使用的语种类型。目标翻译语种类型为用户在目标页面上设置的用于显示字幕信息的语种类型。字幕信息为与语音信息相对应的译文数据。为了便于用户直观地从字幕信息中确定每条译文数据对应的发言用户以及发言时间,字幕信息中可以显示每条译文数据的发言用户身份标识以及发言时间戳。
可以从与互动界面相对应的多媒体数据流中,采集多个参与互动的用户的语音数据,即语音信息,并记录语音信息所对应的原始语种类型。可以将语音信息从原始语种类型翻译为目标翻译语种类型,得到与语音信息对应的译文数据。将译文数据、译文数据对应的发言用户身份标识以及发言时间戳作为展示在目标页面上的字幕信息。
在本实施例中,确定目标翻译语种类型的方式可以包括如下至少一种:获取目标客户端上预先设置的语种类型作为目标翻译语种类型;获取所述目标客户端的登录地址,基于所述登录地址确定与所述目标客户端所在地理位置对应的目标语翻译种类型。
也就是说,确定目标翻译语种类型的方式可以包括至少两种。第一种方式可以是:在检测到用户触发目标页面上的语种类型选择控件时,即检测到用户选择字幕信息以哪种语种类型显示时,可以确定用户设置的语种类型,并将该设置的语种类型作为目标翻译语种类型。示例性的,在用户触发语种类型选择控件时,目标页面上可以弹出语种类型选择列表以供用户选择。用户可以选择任意一种语种类型,如,用户触发了语种类型选择列表中的中文语种类型并点击了确认按键,服务端或客户端可以确定目标翻译语种类型为中文语种类型。 也就是说,可以将多媒体数据流中的语音信息转换为中文字幕信息,并将其展示在目标界面上。第二种方式可以是:在检测到用户触发语种类型选择控件时,可以获取该用户的客户端的登录地址,即客户端的互联网协议(Internet Protocol,IP)地址,以根据登录地址确定客户端所属的区域,进而将所属区域所使用的语种类型作为目标翻译语种类型。例如,在用户触发语种类型选择控件时,获取该用户的客户端的登录地址,若基于登录地址确定客户端所属的区域为中国,则目标翻译语种类型为中文语种类型。
在本实施例中,通过将多媒体数据流中的语音信息转换为目标翻译语种类型的字幕信息,使字幕信息更符合用户的阅读习惯,以便于用户能够快速理解多媒体数据流所对应的内容,从而提高交互的效率。
在得到所述字幕信息之后,还可以建立所述字幕信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述字幕信息和所述多媒体数据流显示在目标页面上,以在检测到一目标内容被触发时,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻。
时间戳同步关联关系可以理解为多媒体数据流和字幕信息是基于时间同步联动的。当字幕信息中的一条译文数据被触发时,可以确定该条译文数据所对应的当前时间戳,基于预先建立的时间戳同步关联关系,跳转到与当前时间戳所对应的多媒体数据流,例如,多媒体数据流是基于录屏视频获取到的,则跳转到录屏视频中与当前时间戳相对应的音视频帧。其次,拖动录屏视频的进度条到一个音视频帧时,可以获取该音视频帧所对应的当前时间戳,基于预先建立的时间戳同步关联关系,可以确定该音视频帧在字幕信息中所对应的译文数据,为了便于用户确认,可以将该译文数据区别显示,可选的,高亮显示。
可选的,若检测到一目标内容被触发,确定所述一目标内容的当前时间戳;基于预先建立的时间戳的同步关联关系以及所述当前时间戳,将多媒体数据流跳转至所述当前时间戳对应的播放位置。
在检测到用户触发目标内容时,可以获取目标内容的当前时间戳,基于预先建立的时间戳同步关联关系,将多媒体数据流跳转到与当前时间戳所对应的播放位置,以便于用户了解发言用户在发表包括目标内容的语音信息时的语气和状态,进而提高交互的效率。
本实施例技术方案,通过建立多媒体数据流和字幕信息之间的时间戳同步关联关系,实现了字幕信息和多媒体数据流的同步联动,从而便于用户快速查找到相应字幕信息在多媒体数据流中的位置,进而便于结合前后语境了解发言用户的语音信息,提高了信息交互的效率。
实施例二
图2为本公开实施例二所提供的一种搜索目标内容的方法的流程示意图。在确定待处理搜索内容的关联待搜索内容时,可以获取待处理搜索内容的内容类型,进而基于内容类型来确定相应的关联待搜索内容。其中,与上述实施例相同或者相应的技术术语在本实施例不再赘述。
如图2所示,所述方法包括:
S210、获取搜索内容编辑控件中的待处理搜索内容。
S220、确定所述待处理搜索内容的内容类型。
在本实施例中,内容类型可以包括数字类型和外文类型。由此,可以确定待处理搜索内容的内容类型是上述类型中的哪一种。
由于内容类型不同,确定与待处理搜索内容相对应的关联待搜索内容的方式也不同,因此在确定关联待搜索内容之前,可以先确定待处理搜索内容的内容类型。
S230、根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
关联待搜索内容是基于待处理搜索内容以及待处理搜索内容的内容类型确定出来的。
可选的,内容类型包括数字类型,根据内容类型以及与内容类型对应的待转换内容,确定与待处理搜索内容相对应的至少一个关联待搜索内容,包括:
获取待处理搜索内容中与数字类型对应的待转换内容;确定与待转换内容相对应的至少一种转换形式,并基于至少一种转换形式确定与待转换内容相对应的待替换内容;基于待替换内容与待处理搜索内容,确定与待处理搜索内容相对应的至少一个关联待搜索内容。
待处理搜索内容中可以仅包括数字,也可以是既包括数字也包括其它内容。
若待处理搜索内容中包括数字,则与待处理搜索内容对应的内容类型可以是数字类型。相应的,待处理搜索内容中的数字可以是待处理搜索内容的待转换内容。转换形式可以理解为待转换内容的至少一种变形形式,可选的,变形形式可以是将数字转换为相应的汉字,将数字转换为相应的大小写,将数字转换为相应的英文等。转换形式可以是预先设置的,例如,将待转换内容转换为英文、日文和/或法文等。基于待转换内容的至少一种转换形式确定待替换内容。根据每个待替换内容以及待转换内容,生成与待处理搜索内容相对应的关联待 搜索内容。若待处理搜索内容中仅包括数字,则待转换内容与待处理搜索内容相同。
采用上述方式,可以确定与数字类型相对应的至少一种转换形式,进而得到关联待搜索内容,在基于关联待搜索内容从字幕信息中查找内容时,提高了查找的目标内容的全面性。
关联待搜索内容的数量是由预先设置的转换形式或者可转换形式来确定的,可选的,待处理搜索内容中仅包括数字,预设转换形式为5种,则得到的待替换内容为5个,相应的,关联待搜索内容的数量由5个待替换内容和1个待转换内容组成。
可选的,所述内容类型包括外文类型。在这里,“外文类型”的内容可以是以与预先设置的语种类型不同的语种表达的内容,例如,外文类型可以是至少一种与用户的客户端中当前设置的语种类型不同的预设语种类型;或者,外文类型也可以是与语音信息的语种类型不同的其它语种类型。所述根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:获取待处理搜索内容中与所述外文类型对应的待转换内容;确定与所述待转换内容相对应的待替换内容,所述待替换内容包括与所述待转换内容对应的词根和/或延伸词;基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
待处理搜索内容中可以仅包括外文,也可以是既包括外文也包括其它内容,可选的,其它内容可以是汉字、数字、符号等。
外文可以是英文、日文、法文等多种预设语种。若待处理搜索内容中包括外文,则与待处理搜索内容对应的内容类型可以是外文类型。相应的,待处理搜索内容中的外文可以是待处理搜索内容中的待转换内容。由于外文词汇在不同的语境中存在主动、被动、时态以及所有格的情况,因此在检测到待处理搜索内容中包括外文类型的搜索内容时,为了便于从字幕信息中查找到相应的内容,可以确定该词汇所对应的至少一种转换形式,可选的,该词汇的不同时态,主被动、词根或者衍生词等,将待转换内容基于至少一种转换形式变换后得到的内容作为待替换内容。根据待替换内容以及待转换内容,生成与待处理搜索内容相对应的关联待搜索内容。若待处理搜索内容中仅包括外文,则待转换内容与待处理搜索内容相同。
采用上述方式可以快速找到与待处理搜索内容关联的关联待搜索内容,进而基于关联待搜索内容从字幕信息中查找到相应的目标内容,提高了从字幕信息中确定的目标内容的全面性。
在本实施例中,所述基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到与所述待处理搜索内容相对应的至少一个关联待搜索内容。
若待处理搜索内容中仅包括数字,则可以将与数字相对应的待替换内容直接作为关联待搜索内容;相应的,若待处理搜索内容中仅包括外文,则可以将与外文相对应的待替换内容直接作为关联待搜索内容。
若待处理搜索内容中不仅包括数字,还包括除数字之外的其它内容,则在确定至少一个待替换内容后,可以将待处理搜索内容中的待转换内容替换为一个待替换内容,将替换后的待处理搜索内容作为一个关联待搜索内容。若待替换内容的数量有5个,则可以依次将待转换内容替换为待替换内容进而得到关联待搜索内容,即可以得到5个关联待搜索内容。
对于内容类型为外文类型来说,若待处理搜索内容中不仅包括外文,还包括除外文之外的其它内容,则在确定至少一个待替换内容后,可以待处理搜索内容中的待转换内容替换为一个待替换内容,将替换后的待处理搜索内容作为关联待搜索内容中的一个。若待替换内容的数量有5个,则可以依次将待转换内容替换为待替换内容进而得到关联待搜索内容,即可以得到5个关联待搜索内容。
若待处理搜索内容中既包括外文、数字还包括其它内容,则可以分别确定与外文类型相对应的待替换内容,与数字类型相对应的待替换内容,并将得到的待替换内容替换在待处理搜索内容中的相应位置处,最终得到多个关联待搜索内容。
S240、将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容。
S250、从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
本公开实施例的技术方案,通过获取待处理搜索内容中的内容类型,并基于内容类型确定相应的关联待搜索内容,进而基于关联待搜索内容确定目标搜索内容,提高了搜索条件的丰富程度,在基于目标搜索内容搜索时,提高了查找到的目标内容的全面性。
在上述技术方案的基础上,所述方法还包括:确定每个目标内容在多媒体数据流的时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置进行标记。
时间轴是与多媒体数据流所对应的时间轴,可选的,多媒体数据流所对应 的总时长为50min,与多媒体数据流所对应的时间轴也为50min。
在确定目标内容后,可以根据目标内容所属的句子,确定目标内容所对应的时间戳。在确定时间戳后,可以确定时间戳在时间轴上的位置,并在该位置处进行标记,例如,可以在时间轴的位置下方用圆点标记,或者用三角标记,参见图3。
示例性的,参见图3,用户在搜索内容编辑控件编辑的搜索内容为“算法”,可以从字幕信息中搜索出与“算法”相同的目标内容并区别显示,如高亮显示,并确定目标内容所属句子的时间戳,基于时间戳在与多媒体数据流相对应的时间轴上进行标记,如,用圆点标记。其中,标记的颜色、大小等用户可以根据实际需求进行设置,在此不再限定。
在搜索内容编辑控件中,可以显示目标内容的数量,例如,搜索内容编辑控件中显示的总数量为12,参见图3。
在本实施例中,在时间轴上标记与目标内容相对应的音视频帧的好处在于,可以使用户根据时间轴上的标记,清楚地确定目标内容在多媒体数据流中的位置,从而提高查找相应目标内容的便捷性。
目标内容的数量可以不止一个,相应的,时间轴上标记的数量也可以不止一个,参见图3,目标内容的数量为12个,时间轴上的标记也为12个。为了便于用户确定当前触发的目标内容为所有目标内容中的第几个,搜索内容编辑控件中还显示当前触发的目标内容所对应的顺序。
在上述技术方案的基础上,所述方法还包括:当检测到目标内容被触发时,确定与所述目标内容相对应的目标时间戳;将与所述目标时间戳所对应的标记进行区别显示。
用户可以触发任意一个目标内容,在检测到用户触发目标内容时,可以确定与用户触发的目标内容所对应的时间戳(目标时间戳),可以确定目标时间戳在时间轴上所对应的目标标记,并将目标标记与时间轴上的其他标记区别显示,以凸显目标标记。例如,将目标标记和其他标记以不同的颜色区别显示。
示例性的,参见图4,当用户触发标记1对应的目标内容时,可以确定标记1对应的目标内容所对应的目标时间戳,根据目标时间戳可以确定其在时间轴上所对应的标记为标记2所对应的标记,可以将该标记突出显示。
在本实施例中,当目标内容被触发时,在时间轴上将与目标内容相对应的标记区别显示的好处在于,可以使用户了解触发的目标内容在多媒体数据流中的位置,提高了用户确定的目标内容所对应的音视频帧的准确性。
实施例三
图5为本公开实施例三所提供的一种搜索目标内容的装置的结构示意图。如图5所示,所述装置包括:搜索内容获取模块310、目标搜索内容确定模块320以及目标内容匹配模块330。
搜索内容获取模块310,设置为获取搜索内容编辑控件中的待处理搜索内容;目标搜索内容确定模块320,设置为确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容生成均作为目标搜索内容;目标内容匹配模块330,设置为从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
本公开实施例的技术方案,在获取到待处理搜索内容时,确定与待处理搜索内容相关联的关联待搜索内容,进而从字幕信息中筛选出与关联待搜索内容相同的内容,提高了确定的目标内容的全面性以及准确性。
在上述技术方案的基础上,所述搜索内容获取模块310,设置为若检测到启动搜索的控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容;或,若检测到搜索内容编辑控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容。
在上述技术方案的基础上,目标搜索内容确定模块320设置为通过如下方式确定与所述待处理搜索内容相对应的至少一种个关联待搜索内容:确定所述待处理搜索内容的内容类型;根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
在上述技术方案的基础上,所述内容类型包括数字类型,目标搜索内容确定模块320设置为通过如下方式根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容:获取所述待处理搜索内容中与所述数字类型对应的待转换内容;确定与所述待转换内容相对应的至少一种转换形式,并基于所述至少一种转换形式确定与所述待转换内容相对应的待替换内容;基于所述待替换内容与所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
在上述技术方案的基础上,所述内容类型包括外文类型,目标搜索内容确定模块320设置为通过如下方式根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容:获取待处理搜索内容中与所述外文类型对应的待转换内容;确定与所述待转换内容相对应的待替换内容,所述待替换内容包括与所述待转换内容对应的词根和/或延伸词;基于所述待替换内容以及所述待处理搜索内容,确定与所述待处 理搜索内容相对应的至少一个关联待搜索内容。
在上述技术方案的基础上,目标搜索内容确定模块320设置为通过如下方式基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容:将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到与所述待处理搜索内容相对应的至少一个关联待搜索内容。
在上述技术方案的基础上,目标内容匹配模块330,设置为从所述字幕信息中匹配与每个目标搜索内容相同的内容,并将匹配到的内容作为目标内容。
在上述技术方案的基础上,将所述目标内容在所述字幕信息中区别显示。
在上述技术方案的基础上,在所述获取搜索内容编辑控件中的待处理搜索内容之前,还包括:基于多媒体数据流,确定语音信息;根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的字幕信息。
在上述技术方案的基础上,得到所述字幕信息之后,还包括:建立所述字幕信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述字幕信息和所述多媒体数据流显示在目标页面上,以在检测到一目标内容被触发时,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻。
在上述技术方案的基础上,所述装置还包括标记模块,设置为确定与每个目标内容相对应的多媒体数据流在时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置进行标记。
在上述技术方案的基础上,所述装置还包括突出显示模块,设置为当检测到一目标内容被触发时,确定与所述一目标内容相对应的目标时间戳;将与所述目标时间戳所对应的目标标记在所述时间轴上进行区别显示。
本公开实施例所提供的搜索目标内容的装置可执行本公开任意实施例所提供的搜索目标内容的方法,具备执行方法相应的功能模块和效果。
上述装置所包括的多个单元和模块只是按照功能逻辑进行划分的,但并不局限于上述的划分,只要能够实现相应的功能即可;另外,功能单元的名称也只是为了便于相互区分,并不用于限制本公开实施例的保护范围。
实施例四
下面参考图6,其示出了适于用来实现本公开实施例的电子设备(例如图6中的终端设备或服务器)400的结构示意图。本公开实施例中的终端设备可以包 括但不限于诸如移动电话、笔记本电脑、数字广播接收器、个人数字助理(Personal Digital Assistant,PDA)、平板电脑(PAD)、便携式多媒体播放器(Portable Media Player,PMP)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字(Television,TV)、台式计算机等等的固定终端。图6示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。
如图6所示,电子设备400可以包括处理装置(例如中央处理器、图形处理器等)401,其可以根据存储在只读存储器(Read-Only Memory,ROM)402中的程序或者从存储装置408加载到随机访问存储器(Random Access Memory,RAM)403中的程序而执行多种适当的动作和处理。在RAM403中,还存储有电子设备400操作所需的多种程序和数据。处理装置401、ROM402以及RAM403通过总线404彼此相连。输入/输出(Input/Output,I/O)接口405也连接至总线404。
通常,以下装置可以连接至I/O接口405:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置406;包括例如液晶显示器(Liquid Crystal Display,LCD)、扬声器、振动器等的输出装置407;包括例如磁带、硬盘等的存储装置408;以及通信装置409。通信装置409可以允许电子设备400与其他设备进行无线或有线通信以交换数据。虽然图6示出了具有多种装置的电子设备400,但是并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。
根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置409从网络上被下载和安装,或者从存储装置408被安装,或者从ROM402被安装。在该计算机程序被处理装置401执行时,执行本公开实施例的方法中限定的上述功能。
本公开实施例提供的电子设备与上述实施例提供的搜索目标内容的方法属于同一构思,未在本实施例中详尽描述的技术细节可参见上述实施例,并且本实施例与上述实施例具有相同的效果。
实施例五
本公开实施例提供了一种计算机存储介质,其上存储有计算机程序,该程 序被处理器执行时实现上述实施例所提供的搜索目标内容的方法。
本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、RAM、ROM、可擦式可编程只读存储器(Erasable Programmable Read-Only Memory,EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。
在一些实施方式中,客户端、服务器可以利用诸如超文本传输协议(HyperText Transfer Protocol,HTTP)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(Local Area Network,LAN),广域网(Wide Area Network,WAN),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:
获取搜索内容编辑控件中的待处理搜索内容;确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的 计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括LAN或WAN—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。
附图中的流程图和框图,图示了按照本公开多种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元/模块的名称在一种情况下并不构成对该单元本身的限定,例如,目标内容确定模块还可以被描述为“内容确定模块”。
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(Field Programmable Gate Array,FPGA)、专用集成电路(Application Specific Integrated Circuit,ASIC)、专用标准产品(Application Specific Standard Parts,ASSP)、片上系统(System on Chip,SOC)、复杂可编程逻辑设备(Complex Programmable Logic Device,CPLD)等等。
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、RAM、ROM、EPROM或快闪存储器、光纤、CD-ROM、光学储存设备、磁储存设备、 或上述内容的任何合适组合。
根据本公开的一个或多个实施例,【示例一】提供了一种搜索目标内容的方法,该方法包括:
获取搜索内容编辑控件中的待处理搜索内容;确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
根据本公开的一个或多个实施例,【示例二】提供了一种搜索目标内容的方法,还包括:
可选的,所述获取搜索内容编辑控件中的待处理搜索内容,包括:
若检测到启动搜索的控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容;或,若检测到搜索内容编辑控件被触发时,获取所述搜索内容编辑控件中编辑的待处理搜索内容。
根据本公开的一个或多个实施例,【示例三】提供了一种搜索目标内容的方法,还包括:
可选的,所述确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
确定所述待处理搜索内容的内容类型;根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
根据本公开的一个或多个实施例,【示例四】提供了一种搜索目标内容的方法,还包括:
可选的,所述内容类型包括数字类型,所述根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
获取所述待处理搜索内容中与所述数字类型对应的待转换内容;确定与所述待转换内容相对应的至少一种转换形式,并基于所述至少一种转换形式确定与所述待转换内容相对应的待替换内容;基于所述待替换内容与所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
根据本公开的一个或多个实施例,【示例五】提供了一种搜索目标内容的方法,还包括:
可选的,所述内容类型包括预设语种类型,所述根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
获取所述待处理搜索内容中与所述预设语种类型对应的待转换内容;确定与所述待转换内容相对应的待替换内容,所述待替换内容包括与所述待转换内容对应的词根和/或延伸词;基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
根据本公开的一个或多个实施例,【示例六】提供了一种搜索目标内容的方法,还包括:
可选的,所述基于所述待替换内容与所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到与所述待处理搜索内容相对应的至少一个关联待搜索内容。
根据本公开的一个或多个实施例,【示例七】提供了一种搜索目标内容的方法,还包括:
可选的,所述从字幕信息中搜索与每个目标搜索内容相匹配的目标内容,包括:
从所述字幕信息中匹配与每个目标搜索内容相同的内容,并将匹配到的内容作为目标内容。
根据本公开的一个或多个实施例,【示例八】提供了一种搜索目标内容的方法,还包括:
可选的,将所述目标内容在所述字幕信息中区别显示。
根据本公开的一个或多个实施例,【示例九】提供了一种搜索目标内容的方法,还包括:
可选的,在所述获取搜索内容编辑控件中的待处理搜索内容之前,还包括:
基于多媒体数据流,确定语音信息;根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的字幕信息。
根据本公开的一个或多个实施例,【示例十】提供了一种搜索目标内容的方法,还包括:
可选的,在得到所述字幕信息之后,还包括:
建立所述字幕信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述字幕信息和所述多媒体数据流显示在目标页面上,以在检测到一目标内容被触发时,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻。
根据本公开的一个或多个实施例,【示例十一】提供了一种搜索目标内容的方法,还包括:
所述在检测到一目标内容被触发时,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻,包括
可选的,若检测到一目标内容被触发,确定所述一目标内容的当前时间戳;基于预先建立的所述时间戳的同步关联关系以及所述当前时间戳,将所述多媒体数据流跳转至所述当前时间戳对应的视频帧。
根据本公开的一个或多个实施例,【示例十二】提供了一种搜索目标内容的方法,还包括:
可选的,确定每个目标内容在多媒体数据流的时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置进行标记。
根据本公开的一个或多个实施例,【示例十三】提供了一种搜索目标内容的方法,还包括:
可选的,当检测到一目标内容被触发时,确定与所述一目标内容相对应的目标时间戳;将与所述目标时间戳所对应的目标标记在所述时间轴上进行区别显示。
根据本公开的一个或多个实施例,【示例十四】提供了一种搜索目标内容的装置,包括:
搜索内容获取模块,设置为获取搜索内容编辑控件中的待处理搜索内容;目标搜索内容确定模块,设置为确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;目标内容匹配模块,设置为从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
此外,虽然采用特定次序描绘了多个操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了多个实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的一些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的多种特征也可以单独地或以任何合适的子组合的方式实现在多个 实施例中。

Claims (16)

  1. 一种搜索目标内容的方法,包括:
    获取搜索内容编辑控件中的待处理搜索内容;
    确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;
    从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
  2. 根据权利要求1所述的方法,其中,所述获取搜索内容编辑控件中的待处理搜索内容,包括:
    在检测到启动搜索的控件被触发的情况下,获取所述搜索内容编辑控件中编辑的待处理搜索内容;或,
    在检测到搜索内容编辑控件被触发的情况下,获取所述搜索内容编辑控件中编辑的待处理搜索内容。
  3. 根据权利要求1所述的方法,其中,所述确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
    确定所述待处理搜索内容的内容类型;
    根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
  4. 根据权利要求3所述的方法,其中,所述内容类型包括数字类型,所述根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
    获取所述待处理搜索内容中与所述数字类型对应的待转换内容;
    确定与所述待转换内容相对应的至少一种转换形式,并基于所述至少一种转换形式确定与所述待转换内容相对应的待替换内容;
    基于所述待替换内容与所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
  5. 根据权利要求3所述的方法,其中,所述内容类型包括预设语种类型,所述根据所述内容类型以及与所述内容类型对应的待转换内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
    获取所述待处理搜索内容中与所述预设语种类型对应的待转换内容;
    确定与所述待转换内容相对应的待替换内容,其中,所述待替换内容包括与所述待转换内容对应的词根和延伸词中的至少之一;
    基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容。
  6. 根据权利要求4或5所述的方法,其中,所述基于所述待替换内容以及所述待处理搜索内容,确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,包括:
    将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到与所述待处理搜索内容相对应的至少一个关联待搜索内容。
  7. 根据权利要求1所述的方法,其中,所述从字幕信息中搜索与每个目标搜索内容相匹配的目标内容,包括:
    从所述字幕信息中匹配与每个目标搜索内容相同的内容,并将匹配到的内容作为目标内容。
  8. 根据权利要求1所述的方法,还包括:
    将所述目标内容在所述字幕信息中区别显示。
  9. 根据权利要求1所述的方法,其中,在所述获取搜索内容编辑控件中的待处理搜索内容之前,还包括:
    基于多媒体数据流,确定语音信息;
    根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的字幕信息。
  10. 根据权利要求9所述的方法,其中,在得到所述字幕信息之后,还包括:
    建立所述字幕信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述字幕信息和所述多媒体数据流显示在所述目标页面上,以在检测到一目标内容被触发的情况下,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻。
  11. 根据权利要求10所述的方法,其中,所述在检测到一目标内容被触发的情况下,将所述多媒体数据流跳转到与所述一目标内容所对应的视频播放时刻,包括:
    在检测到一目标内容被触发的情况下,确定所述一目标内容的当前时间戳;
    基于预先建立的所述时间戳的同步关联关系以及所述当前时间戳,将所述多媒体数据流跳转至所述当前时间戳对应的视频播放时刻。
  12. 根据权利要求1所述的方法,还包括:
    确定每个目标内容在多媒体数据流的时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置进行标记。
  13. 根据权利要求12所述的方法,还包括:
    在检测到一目标内容被触发的情况下,确定与所述一目标内容相对应的目标时间戳;
    将与所述目标时间戳所对应的目标标记在所述时间轴上进行区别显示。
  14. 一种搜索目标内容的装置,包括:
    搜索内容获取模块,设置为获取搜索内容编辑控件中的待处理搜索内容;
    目标搜索内容确定模块,设置为确定与所述待处理搜索内容相对应的至少一个关联待搜索内容,将所述至少一个关联待搜索内容以及所述待处理搜索内容均作为目标搜索内容;
    目标内容匹配模块,设置为从字幕信息中搜索与每个目标搜索内容相匹配的目标内容。
  15. 一种电子设备,包括:
    至少一个处理器;
    存储装置,设置为存储至少一个程序;
    当所述至少一个程序被所述至少一个处理器执行,使得所述至少一个处理器实现如权利要求1-13中任一项所述的搜索目标内容的方法。
  16. 一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行如权利要求1-13中任一项所述的搜索目标内容的方法。
PCT/CN2021/115261 2020-09-29 2021-08-30 搜索目标内容的方法、装置、电子设备及存储介质 WO2022068494A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2023507572A JP2023536330A (ja) 2020-09-29 2021-08-30 ターゲットコンテンツの検索方法、装置、電子機器及び記憶媒体

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011052041.7 2020-09-29
CN202011052041.7A CN112163103A (zh) 2020-09-29 2020-09-29 搜索目标内容的方法、装置、电子设备及存储介质

Publications (1)

Publication Number Publication Date
WO2022068494A1 true WO2022068494A1 (zh) 2022-04-07

Family

ID=73861517

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/115261 WO2022068494A1 (zh) 2020-09-29 2021-08-30 搜索目标内容的方法、装置、电子设备及存储介质

Country Status (3)

Country Link
JP (1) JP2023536330A (zh)
CN (1) CN112163103A (zh)
WO (1) WO2022068494A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112163103A (zh) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 搜索目标内容的方法、装置、电子设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201421609Y (zh) * 2009-02-23 2010-03-10 未序网络科技(上海)有限公司 基于文字异形体信息的搜索引擎系统
US20140147816A1 (en) * 2012-11-26 2014-05-29 ISSLA Enterprises, LLC Intralingual supertitling in language acquisition
CN109246472A (zh) * 2018-08-01 2019-01-18 平安科技(深圳)有限公司 视频播放方法、装置、终端设备及存储介质
CN110753269A (zh) * 2018-07-24 2020-02-04 Tcl集团股份有限公司 视频摘要生成方法、智能终端及存储介质
CN112163103A (zh) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 搜索目标内容的方法、装置、电子设备及存储介质

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838751A (zh) * 2012-11-23 2014-06-04 鸿富锦精密工业(深圳)有限公司 视频内容搜索系统及方法
US10331724B2 (en) * 2012-12-19 2019-06-25 Oath Inc. Method and system for storytelling on a computing device via multiple sources
CN107071554B (zh) * 2017-01-16 2019-02-26 腾讯科技(深圳)有限公司 语义识别方法和装置
CN107992545A (zh) * 2017-11-27 2018-05-04 珠海市魅族科技有限公司 一种搜索方法、装置、终端及可读存储介质
CN109033256A (zh) * 2018-07-06 2018-12-18 北京微播视界科技有限公司 一种搜索方法、装置、终端设备及存储介质
CN110225387A (zh) * 2019-05-20 2019-09-10 北京奇艺世纪科技有限公司 一种信息搜索方法、装置及电子设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201421609Y (zh) * 2009-02-23 2010-03-10 未序网络科技(上海)有限公司 基于文字异形体信息的搜索引擎系统
US20140147816A1 (en) * 2012-11-26 2014-05-29 ISSLA Enterprises, LLC Intralingual supertitling in language acquisition
CN110753269A (zh) * 2018-07-24 2020-02-04 Tcl集团股份有限公司 视频摘要生成方法、智能终端及存储介质
CN109246472A (zh) * 2018-08-01 2019-01-18 平安科技(深圳)有限公司 视频播放方法、装置、终端设备及存储介质
CN112163103A (zh) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 搜索目标内容的方法、装置、电子设备及存储介质

Also Published As

Publication number Publication date
CN112163103A (zh) 2021-01-01
JP2023536330A (ja) 2023-08-24

Similar Documents

Publication Publication Date Title
US11917344B2 (en) Interactive information processing method, device and medium
WO2022042593A1 (zh) 字幕编辑方法、装置和电子设备
WO2022242351A1 (zh) 一种多媒体处理方法、装置、设备及介质
JP7551773B2 (ja) インタラクション記録生成方法、装置、デバイス及び媒体
WO2022105760A1 (zh) 一种多媒体浏览方法、装置、设备及介质
WO2022105710A1 (zh) 一种会议纪要的交互方法、装置、设备及介质
CN112163102B (zh) 搜索内容匹配方法、装置、电子设备及存储介质
WO2023083142A1 (zh) 分句方法、装置、存储介质及电子设备
WO2022105709A1 (zh) 多媒体的交互方法、信息交互方法、装置、设备及介质
WO2021259221A1 (zh) 视频翻译方法和装置、存储介质和电子设备
US20230139416A1 (en) Search content matching method, and electronic device and storage medium
WO2023142913A1 (zh) 视频处理方法、装置、可读介质及电子设备
WO2022160603A1 (zh) 歌曲的推荐方法、装置、电子设备及存储介质
CN112380365A (zh) 一种多媒体的字幕交互方法、装置、设备及介质
WO2022068494A1 (zh) 搜索目标内容的方法、装置、电子设备及存储介质
CN112163433B (zh) 关键词汇的匹配方法、装置、电子设备及存储介质
WO2022068496A1 (zh) 搜索目标内容的方法、装置、电子设备及存储介质
WO2022093111A1 (zh) 基于用户交互的音乐播放方法、装置、设备及存储介质
US20240103802A1 (en) Method, apparatus, device and medium for multimedia processing
JP6506427B1 (ja) 情報処理装置、動画検索方法、生成方法及びプログラム
CN113132789B (zh) 一种多媒体的交互方法、装置、设备及介质
US20230140442A1 (en) Method for searching target content, and electronic device and storage medium
US20230135783A1 (en) Target content search method, electronic device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21874155

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2023507572

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21874155

Country of ref document: EP

Kind code of ref document: A1