WO2022068496A1 - Method, apparatus, electronic device, and storage medium for searching for target content - Google Patents

Method, apparatus, electronic device, and storage medium for searching for target content

Info

Publication number
WO2022068496A1
Authority
WO
WIPO (PCT)
Prior art keywords
content
target
search
processed
identifier
Prior art date
Application number
PCT/CN2021/115283
Other languages
English (en)
French (fr)
Inventor
陈可蓉
钱程
熊梦园
杨晶生
Original Assignee
北京字跳网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字跳网络技术有限公司
Priority to EP21874157.7A (EP4206953A4)
Priority to JP2023507867A (JP2023536992A)
Publication of WO2022068496A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/483 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/489 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using time information
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Definitions

  • the present disclosure relates to the field of computer technology, for example, to a method, apparatus, electronic device, and storage medium for searching for target content.
  • the target content searched in the above manner is exactly the same as the search conditions, and the associated content associated with the search conditions cannot be obtained, resulting in incomplete target content and poor user experience.
  • the present disclosure provides a method, an apparatus, an electronic device and a storage medium for searching for target content, which optimize the search conditions so that corresponding content is searched out from text based on the optimized search conditions, improving the comprehensiveness and richness of the target content.
  • the present disclosure provides a method for searching for target content, the method comprising:
  • determining a target search identifier in to-be-processed search content, and determining a target search strategy corresponding to the target search identifier;
  • generating target search content corresponding to the to-be-processed search content according to the target search identifier, the target search strategy and the to-be-processed search content; and
  • searching out the target content corresponding to the target search content from text information.
  • the present disclosure also provides an apparatus for searching for target content, the apparatus comprising:
  • a target search strategy determination module configured to determine a target search identifier in the search content to be processed, and to determine a target search strategy corresponding to the target search identifier;
  • a target search content determination module configured to generate target search content corresponding to the to-be-processed search content according to the target search identifier, the target search strategy and the to-be-processed search content;
  • the target content determination module is configured to search out the target content corresponding to the target search content from the text information.
  • the present disclosure also provides an electronic device, the electronic device comprising:
  • one or more processors; and
  • a storage apparatus configured to store one or more programs;
  • when the one or more programs are executed by the one or more processors, the one or more processors implement the above-mentioned method for searching for target content.
  • the present disclosure also provides a storage medium containing computer-executable instructions, which, when executed by a computer processor, are used to perform the above-described method of searching for target content.
  • FIG. 1 is a schematic flowchart of a method for searching for target content according to Embodiment 1 of the present disclosure
  • FIG. 2 is a schematic flowchart of a method for searching for target content according to Embodiment 2 of the present disclosure
  • FIG. 3 is a schematic diagram of corresponding display of target content and a marker on a time axis according to Embodiment 2 of the present disclosure
  • FIG. 4 is a schematic diagram of highlighting a corresponding mark on a time axis after triggering target content according to Embodiment 2 of the present disclosure
  • FIG. 5 is a schematic structural diagram of an apparatus for searching for target content according to Embodiment 3 of the present disclosure
  • FIG. 6 is a schematic structural diagram of an electronic device according to Embodiment 4 of the present disclosure.
  • method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
  • the term "including" and variations thereof are open-ended inclusions, i.e., "including but not limited to".
  • the term "based on" means "based at least in part on".
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • Embodiment 1: FIG. 1 is a schematic flowchart of a method for searching for target content according to Embodiment 1 of the present disclosure.
  • the embodiment of the present disclosure is suitable for determining target search content corresponding to the content to be processed according to a search identifier in the search content to be processed, and then from
  • the method may be performed by a device for searching the target content, and the device may be implemented in the form of software and/or hardware.
  • the method of this embodiment includes:
  • the to-be-processed search content is content obtained from the target page. The target page includes a search content editing control. The user can edit content in the search content editing control, and the edited content is used as the to-be-processed search content.
  • the target page may further include text information generated based on speech information of different language types or the same language type.
  • the to-be-processed search content may include various search identifiers, for example, "," and so on.
  • the search identifiers can be preset. When it is detected that the to-be-processed search content includes a preset search identifier, that search identifier can be used as the target search identifier.
  • when the to-be-processed search content is acquired, whether it includes a search identifier may be determined according to the preset search identifiers. If it does, the search identifier in the to-be-processed search content may be used as the target search identifier. After the target search identifier is determined, the search strategy corresponding to it may be determined, so that the target search content corresponding to the to-be-processed search content can be determined based on that search strategy.
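The detection step described above can be sketched as a scan of the to-be-processed search content against a preset set of identifiers. This is a minimal illustration only: the set `PRESET_SEARCH_IDENTIFIERS` and the function name are assumptions made here, and the disclosure does not fix which symbols are preset.

```python
from typing import List, Tuple

# Hypothetical preset identifiers; the actual preset symbols are not
# specified by the disclosure.
PRESET_SEARCH_IDENTIFIERS = {"'", ",", "-"}


def find_target_identifiers(pending_query: str) -> List[Tuple[int, str]]:
    """Return (position, symbol) for every preset identifier in the query."""
    return [
        (pos, ch)
        for pos, ch in enumerate(pending_query)
        if ch in PRESET_SEARCH_IDENTIFIERS
    ]
```

An empty result means the query contains no preset identifier, in which case it could simply be searched as typed.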
  • acquiring the to-be-processed search content edited in the search content editing control includes: if it is detected that the search-initiating control is triggered, acquiring the to-be-processed search content edited in the search content editing control.
  • the target page may include a search content editing control and a search-initiating control.
  • the search-initiating control may be a "confirm search" control. The user can edit the corresponding content in the search content editing control. After the content editing is completed, the user can trigger the control to start the search, that is, click the "confirm search" control, and the server can obtain the pending search content in the search content editing control.
  • alternatively, the target page includes a search content editing control, and the to-be-processed search content edited in it is acquired as follows: when it is detected that the user triggers the search content editing control, the search content edited in the control starts to be acquired; if no new edit by the user is detected within a preset duration, for example 30 seconds, the acquired search content is used as the to-be-processed search content.
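The idle-timeout acquisition described above can be sketched as follows. The class and method names are hypothetical, and the clock is passed in explicitly so that the logic stays deterministic; a real client would feed it timestamps from its own event loop.

```python
from typing import Optional


class IdleQueryCollector:
    """Treats the edited text as final once no edit arrives for idle_seconds."""

    def __init__(self, idle_seconds: float = 30.0) -> None:
        self.idle_seconds = idle_seconds
        self.text = ""
        self.last_edit_at: Optional[float] = None

    def on_edit(self, text: str, now: float) -> None:
        # Every edit replaces the buffered text and restarts the idle window.
        self.text = text
        self.last_edit_at = now

    def poll(self, now: float) -> Optional[str]:
        # Return the to-be-processed search content once the user has been
        # idle for the preset duration; otherwise return None.
        if self.last_edit_at is None:
            return None
        if now - self.last_edit_at >= self.idle_seconds:
            return self.text
        return None
```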
  • S120 Generate target search content corresponding to the to-be-processed search content according to the target search identifier, the target search strategy, and the to-be-processed search content.
  • the target search strategy corresponds to the target search identifier, and the target search content corresponding to the to-be-processed search content can be determined according to the corresponding target search strategy. That is, the target search content is the search content obtained after processing the to-be-processed search content according to the target search policy. In this embodiment, the target search content may be one, two or more, and the number thereof is related to the target search strategy.
  • if the to-be-processed search content includes multiple target search identifiers, a target search strategy corresponding to each target search identifier can be determined, and the to-be-processed search content is then processed based on those target search strategies to obtain the target search content corresponding to the to-be-processed search content.
  • the text information can be any text.
  • the text information is a document pre-obtained by an acquisition method.
  • the text information may also be text determined based on the audio information of the multimedia data stream.
  • Multimedia data streams may be generated based on real-time interactive scenarios (e.g., multimedia conferences, live broadcasts, video chats), or based on recordings.
  • the real-time interactive scene can be realized by means of the Internet and computers, for example, an interactive application program realized by a native program or a World Wide Web (web) program.
  • the audio information of the multimedia data stream can be collected, and the audio information can be converted into corresponding text and displayed on the target page.
  • the text displayed on the target page can be used as text information.
  • the target content is the same content as the target search content.
  • the same content as the target search content can be filtered out from the text information, and this content is used as the target content.
  • if the target search content is associated content related to the to-be-processed search content, content associated with the to-be-processed search content can be screened out from the text information based on the target search content, thereby improving the comprehensiveness of the determined target content.
  • the to-be-processed search content is optimized based on the target search strategy, that is, the search conditions are optimized, and the corresponding target content is filtered out from the text information based on the optimized search conditions, which improves the comprehensiveness of the determined target content.
  • after the target content that is the same as the target search content is filtered out from the text information, the target content can be displayed differently in the text information so that the user can easily locate it in the text.
  • the target content itself is also one or more elements in the text information. When displayed, it can be displayed differently from other elements, thereby highlighting the filtered target content and allowing users to find it more intuitively and conveniently. Differential display may use a distinct display format such as color, font, or background pattern.
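Filtering the target content out of the text information and marking it for differential display could look roughly like the sketch below. The bracket markers stand in for whatever color, font, or background styling a real page would apply; the function name and the exact, case-sensitive matching are assumptions made for this illustration.

```python
import re
from typing import Iterable, List, Tuple


def highlight_targets(
    text: str,
    target_contents: Iterable[str],
    open_mark: str = "[[",
    close_mark: str = "]]",
) -> Tuple[str, List[Tuple[int, int]]]:
    """Return the marked-up text and the (start, end) span of every match."""
    # Longer terms first, so overlapping candidates prefer the longest match.
    terms = sorted({t for t in target_contents if t}, key=len, reverse=True)
    if not terms:
        return text, []
    pattern = re.compile("|".join(re.escape(t) for t in terms))
    spans = [m.span() for m in pattern.finditer(text)]
    marked = pattern.sub(lambda m: f"{open_mark}{m.group(0)}{close_mark}", text)
    return marked, spans
```

The returned spans refer to positions in the original text, so a page could use them to style the matching elements instead of inserting markers.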
  • generating text information based on a multimedia data stream may be: collecting voice information from the multimedia data stream; and displaying, on the target page, text information corresponding to the target translation language type.
  • the number of users participating in the real-time interaction or the users participating in the speech in the screen recording video may be multiple, and the language type used by each speaking user may be the same or different.
  • if a speaker's language type differs from the language used by the user, the user may not be able to understand what that speaker says.
  • the text information generated based on the original language type can therefore be converted into text information in the language type expected by the user (i.e., the text information corresponding to the target translation language type).
  • the original language type can be understood as the language used by the users participating in the voice interaction in the multimedia data stream.
  • the target translation language type can be understood as the language type expected by the user for displaying text information.
  • the text information corresponding to the target translation language type may be translation data corresponding to the voice information presented in the target translation language type.
  • the text information may display the speaking user identity and the speaking time stamp of each piece of translation data.
  • the voice data (that is, the voice information) of a plurality of users participating in the interaction can be collected from the multimedia data stream corresponding to the interactive behavior interface, and the original language type corresponding to the voice information can be recorded.
  • the voice information can be translated from the original language type to the target translation language type to obtain translation data corresponding to the voice information.
  • the translation data, the speaking user ID corresponding to the translation data, and the speaking time stamp are used as text information displayed on the target page.
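One way to picture the resulting text information is as a list of entries, each carrying the speaking user ID, the speaking timestamp, and the translation data. In this sketch the record types, field names, and the `translate` callable are all hypothetical, and speech recognition is assumed to have already produced the original-language text.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Utterance:
    speaker_id: str
    timestamp_s: float
    original_language: str
    text: str  # assumed to come from a speech-to-text step not shown here


@dataclass(frozen=True)
class TextEntry:
    speaker_id: str
    timestamp_s: float
    translated_text: str


def build_text_information(
    utterances: Iterable[Utterance],
    target_language: str,
    translate: Callable[[str, str, str], str],
) -> List[TextEntry]:
    """Translate each utterance and keep its speaker ID and timestamp."""
    entries = []
    for u in sorted(utterances, key=lambda item: item.timestamp_s):
        if u.original_language == target_language:
            text = u.text  # already in the target translation language type
        else:
            text = translate(u.text, u.original_language, target_language)
        entries.append(TextEntry(u.speaker_id, u.timestamp_s, text))
    return entries
```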
  • a language type selection control can be set on the target page. In this way, the user can select the language type for translation by triggering the language type selection control on the target page, so as to translate the text information in the original language type generated from the speech information of the speaking user into the text information corresponding to the selected translation language type.
  • the target translation language type may also be determined by at least one of the following methods: acquiring the language type preset on the target client as the target translation language type; acquiring the login address of the target client, and determining, based on the login address, the target translation language type corresponding to the geographic location of the target client.
  • the first way may be: use the default language type in the device to which the speaking user belongs as the target translation language type; or, the user may preset the language type to be converted, so that the language type preset by the user is used as the target translation language type.
  • the user may preset the target translation language type.
  • a language type selection list may pop up on the target page for the user to select. The user can select any language type. For example, if the user triggers the Chinese language type in the language type selection list and clicks the confirmation button, the server or client can determine that the target translation language type is the Chinese language type. That is to say, the voice information in the multimedia data stream can be converted into Chinese text information and displayed on the target interface.
  • the second method can be: the login address of the client, that is, the Internet Protocol (IP) address of the client, can be obtained; the region to which the client belongs can be determined according to the login address; and the language type used in that region can then be determined and used as the target translation language type.
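The two ways of choosing the target translation language type can be combined in a small resolver, sketched here. The network-to-language table uses reserved documentation address ranges and is purely illustrative; a real system would consult a geolocation service. Preferring the preset language over the login address is an assumption of this sketch, since the disclosure only says that at least one of the methods may be used.

```python
import ipaddress
from typing import Optional

# Hypothetical region table keyed by documentation-only networks.
REGION_LANGUAGE_BY_NETWORK = {
    ipaddress.ip_network("192.0.2.0/24"): "zh",
    ipaddress.ip_network("198.51.100.0/24"): "en",
}


def resolve_target_language(
    preset_language: Optional[str],
    login_ip: Optional[str],
    default: str = "en",
) -> str:
    """Pick the preset language if any, else infer one from the login address."""
    if preset_language:
        return preset_language
    if login_ip:
        address = ipaddress.ip_address(login_ip)
        for network, language in REGION_LANGUAGE_BY_NETWORK.items():
            if address in network:
                return language
    return default
```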
  • the text information is thus more in line with the user's reading habits, so that the user can quickly understand the content corresponding to the multimedia data stream, which further improves the efficiency of interaction.
  • the method further includes: establishing a time stamp synchronization relationship between the text information and the multimedia data stream, and displaying the text information and the multimedia data stream on the target page; when it is detected that the target content is triggered, the multimedia data stream is jumped to the video playing time corresponding to the target content.
  • the time stamp synchronization relationship can be understood as the linkage between multimedia data stream and text information based on time synchronization.
  • when a piece of content in the text information is triggered, the current timestamp corresponding to that piece of content can be determined, and, based on the pre-established timestamp synchronization relationship, the multimedia data stream can be jumped to the playback position corresponding to the current timestamp.
  • after the time stamp synchronization relationship between the multimedia data stream and the text information is established, when the progress bar of the screen recording video is dragged to a position, the current time stamp corresponding to that position can be obtained, and, based on the pre-established time stamp synchronization relationship, the content in the text information corresponding to the voice content at the playback position can be determined.
  • the text content may be displayed differently on the target page based on the time stamp synchronization relationship, for example, highlighted.
  • the current time stamp corresponding to the text content can also be obtained, and the corresponding audio and video frames can be obtained and displayed based on the time stamp synchronization relationship.
  • the current time stamp of the target content is determined; based on the pre-established time stamp synchronization relationship and the current time stamp, the multimedia data stream is jumped to the video frame corresponding to the current time stamp.
  • the current timestamp corresponding to the sentence to which the target content belongs can be obtained.
  • the multimedia data stream is jumped to the playback position corresponding to the current time stamp, so that the user can understand the tone and state of the speaking user when that user uttered the voice information including the target content, thereby improving the efficiency of interaction.
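The timestamp synchronization relationship works in both directions: from a sentence in the text information to a playback position, and from a dragged progress-bar position back to a sentence. A minimal sketch, assuming each sentence is keyed by its start time in seconds (the class and method names are made up for illustration):

```python
from bisect import bisect_right
from typing import Iterable


class TimestampSync:
    """Links sentence start times with playback positions of the stream."""

    def __init__(self, sentence_start_times: Iterable[float]) -> None:
        self.starts = sorted(sentence_start_times)
        if not self.starts:
            raise ValueError("at least one sentence timestamp is required")

    def seek_position_for_sentence(self, index: int) -> float:
        # Text -> stream: playback position to jump to when a sentence
        # (for example, one containing target content) is triggered.
        return self.starts[index]

    def sentence_for_position(self, position_s: float) -> int:
        # Stream -> text: index of the sentence being spoken at a position,
        # i.e. the last sentence that starts at or before it.
        return max(bisect_right(self.starts, position_s) - 1, 0)
```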
  • the method further includes: displaying the identifier of the target content on the time axis corresponding to the multimedia data stream.
  • the timestamp of the sentence to which each target content belongs can be determined, and the timestamp can be used as the current timestamp corresponding to the target content.
  • each current timestamp corresponding to the target content is displayed on the time axis, so that the user can quickly determine the relative position of each target content in the multimedia data stream.
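Placing the markers on the time axis amounts to converting each target-content timestamp into a relative position along the stream's duration. A small sketch under that assumption, with a hypothetical function name and positions expressed as fractions between 0 and 1:

```python
from typing import Iterable, List


def timeline_markers(
    target_timestamps_s: Iterable[float], duration_s: float
) -> List[float]:
    """Map target-content timestamps to relative positions on the time axis."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    # Several hits in the same sentence share one timestamp, so deduplicate.
    unique_times = sorted(set(target_timestamps_s))
    return [min(max(t / duration_s, 0.0), 1.0) for t in unique_times]
```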
  • the technical solution of the present embodiment realizes synchronous linkage between the text information and the multimedia data stream by establishing the time stamp synchronization relationship between them, so that the user can quickly find the position in the multimedia data stream that corresponds to the text information, understand the voice information of the speaking user in context, and exchange information more efficiently.
  • FIG. 2 is a schematic flowchart of a method for searching for target content according to Embodiment 2 of the present disclosure.
  • the to-be-combined search strategies corresponding to different target search identifiers can be determined, the target search strategy can be determined based on the to-be-combined search strategies, and the target search content corresponding to the to-be-processed search content can then be determined based on the target search strategy.
  • the method includes:
  • S210 Acquire at least one target search identifier in the to-be-processed search content.
  • the to-be-processed search content may include one or more target search identifiers.
  • the target search identifier can be a preset special symbol. Special symbols can be ",, -" and other symbols.
  • a search identification library can be preset, and the search identification library includes multiple search identifications.
  • a correspondence between search identifiers and search strategies can be established, so that when a search identifier is detected, the search strategy corresponding to it is retrieved according to the correspondence.
  • acquiring multiple target search identifiers in the search content to be processed may be: determining search identifiers included in the search content to be processed according to a preset search identifier library.
  • multiple symbols in the search content to be processed are acquired, a target symbol is determined from each symbol according to a preset search identification library, and the target symbol is used as the target search identification.
  • the target search identifiers included in the search content to be processed may be one, two or more, and the target search identifiers may be the same or different. If the target search identifiers are the same, the corresponding search strategies are also the same.
  • S220 Determine a search strategy to be combined corresponding to each target search identifier.
  • a search strategy corresponding to each target search identifier may be determined according to the correspondence table, and the determined search strategy may be used as the search strategy to be combined. That is to say, the search strategy corresponding to each target search identifier can be used as the search strategy to be combined.
  • if the multiple target search identifiers are the same, the corresponding search strategies to be combined are also the same; if the multiple target search identifiers differ, the corresponding search strategies to be combined may be the same or different.
  • determining the search strategy to be combined corresponding to each target search identifier may be: establishing a correspondence between the search identifier and the search strategy in advance. After the target search identifier is acquired, the search policy to be combined corresponding to the target search identifier can be determined according to the correspondence between the search identifier and the search policy. If there is one target search identifier, there may be one strategy to be combined; if there are multiple target search identifiers, there may be multiple search strategies to be combined.
  • the target search strategy is formed based on each search strategy to be combined.
  • if there is one search strategy to be combined, the target search strategy is that single strategy; if there are multiple search strategies to be combined, the target search strategy includes the multiple search strategies to be combined.
  • S240 Determine the content to be converted in the search content to be processed according to the at least one target search identifier, process the content to be converted according to the target search strategy, and determine the content to be replaced corresponding to the content to be converted.
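Looking up the strategies to be combined and forming the target search strategy can be sketched with a plain correspondence table. The table contents are assumptions for illustration only (the text ties the apostrophe to the combining strategy; the other two symbols are made up here), and collapsing identical strategies into one entry is a choice of this sketch rather than something the disclosure prescribes.

```python
from typing import Iterable, List

# Hypothetical correspondence between search identifiers and strategies.
STRATEGY_BY_IDENTIFIER = {
    "'": "combine",
    "*": "associate",
    ",": "split",
}


def target_search_strategy(target_identifiers: Iterable[str]) -> List[str]:
    """Look up each identifier's strategy and merge them in first-seen order."""
    to_combine = [STRATEGY_BY_IDENTIFIER[symbol] for symbol in target_identifiers]
    # dict.fromkeys keeps insertion order while dropping repeated strategies.
    return list(dict.fromkeys(to_combine))
```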
  • the content to be converted is the content determined according to the target search identifier.
  • the content to be replaced is the content obtained after processing the content to be converted according to the target search strategy.
  • each target search identifier may be processed to determine the search strategy and the content to be converted corresponding to that identifier, so as to determine the content to be replaced corresponding to the content to be converted.
  • the target search strategy includes at least one of the following: a search strategy of combining some character strings in the to-be-processed search content and searching for the combination; a search strategy of determining associated content of some character strings in the to-be-processed search content and searching based on the associated content; and a search strategy of splitting some character strings in the to-be-processed search content and searching for the parts.
  • the search strategy to be combined may be any of the above search strategies.
  • the target search strategy is one or more of the above-mentioned search strategies to be combined.
  • the partial character string is determined based on the target search identifier in the search content to be processed.
  • the combined search strategy can be understood as a search strategy in which the string and the target search identifier are searched as a whole after the character string is determined according to the target search identifier.
  • the search strategy for searching based on the associated content may be a strategy of determining the content associated with the character string after determining the character string according to the target search identifier, and performing the search based on the determined associated content.
  • the search strategy of split search refers to using each character string as the content to be replaced after the character string is determined according to the target search identifier.
  • the corresponding target search strategy and the content to be converted can be determined according to the target search identifier, and then the content to be replaced is generated based on the content to be converted, so as to generate the target search content based on the content to be replaced and the search content to be processed.
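As an illustration of the split search strategy, the sketch below cuts the to-be-processed search content at an identifier and treats each resulting character string as content to be replaced. Using "," as the splitting identifier is an assumption of this example, as is the function name.

```python
from typing import List


def split_on_identifier(pending_query: str, identifier: str = ",") -> List[str]:
    """Split the query at the identifier; each non-empty part is searched alone."""
    parts = (part.strip() for part in pending_query.split(identifier))
    return [part for part in parts if part]
```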
  • Different search identifiers may be classified into at least three categories, namely first target search identifiers, second target search identifiers and third target search identifiers, and search strategies corresponding to different search identifiers may also be classified into at least three types. If the to-be-processed search content includes multiple types of search identifiers, a search strategy corresponding to each type of search identifier can be determined separately, and then the to-be-replaced content is determined, and target search content is generated according to the to-be-replaced content.
  • the content to be converted and the content to be replaced in the search content to be processed may be determined by referring to at least one of the following manners.
  • the first way may be: if the target search identifier includes a first target search identifier, determine the character string adjacent to the first target search identifier, and generate the to-be-converted content based on that character string; then generate the to-be-replaced content based on the to-be-converted content and the first target search identifier.
  • the first target search identifier can be an abbreviated identifier.
  • the target identifier in this case can be "'", and "'" can be used as the first target search identifier.
  • the character string adjacent to the target identifier can be determined, and the adjacent character string can be used as the content to be converted. For example, if the target identifier is "'", and the string adjacent to the target identifier is "you ll", "you ll" can be used as the content to be converted.
  • the content to be replaced is generated according to the content to be converted and the first target search identifier.
  • the target search strategy corresponding to the first target search identifier is the search strategy of combining some character strings in the to-be-processed search content, so the content to be converted and the first target search identifier can be combined as the content to be replaced; the resulting content to be replaced can be "you'll".
  • the content to be converted in the search content to be processed can be determined, and the content to be replaced corresponding to the content to be converted can be obtained.
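  • The first way can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function name `combine_around_identifier`, the whitespace tokenisation, and the choice of the apostrophe as the only first target search identifier are assumptions made for the example.

```python
def combine_around_identifier(query, identifier="'"):
    """Return (content_to_convert, content_to_replace) pairs.

    Hypothetical helper: for every whitespace-separated token that has a
    character string on both sides of `identifier`, the two adjacent strings
    form the content to be converted, and joining them with the identifier
    gives the content to be replaced.
    """
    pairs = []
    for token in query.split():
        if identifier not in token:
            continue
        left, _, right = token.partition(identifier)
        if left and right:
            to_convert = f"{left} {right}"               # e.g. "you ll"
            to_replace = f"{left}{identifier}{right}"    # e.g. "you'll"
            pairs.append((to_convert, to_replace))
    return pairs
```

  • Under these assumptions, a query containing "you'll" yields the pair ("you ll", "you'll"), matching the example in the text.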
  • The second way may be: if the target search identifier includes a second target search identifier, obtain a target character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content, and use the target character string as the content to be converted; at least one associated word corresponding to the target character string is determined, and the content to be replaced is determined based on the at least one associated word.
  • the target search strategy corresponding to the second target search identifier may be: a search strategy for determining the associated content of some character strings in the search content to be processed, and performing a search based on the associated content.
  • The second target search identifier is preset. If it is detected that the second target search identifier exists, the character string that precedes and is adjacent to the second target search identifier can be obtained as the target character string, and this target string can be used as the content to be converted. Since the search strategy corresponding to the second target search identifier is the search strategy for obtaining the associated content of part of the character string, the associated content may be the content obtained by performing at least one form of transformation on the target character string, for example changing its tense or its singular or plural form. The vocabulary associated with the target string can be used as the content to be replaced.
  • That is, the character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content may be obtained as the target character string and used as the content to be converted. Whether the target string has associated content can then be detected.
  • The associated content can be the vocabulary corresponding to the target string in different tenses, in singular and plural forms, etc. If the target string has associated content, the associated content of the content to be converted is determined and used as the content to be replaced.
  • The associated word, that is, the associated content, includes at least one of: a derivative of the target character string, character strings in various tenses corresponding to the target character string, and the singular form and plural form of the target character string.
  • a derivative word can be a vocabulary corresponding to the target string determined by taking the target string as a root.
  • the tense includes present tense, future tense and past tense, which can determine the vocabulary corresponding to the target string in different tenses. It can also be determined whether the target string has singular or plural forms. If the target string has singular and plural forms, the singular and plural forms of the target string are also associated words.
  • the corresponding related word can be obtained according to the above principles, and the related word can be used as the content to be replaced.
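  • A minimal sketch of the second way follows. The text does not name the second target search identifier, so the "*" used here is purely hypothetical, as are the function name and the hand-written `ASSOCIATED_WORDS` table; a real system would more plausibly rely on a morphological analyser or dictionary.

```python
# Hypothetical toy table of associated words (tenses, plural forms, derivatives).
ASSOCIATED_WORDS = {
    "search": ["searches", "searched", "searching"],
    "city": ["cities"],
}


def associated_contents(query, identifier="*"):
    """Return (target_string, associated_words) for the first `identifier`.

    The target string is the character string immediately before the
    identifier; it plays the role of the content to be converted, and the
    associated words play the role of the content to be replaced.
    """
    idx = query.find(identifier)
    if idx <= 0:
        return None, []          # identifier absent, or nothing precedes it
    before = query[:idx]
    if before[-1].isspace():
        return None, []          # preceding string is not adjacent
    target = before.split()[-1]
    return target, ASSOCIATED_WORDS.get(target.lower(), [])
```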
  • the third way may be, if the target search identifier includes a third target search identifier, determine a character string adjacent to the third target search identifier according to the third target search identifier, and based on the character string Generate the content to be converted; take each character string in the content to be converted as the content to be replaced.
  • the target search strategy corresponding to the third target search identifier may be a search strategy for splitting and searching for partial strings in the search content to be processed.
  • the partial character string is determined based on the third target search identification.
  • the third target search identifier is an identifier for splitting and searching a part of the character string.
  • The third target search identifier may be ",", "-" and the like. If it is detected that the search content to be processed includes the third target search identifier, a character string adjacent to the third target search identifier is determined. Since the search strategy corresponding to the third target search identifier is a string splitting search strategy, the strings determined at this time can be respectively used as the content to be converted.
  • the target search identifier is determined to be "-".
  • the target search strategy corresponding to "-" is a partial string split search strategy, which can obtain strings adjacent to the target search identifier.
  • The obtained adjacent strings are "A" and "B", that is, the strings "A" and "B" are the content to be converted.
  • “A” and “B” in the content to be converted can be respectively used as the content to be replaced.
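  • The splitting strategy can be illustrated with a short sketch. The function name and the default identifier set ("," and "-") are assumptions for the example; the input "A-B" is likewise only an assumed query that would produce the adjacent strings "A" and "B" described above.

```python
import re


def split_on_identifiers(query, identifiers=(",", "-")):
    """Split `query` at every third-type identifier and return the pieces.

    Each returned string is treated both as content to be converted and,
    unchanged, as content to be replaced.
    """
    pattern = "|".join(re.escape(i) for i in identifiers)
    parts = [p.strip() for p in re.split(pattern, query)]
    return [p for p in parts if p]   # drop empty fragments
```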
  • If the to-be-processed search content includes a target search identifier, at least one to-be-replaced content corresponding to the target search identifier can be determined, and both the to-be-replaced content and the to-be-processed search content can be used as a search condition in the target search content.
  • If the search content to be processed includes at least two target search identifiers, the content to be converted corresponding to each target search identifier and at least one content to be replaced corresponding to each content to be converted can be determined according to the corresponding search strategy.
  • Each content to be converted is respectively replaced with the content to be replaced to obtain the target search content.
  • the to-be-converted content in the to-be-processed search content is replaced with the to-be-replaced content to obtain at least one target sub-search content; and the target search content is generated based on the at least one target sub-search content.
  • The content to be replaced can be substituted at the corresponding position in the search content to be processed.
  • That is, each content to be converted can be replaced by its corresponding content to be replaced.
  • The to-be-processed search content obtained after replacement is used as one target sub-search content in the target search content. That is to say, the target search content may include multiple target sub-search contents, each obtained by replacing the content to be converted in the search content to be processed with a content to be replaced.
  • the number of target sub-search contents corresponds to the number of contents to be replaced.
  • the content to be replaced corresponding to each content to be converted can be acquired, and the content to be converted can be replaced with the corresponding content to be replaced to obtain at least one target sub-search content in the target search content.
  • the set of at least one target sub-search content is the target search content.
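  • The replacement step can be sketched as below. The function name and the use of plain substring replacement are assumptions; the sketch only shows that one target sub-search content is produced per content to be replaced, and that their set forms the target search content.

```python
from typing import List


def build_target_search_content(query: str, to_convert: str,
                                replacements: List[str]) -> List[str]:
    """Return the target sub-search contents for one content to be converted.

    Each replacement is substituted for `to_convert` inside `query`;
    duplicates are dropped while preserving order.
    """
    sub_contents: List[str] = []
    for replacement in replacements:
        candidate = query.replace(to_convert, replacement)
        if candidate not in sub_contents:
            sub_contents.append(candidate)
    return sub_contents
```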
  • the exact same content as the target search content can be obtained from the text information and used as the target content.
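  • As an illustration of matching the target search content against the text information, the following sketch returns every verbatim occurrence of any target sub-search content. The function name and the (start, end, matched) tuple layout are assumptions for the example, not part of the disclosure.

```python
def find_target_content(text, sub_contents):
    """Return sorted (start, end, matched) spans of verbatim matches in `text`."""
    spans = []
    for content in sub_contents:
        if not content:
            continue                      # an empty string would match everywhere
        start = text.find(content)
        while start != -1:
            spans.append((start, start + len(content), content))
            start = text.find(content, start + 1)
    return sorted(spans)
```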
  • The technical solution of the embodiments of the present disclosure acquires the target search identifier in the search content to be processed, determines the target search strategy corresponding to the target search identifier, and then improves and enriches the to-be-processed search content based on the target search strategy to obtain the target search content, which improves the comprehensiveness and richness of the target content.
  • the method further includes: determining a time stamp of each target content in a time axis corresponding to the multimedia data stream, and marking a position on the time axis corresponding to the time stamp .
  • the time axis is the time axis corresponding to the multimedia data stream.
  • the total duration corresponding to the multimedia data stream is 50 minutes, and the time axis corresponding to the multimedia data stream is also 50 minutes.
  • the timestamp corresponding to the target content can be determined according to the sentence to which the target content belongs.
  • the position of the time stamp on the time axis can be determined and marked at the position, for example, it can be marked with a circle or a triangle below the position of the time axis, see FIG. 3 .
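  • One simple way to place a mark is a linear mapping from the time stamp to a horizontal offset on the time axis. The sketch below assumes such a linear mapping and a pixel-based axis width; both, along with the function name, are illustrative assumptions rather than details given in the text.

```python
def marker_offset(timestamp_s: float, total_duration_s: float,
                  axis_width_px: float) -> float:
    """Map a time stamp (seconds) to a horizontal offset on the time axis."""
    if total_duration_s <= 0:
        raise ValueError("total duration must be positive")
    # Clamp so that a mark never falls outside the axis.
    clamped = min(max(timestamp_s, 0.0), total_duration_s)
    return clamped / total_duration_s * axis_width_px
```

  • For a 50-minute stream drawn on a 600-pixel axis, a time stamp at 25 minutes would be marked at the midpoint of the axis.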
  • Marking the position corresponding to the time stamp on the time axis includes: among a plurality of controls in one-to-one correspondence with all time stamps on the time axis, determining the position of the control corresponding to the time stamp, and marking that position; wherein the control is used to adjust the audio and video frames of the multimedia data stream.
  • The position of the control is the position of the dot mark.
  • For example, the search content edited by the user in the search content editing control is "algorithm"; target content identical to "algorithm" can be searched for in the text information and displayed distinctively, for example highlighted. The timestamp of the sentence to which the target content belongs can then be determined, and a mark, for example a dot, is placed on the time axis corresponding to the multimedia data stream based on that timestamp.
  • the user can set the color and size of the mark according to actual needs, which is not limited here.
  • the number of target content can be displayed, for example, the total number displayed in the search content editing control is 12, see FIG. 3 .
  • The advantage of marking the audio and video frames corresponding to the target content on the time axis is that the user can clearly determine the position of the target content in the multimedia data stream according to the mark on the time axis, thereby making it more convenient to find the corresponding target content.
  • the number of target contents may be more than one, and correspondingly, the number of markers on the time axis may also be more than one. Referring to FIG. 3 , the number of target contents is 12, and the number of markers on the time axis is also 12.
  • In order to help the user determine which of all the target contents is currently triggered, the search content editing control also displays the sequence number corresponding to the currently triggered target content.
  • the method further includes: when detecting that a target content is triggered, determining a target time stamp corresponding to the target content; distinguishing and displaying the marks corresponding to the target time stamp .
  • the user can trigger any target content.
  • the timestamp (target timestamp) corresponding to the user-triggered target content can be determined, and the target mark corresponding to the target timestamp on the time axis can be determined.
  • For example, the target time stamp corresponding to the target content corresponding to mark 1 can be determined, the corresponding mark on the time axis can be determined according to the target time stamp to be the mark corresponding to mark 2, and that mark can be highlighted.
  • The advantage of displaying the mark corresponding to the triggered target content distinctively on the time axis is that the user can know the position of the triggered target content in the multimedia data stream, which improves the user experience.
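  • The trigger-and-highlight behaviour can be modelled with a small piece of state. The class name `TimelineMarks`, its fields, and the "n/total" sequence format are assumptions chosen for the sketch; the text only states that the mark for the triggered target content is displayed distinctively and that its sequence is shown.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TimelineMarks:
    """Hypothetical model: one mark per target content, at most one highlighted."""
    timestamps: List[float]              # time stamp (seconds) of each target content
    active_index: Optional[int] = None   # index of the currently triggered content

    def trigger(self, index: int) -> str:
        """Mark target content `index` as triggered; return its sequence label."""
        if not 0 <= index < len(self.timestamps):
            raise IndexError("no such target content")
        self.active_index = index
        return f"{index + 1}/{len(self.timestamps)}"

    def is_highlighted(self, index: int) -> bool:
        """True only for the mark of the currently triggered target content."""
        return index == self.active_index
```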
  • FIG. 5 is a schematic structural diagram of an apparatus for searching for target content according to Embodiment 3 of the present disclosure. As shown in FIG. 5 , the apparatus includes: a target search strategy determination module 310 , a target search content determination module 320 and a target content determination module 330 .
  • The target search strategy determination module 310 is configured to determine the target search identifier in the search content to be processed, and to determine the target search strategy corresponding to the target search identifier; the target search content determination module 320 is configured to generate, according to the target search identifier, the target search strategy and the to-be-processed search content, target search content corresponding to the to-be-processed search content; the target content determination module 330 is configured to search text information for target content corresponding to the target search content.
  • The to-be-processed search content is optimized based on the target search strategy, that is, the search conditions are optimized, and the corresponding target content is filtered out of the text information based on the optimized search conditions, which improves the comprehensiveness and accuracy of the determined target content.
  • the target search strategy determination module 310 includes:
  • a target search identifier determination unit, configured to obtain at least one target search identifier in the to-be-processed search content;
  • a to-be-combined search strategy determination unit, configured to determine a to-be-combined search strategy corresponding to each target search identifier; and
  • a target search strategy determination unit, configured to generate a target search strategy corresponding to the to-be-processed search content based on each to-be-combined search strategy.
  • the target search strategy includes at least one of the following: a search strategy for combining and searching partial strings in the search content to be processed; determining the associated content of the partial strings in the search content to be processed, and A search strategy for searching based on associated content; a search strategy for splitting a part of the string in the search content to be processed; wherein the part of the string is determined based on the target search identifier in the search content to be processed
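  • The relationship between identifiers and strategies can be sketched as a lookup followed by a combination step. The `Strategy` enum, the `IDENTIFIER_STRATEGIES` table, and in particular the "*" identifier for the associated-content strategy are hypothetical; only "'" and "-" appear as identifier examples in the text.

```python
from enum import Enum


class Strategy(Enum):
    COMBINE = "combine"      # combine and search partial strings
    ASSOCIATE = "associate"  # search based on associated content
    SPLIT = "split"          # split and search partial strings


# Hypothetical identifier-to-strategy table.
IDENTIFIER_STRATEGIES = {
    "'": Strategy.COMBINE,
    "*": Strategy.ASSOCIATE,
    "-": Strategy.SPLIT,
    ",": Strategy.SPLIT,
}


def determine_target_strategy(query):
    """Collect the to-be-combined strategies for every identifier in `query`.

    The returned list (in order of first appearance, without duplicates)
    stands in for the target search strategy of the whole query.
    """
    found = []
    for ch in query:
        strategy = IDENTIFIER_STRATEGIES.get(ch)
        if strategy is not None and strategy not in found:
            found.append(strategy)
    return found
```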
  • The target search content determination module 320 is configured to determine the content to be converted in the search content to be processed according to each target search identifier, process the content to be converted according to the target search strategy, and determine the content to be replaced corresponding to the content to be converted; and to generate the target search content based on the content to be replaced and the search content to be processed.
  • the target search strategy includes a search strategy for combining and searching some character strings in the search content to be processed;
  • the target search content determination module 320 includes: a first to-be-converted content determination unit, configured to, if the at least one target search identifier includes a first target search identifier, determine a character string adjacent to the first target search identifier according to the first target search identifier, and generate the to-be-converted content based on the character string; and a first to-be-replaced content determination unit, configured to generate the to-be-replaced content based on the to-be-converted content and the first target search identifier.
  • the target search strategy includes a search strategy for determining the associated content of some character strings in the search content to be processed, and searching based on the associated content;
  • the target search content determination module 320 includes: a second to-be-converted content determination unit, configured to, if the at least one target search identifier includes a second target search identifier, obtain a target character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content, and use the target character string as the content to be converted; and
  • a second to-be-replaced content determination unit, configured to determine at least one associated word corresponding to the content to be converted, and determine the content to be replaced based on the at least one associated word.
  • The at least one associated word includes at least one of: derivatives of the target character string, character strings in multiple tenses corresponding to the target character string, singular forms of the target character string, and plural forms of the target character string.
  • the target search strategy includes a search strategy for splitting and searching some character strings in the search content to be processed;
  • the target search strategy determination module 310 includes: a third to-be-converted content determination unit, configured to, if the at least one target search identifier includes a third target search identifier, determine a character string adjacent to the third target search identifier according to the third target search identifier, and generate the content to be converted based on the character string; and a third to-be-replaced content determination unit, configured to use each character string in the content to be converted as the content to be replaced.
  • the target search content determination module 320 includes:
  • a target sub-search content determination unit, configured to replace the to-be-converted content in the to-be-processed search content with the to-be-replaced content to obtain at least one target sub-search content; and
  • a target search content determination unit, configured to generate the target search content based on the at least one target sub-search content.
  • the target content determination module 330 is configured to search for the same content as the target search content from the text information according to the target search content to obtain the target content.
  • the target content is displayed in the text information differently.
  • the device further includes:
  • a voice information determination module, configured to determine the voice information based on the multimedia data stream; and a text information determination module, configured to generate, according to the voice information, the original language type corresponding to the voice information and the target translation language type, text information corresponding to the target translation language type, and to display the text information on the target page.
  • the apparatus further includes: a display module configured to display the identifier of the target content on the time axis corresponding to the multimedia data stream.
  • The apparatus further includes: a time stamp synchronization association relationship determination module, configured to establish a time stamp synchronization association relationship between the text information and the multimedia data stream, and to display the text information and the multimedia data stream on the target page, so that when it is detected that a target content is triggered, the multimedia data stream jumps to the video frame corresponding to the target content.
  • The apparatus further includes: a current timestamp determination unit, configured to determine the current timestamp of the target content if it is detected that a target content is triggered; and a video frame determination unit, configured to jump the multimedia data stream to the video frame corresponding to the current time stamp based on the pre-established time stamp synchronization association relationship and the current time stamp.
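  • The jump from a triggered target content to its video frame can be sketched with a lookup table standing in for the time stamp synchronization association relationship. `FakePlayer`, `jump_to_target`, and the sentence-id keyed table are hypothetical names introduced for this example only; a real player would expose its own seek interface.

```python
from typing import Dict


class FakePlayer:
    """Stand-in for a media player; it only records the last seek position."""

    def __init__(self) -> None:
        self.position_s = 0.0

    def seek(self, timestamp_s: float) -> None:
        self.position_s = timestamp_s


def jump_to_target(player: FakePlayer, sync_table: Dict[int, float],
                   sentence_id: int) -> float:
    """Look up the time stamp of the triggered sentence and seek the player to it."""
    timestamp = sync_table[sentence_id]   # pre-established association
    player.seek(timestamp)
    return timestamp
```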
  • the apparatus for searching target content provided by the embodiment of the present disclosure can execute the method for searching target content provided by any embodiment of the present disclosure, and has functional modules and effects corresponding to the execution method.
  • Referring to FIG. 6, it shows a schematic structural diagram of an electronic device (e.g., a terminal device or a server in FIG. 6) 400 suitable for implementing an embodiment of the present disclosure.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (PDAs), tablet computers (PADs), portable multimedia players (PMPs), in-vehicle terminals (e.g., in-vehicle navigation terminals), etc., and stationary terminals such as digital televisions (TVs), desktop computers, and the like.
  • The electronic device shown in FIG. 6 is only an example.
  • The electronic device 400 may include a processing device (such as a central processing unit, a graphics processor, etc.) 401, which may perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 402 or a program loaded from a storage device 408 into a random access memory (RAM) 403.
  • The RAM 403 also stores various programs and data necessary for the operation of the electronic device 400.
  • the processing device 401 , the ROM 402 , and the RAM 403 are connected to each other through a bus 404 .
  • An Input/Output (I/O) interface 405 is also connected to the bus 404 .
  • The following devices can be connected to the I/O interface 405: input devices 406 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 407 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; storage devices 408 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 409.
  • The communication device 409 may allow the electronic device 400 to communicate wirelessly or by wire with other devices to exchange data.
  • Although FIG. 6 shows the electronic device 400 with various devices, it is not required to implement or have all of the shown devices. More or fewer devices may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 409 , or from the storage device 408 , or from the ROM 402 .
  • When the computer program is executed by the processing device 401, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are performed.
  • the electronic device provided by the embodiment of the present disclosure and the method for searching target content provided by the above-mentioned embodiment belong to the same concept.
  • For technical details not described in detail in this embodiment, refer to the above-mentioned embodiments; this embodiment has the same effects as the above-mentioned embodiments.
  • Embodiments of the present disclosure provide a computer storage medium on which a computer program is stored, and when the program is executed by a processor, implements the method for searching for target content provided by the foregoing embodiments.
  • the computer-readable medium described above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • Examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, RAM, ROM, erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • the program code embodied on the computer-readable medium may be transmitted by any suitable medium, including but not limited to: electric wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
  • Clients and servers can communicate using any currently known or future developed network protocols, such as HyperText Transfer Protocol (HTTP), and can be interconnected with digital data communication in any form or medium (e.g., a communication network).
  • Examples of communication networks include local area networks (LANs), wide area networks (WANs), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed networks.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device:
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages such as Java, Smalltalk and C++, and conventional procedural programming languages such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user computer through any kind of network, including a LAN or WAN, or may be connected to an external computer (eg, using an Internet service provider to connect through the Internet).
  • Each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner.
  • The name of a unit or module does not, in some cases, constitute a limitation on the unit itself; for example, the target search strategy determination module may also be described as a "search strategy determination module".
  • Exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chip (SOCs), Complex Programmable Logic Devices (CPLDs), and so on.
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatuses, or devices, or any suitable combination of the foregoing. Examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, RAM, ROM, EPROM or flash memory, optical fibers, CD-ROMs, optical storage devices, magnetic storage devices, or any suitable combination of the above.
  • Example 1 provides a method for searching for target content, the method comprising:
  • Example 2 provides a method for searching for target content, further comprising:
  • the determining a target search identifier in the search content to be processed, and determining a target search strategy corresponding to the target search identifier includes:
  • Example 3 provides a method for searching for target content, further comprising:
  • the target search strategy includes at least one of the following:
  • a search strategy for combining and searching some character strings in the to-be-processed search content;
  • a search strategy for determining the associated content of some character strings in the to-be-processed search content, and searching based on the associated content;
  • a search strategy for splitting and searching some character strings in the to-be-processed search content; wherein the partial character strings are determined based on the target search identifier in the to-be-processed search content.
  • Example 4 provides a method for searching for target content, further comprising:
  • generating the target search content corresponding to the to-be-processed search content according to the target search identifier, the target search strategy and the to-be-processed search content includes:
  • according to each target search identifier, determine the to-be-converted content in the to-be-processed search content, process the to-be-converted content according to the target search strategy, and determine the to-be-replaced content corresponding to the to-be-converted content;
  • the target search content is generated from the to-be-replaced content and the to-be-processed search content.
  • Example 5 provides a method for searching for target content, further comprising:
  • the target search strategy includes a search strategy for combining and searching some character strings in the to-be-processed search content; determining the to-be-converted content in the to-be-processed search content according to each target search identifier, processing the to-be-converted content according to the target search strategy, and determining the to-be-replaced content corresponding to the to-be-converted content includes:
  • if the at least one target search identifier includes a first target search identifier, determine the character strings adjacent to the first target search identifier according to the first target search identifier, and generate the to-be-converted content based on the adjacent character strings; based on the to-be-converted content and the first target search identifier, generate the to-be-replaced content.
  • Example 6 provides a method for searching for target content, further comprising:
  • the target search strategy includes a search strategy for determining the associated content of some character strings in the to-be-processed search content, and performing a search based on the associated content; determining the to-be-converted content in the to-be-processed search content according to each target search identifier, processing the to-be-converted content according to the target search strategy, and determining the to-be-replaced content corresponding to the to-be-converted content includes:
  • if the at least one target search identifier includes a second target search identifier, obtain the target character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content, and use the target character string as the to-be-converted content; determine at least one associated word corresponding to the to-be-converted content, and determine the to-be-replaced content based on the at least one associated word.
  • Example 7 provides a method for searching for target content, further comprising:
  • the at least one associated word includes at least one of: a derivative of the target character string, character strings in multiple tenses corresponding to the target character string, and the singular form and the plural form of the target character string.
  • Example 8 provides a method for searching for target content, further comprising:
  • the target search strategy includes a search strategy for splitting and searching some character strings in the to-be-processed search content; determining the to-be-converted content in the to-be-processed search content according to each target search identifier, processing the to-be-converted content according to the target search strategy, and determining the to-be-replaced content corresponding to the to-be-converted content includes:
  • if the at least one target search identifier includes a third target search identifier, determine the character strings adjacent to the third target search identifier according to the third target search identifier, and generate the to-be-converted content based on the adjacent character strings; each character string in the to-be-converted content is used as the to-be-replaced content.
  • Example 9 provides a method for searching for target content, further comprising:
  • generating the target search content based on the to-be-replaced content and the to-be-processed search content includes:
  • Example 10 provides a method for searching for target content, further comprising:
  • the searching for the target content corresponding to the target search content from the text information includes:
  • according to the target search content, content identical to the target search content is searched for in the text information to obtain the target content.
  • Example 11 provides a method for searching for target content, further comprising:
  • the target content is displayed differently in the text information.
  • Example 12 provides a method for searching for target content, wherein the text information is determined based on audio information of a multimedia data stream.
  • Example 13 provides a method for searching for target content, wherein the text information includes translated text information; the translated text information is generated based on the audio information, the original language type corresponding to the audio information, and the target translation language type.
  • Example 14 provides a method for searching for target content, further comprising:
  • the identifier of the target content is displayed on the time axis corresponding to the multimedia data stream.
  • Example 15 provides a method for searching for target content, further comprising:
  • the method further includes:
  • Example 16 provides a method for searching for target content, further comprising:
  • jumping the multimedia data stream to a video frame corresponding to the target content includes:
  • if it is detected that a piece of target content is triggered, the current timestamp of the target content is determined; based on the pre-established timestamp synchronization relationship and the current timestamp, the multimedia data stream is jumped to the video frame corresponding to the current timestamp.
  • Example 17 provides a method for searching for target content, further comprising:
  • the displaying the identifier of the target content on the time axis corresponding to the multimedia data stream includes:
  • the time stamp of each target content in the time axis corresponding to the multimedia data stream is determined, and a position on the time axis corresponding to the time stamp is marked.
  • Example 18 provides a method for searching for target content, further comprising:
  • the marking at the position corresponding to the timestamp on the time axis includes:
  • among the multiple controls in one-to-one correspondence with all timestamps on the time axis, the position of the control corresponding to the timestamp is determined, and the position is marked; wherein the control is used for adjusting the audio and video frames of the multimedia data stream.
  • Example 19 provides a method for searching for target content, further comprising:
  • a target time stamp corresponding to the target content is determined; and the marks corresponding to the target time stamp are displayed differently.
  • Example 20 provides an apparatus for searching for target content, the apparatus comprising:
  • a target search strategy determination module, configured to determine a target search identifier in the to-be-processed search content, and to determine a target search strategy corresponding to the target search identifier; a target search content determination module, configured to generate, according to the target search identifier, the target search strategy, and the to-be-processed search content, target search content corresponding to the to-be-processed search content; a target content determination module, configured to search the text information for the target content corresponding to the target search content.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

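Disclosed herein are a method and apparatus for searching for target content, an electronic device, and a storage medium. The method includes: determining a target search identifier in to-be-processed search content, and determining a target search strategy corresponding to the target search identifier; generating, according to the target search identifier, the target search strategy, and the to-be-processed search content, target search content corresponding to the to-be-processed search content; and searching text information for target content corresponding to the target search content.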

Description

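Method and apparatus for searching for target content, electronic device, and storage medium
This application claims priority to Chinese Patent Application No. 202011056294.1, filed with the China Patent Office on September 29, 2020, the entire contents of which are incorporated herein by reference.
Technical Field
The present disclosure relates to the field of computer technology, for example to a method and apparatus for searching for target content, an electronic device, and a storage medium.
Background
When content is searched for in a document, the search mostly relies directly on the search content entered by the user to obtain the target content.
Target content found this way is exactly identical to the search condition. Associated content related to the search condition cannot be obtained, so the target content found is incomplete and the user experience is poor.
Summary
The present disclosure provides a method and apparatus for searching for target content, an electronic device, and a storage medium, so as to optimize the search condition and search the text for corresponding content based on the optimized search condition, improving the comprehensiveness and richness of the target content.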
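The present disclosure provides a method for searching for target content, the method including:
determining a target search identifier in to-be-processed search content, and determining a target search strategy corresponding to the target search identifier;
generating, according to the target search identifier, the target search strategy, and the to-be-processed search content, target search content corresponding to the to-be-processed search content;
searching text information for target content corresponding to the target search content.
The present disclosure further provides an apparatus for searching for target content, the apparatus including:
a target search strategy determination module, configured to determine a target search identifier in to-be-processed search content, and to determine a target search strategy corresponding to the target search identifier;
a target search content determination module, configured to generate, according to the target search identifier, the target search strategy, and the to-be-processed search content, target search content corresponding to the to-be-processed search content;
a target content determination module, configured to search text information for target content corresponding to the target search content.
The present disclosure further provides an electronic device, the electronic device including:
one or more processors;
a storage apparatus, configured to store one or more programs;
when the one or more programs are executed by the one or more processors, the one or more processors implement the above method for searching for target content.
The present disclosure further provides a storage medium containing computer-executable instructions which, when executed by a computer processor, are used to perform the above method for searching for target content.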
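Brief Description of the Drawings
FIG. 1 is a schematic flowchart of a method for searching for target content provided in Embodiment 1 of the present disclosure;
FIG. 2 is a schematic flowchart of a method for searching for target content provided in Embodiment 2 of the present disclosure;
FIG. 3 is a schematic diagram, provided in Embodiment 2 of the present disclosure, of target content displayed in correspondence with marks on the time axis;
FIG. 4 is a schematic diagram, provided in Embodiment 2 of the present disclosure, of the corresponding mark on the time axis being highlighted after target content is triggered;
FIG. 5 is a schematic structural diagram of an apparatus for searching for target content provided in Embodiment 3 of the present disclosure;
FIG. 6 is a schematic structural diagram of an electronic device provided in Embodiment 4 of the present disclosure.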
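Detailed Description
Embodiments of the present disclosure are described below with reference to the accompanying drawings. Although some embodiments of the present disclosure are shown in the drawings, the present disclosure may be implemented in many forms and should not be construed as limited to the embodiments set forth herein; these embodiments are provided to aid understanding of the present disclosure.
The steps recorded in the method implementations of the present disclosure may be performed in a different order and/or in parallel. In addition, the method implementations may include additional steps and/or omit performing the steps shown. The scope of the present disclosure is not limited in this respect.
As used herein, the term “include” and its variants are open-ended, that is, “including but not limited to”. The term “based on” means “based at least in part on”. The term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one further embodiment”; the term “some embodiments” means “at least some embodiments”. Definitions of other terms are given in the description below.
Concepts such as “first” and “second” mentioned in the present disclosure are used only to distinguish different apparatuses, modules, or units, and are not used to limit the order of, or the interdependence between, the functions performed by these apparatuses, modules, or units.
The modifiers “one” and “multiple” mentioned in the present disclosure are illustrative rather than restrictive and, unless the context indicates otherwise, should be understood as “one or more”.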
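Embodiment 1
FIG. 1 is a schematic flowchart of a method for searching for target content provided in Embodiment 1 of the present disclosure. This embodiment is applicable to the case where target search content corresponding to the to-be-processed search content is determined according to the search identifier in the to-be-processed search content, and content consistent with the target search content is then filtered out of the text. The method may be performed by an apparatus for searching for target content, and the apparatus may be implemented in software and/or hardware.
As shown in FIG. 1, the method of this embodiment includes:
S110. Determine a target search identifier in the to-be-processed search content, and determine a target search strategy corresponding to the target search identifier.
The to-be-processed search content is content obtained from a target page. The target page includes a search content editing control, in which the user can edit content; the edited content is taken as the to-be-processed search content. Optionally, the target page may also include text information generated from voice information of different language types or of the same language type. The to-be-processed search content may include multiple kinds of search identifiers, for example “、” and the like. Search identifiers may be preset: when the to-be-processed search content is detected to include a preset search identifier, it is determined that the to-be-processed search content includes a search identifier, and that search identifier may be taken as the target search identifier.
When the to-be-processed search content is obtained, whether it includes a search identifier can be determined against the preset search identifiers; if it does, the search identifier in the to-be-processed search content can be taken as the target search identifier. After the target search identifier is determined, the search strategy corresponding to it can be determined, so that the target search content corresponding to the to-be-processed search content is determined based on that search strategy.
In some optional implementations of this embodiment, obtaining the to-be-processed search content edited in the search content editing control includes: if it is detected that a control for starting the search is triggered, obtaining the to-be-processed search content edited in the search content editing control. For example, the target page may include a search content editing control and a control for starting the search; optionally, the control for starting the search may be a “Confirm search” control. The user can edit content in the search content editing control and, once editing is complete, trigger the control for starting the search, that is, click the “Confirm search” control, whereupon the server can obtain the to-be-processed search content in the search content editing control.
Alternatively, in other optional implementations, if it is detected that the search content editing control is triggered, the to-be-processed search content edited in the search content editing control is obtained.
For example, the target page includes a search content editing control. When it is detected that the user triggers the search content editing control, the search content edited in the control starts to be obtained; if no newly edited search content is detected within a preset duration, optionally 30 s, the search content obtained so far is taken as the to-be-processed search content.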
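S120. Generate, according to the target search identifier, the target search strategy, and the to-be-processed search content, target search content corresponding to the to-be-processed search content.
The target search strategy corresponds to the target search identifier, and the target search content corresponding to the to-be-processed search content can be determined according to the corresponding target search strategy. That is, the target search content is the search content obtained after the to-be-processed search content is processed according to the target search strategy. In this embodiment, there may be one, two, or more pieces of target search content; the number depends on the target search strategy.
If the to-be-processed search content includes multiple target search identifiers, the target search strategy corresponding to each target search identifier can be determined, and the to-be-processed search content is then processed based on the target search strategies to obtain the target search content corresponding to the to-be-processed search content.
S130. Search the text information for target content corresponding to the target search content.
The text information may be any text; optionally, it is a document obtained in advance in some way. The text information may also be text determined from the audio information of a multimedia data stream. The multimedia data stream may be generated from a real-time interactive scenario (for example, a multimedia conference, a live stream, or a video chat) or obtained by recording. A real-time interactive scenario can be implemented via the Internet and computer means, for example as an interactive application implemented by a native program or a World Wide Web (web) program. The audio information of the multimedia data stream can be collected and converted into corresponding text, which is displayed on the target page; the text displayed on the target page can be taken as the text information. The target content is content identical to the target search content.
According to the target search content, content identical to the target search content can be filtered out of the text information and taken as the target content. In the technical solution of this embodiment, because the target search content is associated content related to the to-be-processed search content, content associated with the to-be-processed search content can be filtered out of the text information based on the target search content, which improves the comprehensiveness of the determined target content.
In the technical solution of this embodiment of the present disclosure, the target search identifier in the to-be-processed search content is obtained and the corresponding target search strategy is determined; the to-be-processed search content is optimized based on the target search strategy, that is, the search condition is optimized; and the corresponding target content is filtered out of the text information based on the optimized search condition, which improves the comprehensiveness of the determined target content.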
在上述技术方案的基础上,在从文本信息中筛选出与目标搜索内容相同的目标内容后,为了便于用户确定目标内容在文本中的位置,可以将目标内容在 文本信息中区别显示。
目标内容本身也是文本信息中的一个或多个元素。在显示时,可以与其他元素区别显示,从而突出筛选后的目标内容,让用户可能更直观、便捷地发现目标内容。区别显示可以是以颜色、字体、背景图案等显示格式来区别显示。
在本实施例的一些可选的实现方式中,在从文本信息中筛选出目标内容之前,还需要生成相应的文本信息。可选的,基于多媒体数据流生成文本信息,可以是:基于多媒体数据流,采集语音信息;根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的文本信息。
在一些应用场景中,参与实时互动的用户或者录屏视频中参与发言的用户的数量可以是多个,每个发言用户发言时所使用的语种类型可以相同也可以不同,当其它发言用户所用的语种类型与本用户所使用的语种差异较大时,可能存在本用户无法了解其他发言用户的发言信息的情况。
为了解决这一问题,在将所采集的发言用户的语音信息转换为基于原始语种类型生成的文本信息的基础上,可以将该基于原始语种类型生成的文本信息转换为用户期望的语种类型(即目标翻译语种类型)对应的文本信息。
原始语种类型可以理解为多媒体数据流中,参与语音交互的用户各自使用的语种。相应地,目标翻译语种类型可以理解为用户期望的用于显示文本信息的语种类型。相应地,与目标翻译语种类型相对应的文本信息,可以是以目标翻译语种类型呈现的、与语音信息相对应的译文数据。为了便于用户直观地从文本信息中确定每条译文数据对应的发言用户以及发言时间,文本信息中可以显示每条译文数据的发言用户身份标识以及发言时间戳。
可以从与互动行为界面相对应的多媒体数据流中,采集多个参与互动的用户的语音数据,即语音信息,并记录语音信息所对应的原始语种类型。可以将语音信息从原始语种类型翻译为目标翻译语种类型,得到与语音信息对应的译文数据。将译文数据、译文数据对应的发言用户身份标识以及发言时间戳作为展示在目标页面上的文本信息。
在这些可选的实现方式的一些应用场景中,目标页面上可以设置语种类型选择控件。这样一来,用户可以通过触发目标页面上的语种类型选择控件来选择翻译语种类型,以将发言用户的语音信息生成的原始语种类型下的文本信息翻译为所选择的翻译语种类型对应的文本信息。
或者,在这些可选的实现方式的另一些应用场景中,目标翻译语种类型还可以通过如下至少一种方式来确定:获取目标客户端上预先设置的语种类型作 为目标翻译语种类型;获取所述目标客户端的登录地址,基于所述登录地址确定与所述目标客户端所在地理位置对应的目标翻译语种类型。
也就是说,确定目标翻译语种类型的方式可以包括至少两种。第一种方式可以是:将发言用户所属设备中的默认语种类型作为目标翻译语种类型;或者,用户可以预先设置将要转换的语种类型,从而将用户预先设置的语种类型作为目标翻译语种类型。示例性的,在将语音信息转换为文字信息之前,用户可以预先设置目标翻译语种类型,可选的,在用户触发语种类型选择控件时,目标页面上可以弹出语种类型选择列表以供用户选择。用户可以选择任意一种语种类型,如,用户触发了语种类型选择列表中的中文语种类型并点击了确认按键,服务端或客户端可以确定目标翻译语种类型为中文语种类型。也就是说,可以将多媒体数据流中的语音信息转换为中文文本信息,并将其展示在目标界面上。
第二种方式可以是:可以获取客户端的登录地址,即客户端的互联网协议(Internet Protocol,IP)地址,根据登录地址可以确定客户端所属的区域,进而确定该区域所使用的语种类型,并将该所使用的语种类型作为目标翻译语种类型。
在本实施例中,通过将多媒体数据流中的语音信息转换为目标翻译语种类型的文本信息,使文本信息更符合用户的阅读习惯,从而便于用户能够快速理解多媒体数据流所对应的内容,进而提高交互的效率。
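On the basis of the above, after the text information is obtained, the method further includes: establishing a timestamp synchronization association between the text information and the multimedia data stream, and displaying the text information and the multimedia data stream on the target page, so that when the target content is detected to be triggered, the multimedia data stream jumps to the video playback moment corresponding to the target content.
The timestamp synchronization association can be understood as the multimedia data stream and the text information being linked synchronously on the basis of time. When a piece of content in the text information is triggered, the current timestamp corresponding to that content can be determined and, based on the pre-established timestamp synchronization association, playback jumps to the part of the multimedia data stream corresponding to the current timestamp; for example, the multimedia data stream can be made to jump to the playback position corresponding to the current timestamp. Conversely, because the timestamp synchronization association between the multimedia data stream and the text information has been established, when the progress bar of the screen-recording video is dragged to a position, the current timestamp corresponding to that position can be obtained, and the content in the text information corresponding to the speech at that playback position can be determined based on the pre-established association. To help the user locate the text content corresponding to that audio within the text information, the text content can be displayed distinctively on the target page based on the timestamp synchronization association, optionally by highlighting. When text content in the text information is detected to be triggered, the current timestamp corresponding to the text content can likewise be obtained, and the corresponding audio and video frames are obtained and displayed based on the timestamp synchronization association. Optionally, if the target content is detected to be triggered, the current timestamp of the target content is determined; based on the pre-established timestamp synchronization association and the current timestamp, the multimedia data stream jumps to the video frame corresponding to the current timestamp.
When it is detected that the user triggers target content, the current timestamp corresponding to the sentence to which the target content belongs can be obtained. Based on the pre-established timestamp synchronization association, the multimedia data stream jumps to the playback position corresponding to the current timestamp, so that the user can learn the tone and state of the speaking user when uttering the voice information that includes the target content, thereby improving the efficiency of interaction.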
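The two directions of the timestamp synchronization association (text to playback position, and playback position to text) can be sketched as follows. This is a minimal illustration only: the `Sentence` record, the millisecond unit, and both function names are assumptions made for the sketch and are not defined in the disclosure.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class Sentence:
    start_ms: int  # hypothetical: timestamp of the sentence in the media stream
    text: str


def seek_position_for(sentences: list[Sentence], index: int) -> int:
    """Text -> media: timestamp to jump to when sentence `index` is triggered."""
    return sentences[index].start_ms


def sentence_at(sentences: list[Sentence], position_ms: int) -> int:
    """Media -> text: index of the sentence playing at `position_ms`.

    Assumes `sentences` is sorted by start_ms.
    """
    starts = [s.start_ms for s in sentences]
    # Last sentence whose start is at or before the playback position.
    return max(bisect_right(starts, position_ms) - 1, 0)
```

Keeping the sentences sorted by start time lets the progress-bar lookup use binary search rather than a linear scan.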
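On the basis of the above technical solution, the method further includes: displaying an identifier of the target content on the time axis corresponding to the multimedia data stream.
After the target content is determined, the timestamp of the sentence to which each piece of target content belongs can be determined and taken as the current timestamp corresponding to that target content. According to the time axis corresponding to the multimedia data stream, the current timestamp corresponding to each piece of target content is displayed on the time axis, so that the user can quickly determine the relative position of each piece of target content in the multimedia data stream.
In the technical solution of this embodiment, establishing the timestamp synchronization association between the multimedia data stream and the text information achieves synchronized linkage between the two, which makes it easy for the user to quickly locate the position of given text information in the multimedia data stream, and in turn to understand the speaking user's voice information in context, improving the efficiency of information exchange.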
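Embodiment 2
FIG. 2 is a schematic flowchart of a method for searching for target content provided in Embodiment 2 of the present disclosure. On the basis of the foregoing embodiment, when the target search identifiers in the to-be-processed search content are obtained, the to-be-combined search strategies corresponding to the different target search identifiers can be determined, the target search strategy is determined based on the to-be-combined search strategies, and the target search content corresponding to the to-be-processed search content is then determined based on the target search strategy. Technical terms that are the same as or correspond to those in the foregoing embodiment are not described again here.
As shown in FIG. 2, the method includes:
S210. Obtain at least one target search identifier in the to-be-processed search content.
The to-be-processed search content may include one or more target search identifiers. A target search identifier may be a preset special symbol, such as “,、-” and the like.
A search identifier library including multiple search identifiers can be preset. Correspondingly, to quickly retrieve the search strategy corresponding to each search identifier, a correspondence between search identifiers and search strategies can be established, so that when a search identifier is detected, the corresponding search strategy is retrieved according to the correspondence. In this embodiment, obtaining the target search identifiers in the to-be-processed search content may be: determining, according to the preset search identifier library, the search identifiers included in the to-be-processed search content. Alternatively, the symbols in the to-be-processed search content are obtained, target symbols are determined from among them according to the preset search identifier library, and the target symbols are taken as the target search identifiers.
The to-be-processed search content may include one, two, or more target search identifiers, which may be the same or different; identical target search identifiers correspond to the same search strategy.
S220. Determine the to-be-combined search strategy corresponding to each target search identifier.
After the target search identifiers are obtained, the search strategy corresponding to each target search identifier can be determined according to the correspondence table and taken as a to-be-combined search strategy. That is, the search strategy corresponding to each target search identifier can be taken as a to-be-combined search strategy.
If the multiple target search identifiers in the to-be-processed search content are the same, the corresponding to-be-combined search strategies are also the same; if the multiple target search identifiers include different identifiers, the corresponding to-be-combined search strategies may be the same or different.
In this embodiment, determining the to-be-combined search strategy corresponding to each target search identifier may be: pre-establishing the correspondence between search identifiers and search strategies and, after a target search identifier is obtained, determining the corresponding to-be-combined search strategy according to that correspondence. If there is one target search identifier, there may be one to-be-combined search strategy; if there are multiple target search identifiers, there may be multiple to-be-combined search strategies.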
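The identifier library and its correspondence to search strategies (S210 and S220) can be sketched as a lookup table. This is only an illustration: the `Strategy` enum, the `IDENTIFIER_LIBRARY` table, and in particular which symbol maps to which strategy (including the use of `*`) are assumptions for the sketch, not something the disclosure fixes.

```python
from enum import Enum


class Strategy(Enum):
    MERGE = "merge"          # search the joined string as a whole
    ASSOCIATE = "associate"  # search related word forms
    SPLIT = "split"          # search each part separately


# Hypothetical identifier library: the symbol-to-strategy assignment
# below is an assumption made for illustration.
IDENTIFIER_LIBRARY = {
    "'": Strategy.MERGE,
    "*": Strategy.ASSOCIATE,
    "-": Strategy.SPLIT,
    ",": Strategy.SPLIT,
}


def find_target_identifiers(query: str) -> list[tuple[int, str, Strategy]]:
    """Return (position, identifier, strategy) for each preset identifier found."""
    return [
        (i, ch, IDENTIFIER_LIBRARY[ch])
        for i, ch in enumerate(query)
        if ch in IDENTIFIER_LIBRARY
    ]
```

Because the mapping is a plain dictionary, identical identifiers always resolve to the same strategy, while different identifiers may share one (here `-` and `,` both map to splitting).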
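S230. Generate, based on each to-be-combined search strategy, the target search strategy corresponding to the to-be-processed search content.
The target search strategy is composed of the to-be-combined search strategies. When there is one to-be-combined search strategy, there is one target search strategy; if there are multiple to-be-combined search strategies, the target search strategy includes the multiple to-be-combined search strategies.
S240. Determine, according to the at least one target search identifier, the to-be-converted content in the to-be-processed search content, process the to-be-converted content according to the target search strategy, and determine the to-be-replaced content corresponding to the to-be-converted content.
The to-be-converted content is the content determined according to the target search identifier. The to-be-replaced content is the content obtained after the to-be-converted content is processed according to the target search strategy.
If there are multiple target search identifiers, each can be processed to determine its corresponding search strategy and to-be-converted content; the to-be-replaced content corresponding to the to-be-converted content is then determined from the to-be-converted content and its corresponding search strategy.
In this embodiment, the target search strategy includes at least one of the following: a search strategy of merging part of the character strings in the to-be-processed search content for searching; a search strategy of determining associated content of part of the character strings in the to-be-processed search content and searching based on the associated content; and a search strategy of splitting part of the character strings in the to-be-processed search content for searching.
That is, a to-be-combined search strategy may be any one of the above search strategies, and the target search strategy is one or more of them. The part of the character strings is determined based on the target search identifier in the to-be-processed search content.
The merge search strategy can be understood as a strategy in which, after character strings are determined according to the target search identifier, the character strings and the target search identifier are searched as a whole. The strategy of searching based on associated content may be one in which, after a character string is determined according to the target search identifier, content associated with that character string is determined and the search is performed based on that associated content. The split search strategy refers to taking each character string determined according to the target search identifier as to-be-replaced content.
In this embodiment, the corresponding target search strategy and to-be-converted content can be determined according to the target search identifier, and the to-be-replaced content is then generated from the to-be-converted content, so that the target search content is generated from the to-be-replaced content and the to-be-processed search content.
Search identifiers can be divided into at least three classes, namely first, second, and third target search identifiers, and the corresponding search strategies can likewise be divided into at least three classes. If the to-be-processed search content includes several classes of search identifiers, the search strategy corresponding to each class can be determined separately, the to-be-replaced content is then determined, and the target search content is generated from the to-be-replaced content.
In this embodiment, the to-be-converted content and the to-be-replaced content in the to-be-processed search content can be determined according to the target search identifier in at least one of the following ways, which are the specific implementations of the three classes of search strategies above.
The first way may be: if the target search identifier includes a first target search identifier, determine, according to the first target search identifier, the character strings adjacent to it, and generate the to-be-converted content based on those character strings; generate the to-be-replaced content based on the to-be-converted content and the first target search identifier.
The first target search identifier may be an abbreviation marker. Optionally, if the to-be-processed search content includes “you’ll”, the target identifier may be “’”, and “’” can be taken as the first target search identifier. After the target identifier is determined, the character strings adjacent to it can be determined and taken as the to-be-converted content. For example, if the target identifier is “’”, the character strings adjacent to it are “you ll”, and “you ll” can be taken as the to-be-converted content. The to-be-replaced content is generated from the to-be-converted content and the first target search identifier. Because the target search strategy corresponding to the first target search identifier is the strategy of merging part of the character strings in the to-be-processed search content for searching, the to-be-converted content and the first target search identifier can be merged together as the to-be-replaced content, so the resulting to-be-replaced content may be “you’ll”.
In this way, the to-be-converted content in the to-be-processed search content can be determined, and the to-be-replaced content corresponding to it obtained.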
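The merge strategy for the first target search identifier can be sketched with a regular expression that captures the strings on either side of the identifier. The function name `merge_adjacent` and the choice of the ASCII apostrophe as the default identifier are assumptions for this sketch.

```python
import re


def merge_adjacent(query: str, identifier: str = "'") -> list[str]:
    """Join the strings adjacent to each `identifier` into one search term."""
    # Capture the run of word characters directly before and after the identifier.
    pattern = re.compile(r"(\w+)" + re.escape(identifier) + r"(\w+)")
    results = []
    for left, right in pattern.findall(query):
        to_convert = (left, right)                # e.g. ("you", "ll")
        to_replace = identifier.join(to_convert)  # e.g. "you'll"
        results.append(to_replace)
    return results
```

Searching for the merged term as a whole avoids matching the fragments "you" and "ll" separately.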
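The second way may be: if the target search identifier includes a second target search identifier, obtain the target character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content, and take the target character string as the to-be-converted content; determine at least one associated word corresponding to the target character string, and determine the to-be-replaced content based on the at least one associated word.
The target search strategy corresponding to the second target search identifier may be the strategy of determining associated content of part of the character strings in the to-be-processed search content and searching based on it. The second target search identifier is preset; if a second target search identifier is detected, the character string that precedes and is adjacent to it can be obtained as the target character string, which can be taken as the to-be-converted content. Because the search strategy corresponding to the second target search identifier obtains associated content of part of the character strings, the associated content may be content obtained by applying at least one form change to the target character string, optionally a change of tense or of singular/plural form. Words associated with the target character string can be taken as the to-be-replaced content.
When the to-be-processed search content is detected to include a second target search identifier, the character string that precedes and is adjacent to the second target search identifier in the to-be-processed search content can be obtained as the target character string and taken as the to-be-converted content. Whether the target character string has associated content can then be checked; optionally, the associated content may be the words corresponding to the target character string in different tenses, its singular and plural forms, and so on. If the target character string has associated content, the associated content of the to-be-converted content is determined and taken as the to-be-replaced content.
In this embodiment, the associated words, that is, the associated content, include at least one of: derivatives of the target character string, character strings in multiple tenses corresponding to the target character string, and the singular and plural forms of the target character string.
A derivative may be a word determined by taking the target character string as a root. Tenses include present, future, and past, and the word corresponding to the target character string in each tense can be determined. Whether the target character string has singular and plural forms can also be determined; if it does, those forms are also associated words.
Whether the target character string has associated words can be determined; if it does, the associated words can be obtained according to the above principles and taken as the to-be-replaced content.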
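A rough sketch of the associated-word strategy follows. The disclosure does not say how associated words are derived or which symbol serves as the second target search identifier; the `*` marker and the naive English suffix rules below are assumptions for illustration, and a real system would more likely consult a morphological dictionary or lemmatizer.

```python
def associated_words(target: str) -> list[str]:
    """Naive English variants of `target` (plural, past tense, -ing form)."""
    if target.endswith("e"):
        return [target, target + "s", target + "d", target[:-1] + "ing"]
    suffix_s = "es" if target.endswith(("s", "x", "ch", "sh")) else "s"
    return [target, target + suffix_s, target + "ed", target + "ing"]


def expand_associated(query: str, identifier: str = "*") -> list[str]:
    """Expand the string immediately before the first `identifier`."""
    idx = query.find(identifier)
    # No identifier, or nothing adjacent in front of it: nothing to expand.
    if idx <= 0 or query[idx - 1].isspace():
        return []
    target = query[:idx].split()[-1]  # the to-be-converted content
    return associated_words(target)   # the to-be-replaced content
```

The suffix rules cover only regular forms; irregular words such as "go" and "went" would need a lookup table.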
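The third way may be: if the target search identifier includes a third target search identifier, determine, according to the third target search identifier, the character strings adjacent to it, and generate the to-be-converted content based on those character strings; take each character string in the to-be-converted content as to-be-replaced content.
The target search strategy corresponding to the third target search identifier may be the strategy of splitting part of the character strings in the to-be-processed search content for searching. The part of the character strings is determined based on the third target search identifier, which is the identifier indicating that part of the character strings should be split for searching; optionally, the third identifier may be “,、-” and the like. If the to-be-processed search content is detected to include a third target search identifier, the character strings adjacent to it are determined. Because the corresponding search strategy is the string-splitting strategy, the character strings determined at this point can each be taken as to-be-converted content.
For example, if the to-be-processed search content includes “A-B” and the search identifier found in it is “-”, the target search identifier is determined to be “-”. The target search strategy corresponding to “-” is the strategy of splitting part of the character strings for searching, so the character strings adjacent to the target search identifier can be obtained; optionally, the adjacent character strings obtained are “A” and “B”, that is, the character strings “A” and “B” are the to-be-converted content. “A” and “B” in the to-be-converted content can each be taken as to-be-replaced content.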
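The split strategy in the “A-B” example can be sketched in a few lines. The function name `split_strategy` and the default identifier set `"-,"` are assumptions for this sketch.

```python
import re


def split_strategy(query: str, identifiers: str = "-,") -> list[str]:
    """Split `query` at any of `identifiers`; each part becomes a search term."""
    # Build a character class from the identifiers, escaping "-" and the like.
    parts = re.split("[" + re.escape(identifiers) + "]", query)
    return [p.strip() for p in parts if p.strip()]
```

A query that contains none of the identifiers is returned unchanged as a single term, so the function is safe to call unconditionally.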
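S250. Generate the target search content based on the to-be-replaced content and the to-be-processed search content.
If the to-be-processed search content includes one target search identifier, at least one piece of to-be-replaced content corresponding to that identifier can be determined, and both the to-be-replaced content and the to-be-processed search content can be taken as search conditions within the target search content. If the to-be-processed search content includes at least two target search identifiers, the to-be-converted content corresponding to each target search identifier, and at least one piece of to-be-replaced content corresponding to each piece of to-be-converted content, can be determined according to the corresponding search strategies. Each piece of to-be-converted content is replaced with its to-be-replaced content to obtain the target search content.
Optionally, the to-be-converted content in the to-be-processed search content is replaced with the to-be-replaced content to obtain at least one piece of target sub-search content; the target search content is generated based on the at least one piece of target sub-search content.
There may be more than one piece of to-be-replaced content corresponding to the to-be-processed search content. The to-be-replaced content can be substituted at the corresponding position in the to-be-processed search content, optionally in place of the corresponding to-be-converted content, and the to-be-processed search content obtained after the substitution is taken as one piece of target sub-search content within the target search content. That is, the target search content may include multiple pieces of target sub-search content, each obtained by substituting to-be-replaced content for the to-be-converted content in the to-be-processed search content. The number of pieces of target sub-search content corresponds to the number of pieces of to-be-replaced content.
The to-be-replaced content corresponding to each piece of to-be-converted content can be obtained, and the to-be-converted content is replaced with the corresponding to-be-replaced content to obtain at least one piece of target sub-search content within the target search content. The set of the at least one piece of target sub-search content is the target search content.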
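The substitution step of S250 can be sketched as follows for a single piece of to-be-converted content. The function name and the decision to keep the original query as the first search condition are assumptions for the sketch, loosely following the single-identifier case described above.

```python
def build_target_search_content(
    query: str, to_convert: str, replacements: list[str]
) -> list[str]:
    """Substitute each replacement for `to_convert`, yielding sub-search contents."""
    subs = [query]  # the original query is kept as one search condition
    for rep in replacements:
        candidate = query.replace(to_convert, rep, 1)
        if candidate not in subs:  # avoid duplicate search conditions
            subs.append(candidate)
    return subs
```

With several identifiers in one query, the same substitution could be applied once per piece of to-be-converted content and the results combined.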
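S260. According to the target search content, search the text information for content exactly identical to the target search content to obtain the target content.
After the target search content is determined, content exactly identical to the target search content can be obtained from the text information and taken as the target content.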
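The exact-match search of S260 can be sketched as a scan for every occurrence of each piece of target sub-search content. The function name and the `(offset, term)` result shape are assumptions for this sketch; the disclosure does not prescribe a return format.

```python
def find_target_content(text: str, sub_queries: list[str]) -> list[tuple[int, str]]:
    """Return sorted (offset, term) pairs for every exact occurrence in `text`."""
    hits = set()
    for q in sub_queries:
        if not q:
            continue  # an empty term would match at every offset
        start = text.find(q)
        while start != -1:
            hits.add((start, q))
            start = text.find(q, start + 1)
    return sorted(hits)
```

Note that this is plain substring matching, so "walk" also matches inside "walks"; a caller that wants whole-word hits would need to check the surrounding characters.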
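In the technical solution of this embodiment of the present disclosure, the target search identifier in the to-be-processed search content is obtained and the corresponding target search strategy is determined; the to-be-processed search content is then refined and enriched based on the target search strategy to obtain the target search content. When the corresponding target content is searched for in the text information based on the target search content, the comprehensiveness and richness of the target content are improved.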
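On the basis of the above technical solution, the method further includes: determining the timestamp of each piece of target content on the time axis corresponding to the multimedia data stream, and marking the position on the time axis corresponding to the timestamp.
The time axis is the time axis corresponding to the multimedia data stream; optionally, if the total duration of the multimedia data stream is 50 min, the time axis corresponding to it also spans 50 min.
After the target content is determined, the timestamp corresponding to the target content can be determined from the sentence to which the target content belongs. After the timestamp is determined, its position on the time axis can be determined and marked; for example, the position can be marked with a dot or a triangle below the time axis, see FIG. 3.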
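Placing a mark on the time axis amounts to scaling each timestamp by the ratio of the track width to the total duration. The sketch below assumes millisecond timestamps and a pixel-based track; the function name and parameters are hypothetical.

```python
def marker_offsets(
    timestamps_ms: list[int], duration_ms: int, track_width_px: int
) -> list[int]:
    """Map each timestamp to a horizontal pixel offset on the time axis."""
    if duration_ms <= 0:
        raise ValueError("duration_ms must be positive")
    offsets = []
    for t in timestamps_ms:
        clamped = min(max(t, 0), duration_ms)  # keep marks inside the track
        offsets.append(round(clamped / duration_ms * track_width_px))
    return offsets
```

For the 50 min stream in the example above (3,000,000 ms) drawn on a 600 px track, a sentence starting at 25 min would be marked at the midpoint of the track.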
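On the basis of the above technical solution, marking the position on the time axis corresponding to the timestamp includes: among multiple controls in one-to-one correspondence with all timestamps on the time axis, determining the position of the control corresponding to the timestamp, and marking that position; wherein the control is used to adjust the audio and video frames of the multimedia data stream. See FIG. 3: the position of the control is the position of the dot mark.
For example, referring to FIG. 3, the search content edited by the user in the search content editing control is “algorithm”. Target content identical to “algorithm” can be found in the text information and displayed distinctively, for example highlighted; the timestamp of the sentence to which the target content belongs is determined, and a mark, for example a dot, is placed on the time axis corresponding to the multimedia data stream based on the timestamp. The color, size, and other properties of the mark can be set by the user as needed and are not limited here.
The search content editing control can display the number of pieces of target content; for example, the total number displayed in the search content editing control is 12, see FIG. 3.
In this embodiment, the benefit of marking on the time axis the audio and video frames corresponding to the target content is that the user can clearly determine, from the marks on the time axis, the position of the target content in the multimedia data stream, which makes it more convenient to find the corresponding target content.
There may be more than one piece of target content and, correspondingly, more than one mark on the time axis; referring to FIG. 3, there are 12 pieces of target content and 12 marks on the time axis. To help the user determine which of the pieces of target content is currently triggered, the search content editing control also displays the ordinal position of the currently triggered target content.
On the basis of the above technical solution, the method further includes: when a piece of target content is detected to be triggered, determining the target timestamp corresponding to that target content; and displaying the mark corresponding to the target timestamp distinctively.
The user can trigger any piece of target content. When it is detected that the user triggers target content, the timestamp corresponding to the triggered target content (the target timestamp) can be determined, the target mark corresponding to the target timestamp on the time axis can be determined, and the target mark is displayed differently from the other marks on the time axis so that it stands out. For example, the target mark and the other marks are displayed in different colors.
For example, referring to FIG. 4, when the user triggers the target content corresponding to mark 1, the target timestamp corresponding to that target content can be determined; according to the target timestamp, the corresponding mark on the time axis can be determined to be the mark corresponding to mark 2, and that mark can be highlighted.
In this embodiment, the benefit of distinctively displaying, on the time axis, the mark corresponding to the target content when the target content is triggered is that the user can see where the triggered target content lies in the multimedia data stream, which improves the accuracy with which the user determines the audio and video frames corresponding to the target content.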
实施例三
图5为本公开实施例三所提供的一种搜索目标内容的装置的结构示意图。如图5所示,所述装置包括:目标搜索策略确定模块310、目标搜索内容确定模块320以及目标内容确定模块330。
其中,目标搜索策略确定模块310,设置为确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略;目标搜索内容确定模块320,设置为根据所述目标搜索标识、目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容;目标内容确定模块330,设置为从文本信息中搜索出与所述目标搜索内容相对应的目标内容。
本公开实施例的技术方案,通过获取待处理搜索内容中的目标搜索标识,并确定与目标搜索标识相对应的目标搜索策略,基于目标搜索策略对待处理搜索内容进行优化,即优化搜索条件,基于优化后的搜索条件从文本信息中筛选出相应的目标内容,提高了确定的目标内容的全面性以及准确性。
在上述技术方案的基础上,目标搜索策略确定模块310,包括:
目标搜索标识确定单元,设置为获取所述待处理搜索内容中的至少一个目标搜索标识;待组合搜索策略确定单元,设置为确定与每个目标搜索标识对应 的待组合搜索策略;目标搜索策略确定单元,设置为基于每个待组合搜索策略,生成与所述待处理搜索内容相对应的目标搜索策略。
在上述技术方案的基础上,所述目标搜索策略包括以下至少一种:将待处理搜索内容中的部分字符串进行合并搜索的搜索策略;确定待处理搜索内容中部分字符串的关联内容,并基于关联内容进行搜索的搜索策略;将待处理搜索内容中的部分字符串拆分搜索的搜索策略;其中,所述部分字符串是基于待处理搜索内容中的目标搜索标识来确定的
在上述技术方案的基础上,目标搜索内容确定模块320,设置为根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容;基于所述待替换内容以及所述待处理搜索内容,生成所述目标搜索内容。
在上述技术方案的基础上,所述目标搜索策略包括将待处理搜索内容中的部分字符串进行合并搜索的搜索策略;所述目标搜索内容确定模块320,包括:第一待转换内容确定单元,设置为若所述至少一个目标搜索标识包括第一目标搜索标识,根据所述第一目标搜索标识,确定与所述第一目标搜索标识相邻的字符串,基于所述字符串生成所述待转换内容;第一待替换内容确定单元,设置为基于所述待转换内容以及所述第一目标搜索标识,生成所述待替换内容。
在上述技术方案的基础上,所述目标搜索策略包括确定待处理搜索内容中部分字符串的关联内容,并基于关联内容进行搜索的搜索策略;所述目标搜索内容确定模块320,包括:第二待转换内容确定单元,设置为若所述至少一个目标搜索标识包括第二目标搜索标识,获取所述待处理搜索内容中的第二目标搜索标识之前并与所述第二目标搜索标识相邻的目标字符串,并将所述目标字符串作为待替换内容;第二待替换内容确定单元,设置为确定与所述待转换内容相对应的至少一个关联词,基于所述至少一个关联词确定待替换内容。
在上述技术方案的基础上,所述至少一个关联词包括所述目标字符串的衍生词、与所述目标字符串对应的多种时态下的字符串、目标字符串的单数形式以及复数形式中的至少一种。
在上述技术方案的基础上,所述目标搜索策略包括将待处理搜索内容中的部分字符串拆分搜索的搜索策略;所述目标搜索策略确定模块310,包括:第三待转换内容确定单元,设置为若所述至少一个目标搜索标识中包括第三目标搜索标识,根据所述第三目标搜索标识,确定与所述第三目标搜索标识相邻的字符串,基于所述字符串生成所述待转换内容;第三待替换内容确定单元,设置为将所述待转换内容中的每个字符串作为待替换内容。
在上述技术方案的基础上,目标搜索内容确定模块320,包括:
目标子搜索内容确定单元,设置为将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到至少一个目标子搜索内容;目标搜索内容确定单元,设置为基于所述至少一个目标子搜索内容,生成所述目标搜索内容。
在上述技术方案的基础上,目标内容确定模块330,设置为根据所述目标搜索内容,从所述文本信息中搜索出与所述目标搜索内容完全相同的内容,得到所述目标内容。
在上述技术方案的基础上,将所述目标内容区别显示在所述文本信息中。
在上述技术方案的基础上,所述装置还包括:
语音信息确定模块,设置为基于多媒体数据流,确定语音信息;文本信息确定模块,设置为根据所述语音信息、与所述语音信息对应的原始语种类型以及目标翻译语种类型,生成显示在目标页面上与所述目标翻译语种类型相对应的文本信息。
在上述技术方案的基础上,所述装置还包括:显示模块,设置为在与所述多媒体数据流相对应的时间轴上显示所述目标内容的标识。
在上述技术方案的基础上,所述装置还包括:时间戳同步关联关系确定模块,设置为建立所述文本信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述文本信息和所述多媒体数据流显示在目标页面上,以在检测到一目标内容被触发时,将所述多媒体数据流跳转到与所述一目标内容所对应的视频帧。
在上述技术方案的基础上,所述装置还包括:当前时间戳确定单元,设置为若检测到一目标内容被触发,确定所述一目标内容的当前时间戳;视频帧确定单元,设置为基于预先建立的时间戳的同步关联关系以及所述当前时间戳,将多媒体数据流跳转至所述当前时间戳对应的视频帧。
本公开实施例所提供的搜索目标内容的装置可执行本公开任意实施例所提供的搜索目标内容的方法,具备执行方法相应的功能模块和效果。
上述装置所包括的多个单元和模块只是按照功能逻辑进行划分的,但并不局限于上述的划分,只要能够实现相应的功能即可;另外,功能单元的名称也只是为了便于相互区分,并不用于限制本公开实施例的保护范围。
实施例四
下面参考图6,其示出了适于用来实现本公开实施例的电子设备(例如图6中的终端设备或服务器)400的结构示意图。本公开实施例中的终端设备可以包 括但不限于诸如移动电话、笔记本电脑、数字广播接收器、个人数字助理(Personal Digital Assistant,PDA)、平板电脑(PAD)、便携式多媒体播放器(Portable Media Player,PMP)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字(Television,TV)、台式计算机等等的固定终端。图6示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。
如图6所示,电子设备400可以包括处理装置(例如中央处理器、图形处理器等)401,其可以根据存储在只读存储器(Read-Only Memory,ROM)402中的程序或者从存储装置408加载到随机访问存储器(Random Access Memory,RAM)403中的程序而执行多种适当的动作和处理。在RAM403中,还存储有电子设备400操作所需的多种程序和数据。处理装置401、ROM402以及RAM403通过总线404彼此相连。输入/输出(Input/Output,I/O)接口405也连接至总线404。
通常,以下装置可以连接至I/O接口405:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置406;包括例如液晶显示器(Liquid Crystal Display,LCD)、扬声器、振动器等的输出装置407;包括例如磁带、硬盘等的存储装置408;以及通信装置409。通信装置409可以允许电子设备400与其他设备进行无线或有线通信以交换数据。虽然图6示出了具有多种装置的电子设备400,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。
根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置409从网络上被下载和安装,或者从存储装置408被安装,或者从ROM402被安装。在该计算机程序被处理装置401执行时,执行本公开实施例的方法中限定的上述功能。
本公开实施例提供的电子设备与上述实施例提供的搜索目标内容的方法属于同一构思,未在本实施例中详尽描述的技术细节可参见上述实施例,并且本实施例与上述实施例具有相同的效果。
实施例五
本公开实施例提供了一种计算机存储介质,其上存储有计算机程序,该程 序被处理器执行时实现上述实施例所提供的搜索目标内容的方法。
本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、RAM、ROM、可擦式可编程只读存储器(Erasable Programmable Read-Only Memory,EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。
在一些实施方式中,客户端、服务器可以利用诸如超文本传输协议(HyperText Transfer Protocol,HTTP)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(Local Area Network,LAN),广域网(Wide Area Network,WAN),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:
确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略;根据所述目标搜索标识、目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容;从文本信息中搜索出与所述目标搜索内容相对应的目标内容。
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的 计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括LAN或WAN—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。
附图中的流程图和框图,图示了按照本公开多种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元/模块的名称在一种情况下并不构成对该单元本身的限定,例如,目标搜索策略确定模块还可以被描述为“搜索策略确定模块”。
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(Field Programmable Gate Array,FPGA)、专用集成电路(Application Specific Integrated Circuit,ASIC)、专用标准产品(Application Specific Standard Parts,ASSP)、片上系统(System on Chip,SOC)、复杂可编程逻辑设备(Complex Programmable Logic Device,CPLD)等等。
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、RAM、 ROM、EPROM或快闪存储器、光纤、CD-ROM、光学储存设备、磁储存设备、或上述内容的任何合适组合。
根据本公开的一个或多个实施例,【示例一】提供了一种搜索目标内容的方法,该方法包括:
确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略;根据所述目标搜索标识、所述目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容;从文本信息中搜索出与所述目标搜索内容相对应的目标内容。
根据本公开的一个或多个实施例,【示例二】提供了一种搜索目标内容的方法,还包括:
可选的,所述确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略,包括:
获取所述待处理搜索内容中的至少一个目标搜索标识;确定与每个目标搜索标识对应的待组合搜索策略;基于每个待组合搜索策略,生成与所述待处理搜索内容相对应的目标搜索策略。
根据本公开的一个或多个实施例,【示例三】提供了一种搜索目标内容的方法,还包括:
可选的,所述目标搜索策略包括以下至少一种:
将所述待处理搜索内容中的部分字符串进行合并搜索的搜索策略;确定所述待处理搜索内容中部分字符串的关联内容,并基于关联内容进行搜索的搜索策略;将所述待处理搜索内容中的部分字符串拆分搜索的搜索策略;其中,所述部分字符串是基于所述待处理搜索内容中的目标搜索标识来确定的。
根据本公开的一个或多个实施例,【示例四】提供了一种搜索目标内容的方法,还包括:
可选的,所述根据所述目标搜索标识、所述目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容,包括:
根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容;基于所述待替换内容以及所述待处理搜索内容,生成所述目标搜索内容。
根据本公开的一个或多个实施例,【示例五】提供了一种搜索目标内容的 方法,还包括:
可选的,所述目标搜索策略包括将所述待处理搜索内容中的部分字符串进行合并搜索的搜索策略;所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
若所述至少一个目标搜索标识包括第一目标搜索标识,根据所述第一目标搜索标识,确定与所述第一目标搜索标识相邻的字符串,基于所述相邻的字符串生成所述待转换内容;基于所述待转换内容以及所述第一目标搜索标识,生成所述待替换内容。
根据本公开的一个或多个实施例,【示例六】提供了一种搜索目标内容的方法,还包括:
可选的,所述目标搜索策略包括确定所述待处理搜索内容中部分字符串的关联内容,并基于关联内容进行搜索的搜索策略;所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
若所述至少一个目标搜索标识包括第二目标搜索标识,获取所述待处理搜索内容中的所述第二目标搜索标识之前并与所述第二目标搜索标识相邻的目标字符串,并将所述目标字符串作为所述待替换内容;确定与所述待转换内容相对应的至少一个关联词,基于所述至少一个关联词确定所述待替换内容。
根据本公开的一个或多个实施例,【示例七】提供了一种搜索目标内容的方法,还包括:
可选的,所述至少一个关联词包括所述目标字符串的衍生词、与所述目标字符串对应的多种时态下的字符串、所述目标字符串的单数形式以及复数形式中的至少一种。
根据本公开的一个或多个实施例,【示例八】提供了一种搜索目标内容的方法,还包括:
可选的,所述目标搜索策略包括将所述待处理搜索内容中的部分字符串拆分搜索的搜索策略;所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
若所述至少一个目标搜索标识中包括第三目标搜索标识,根据所述第三目标搜索标识,确定与所述第三目标搜索标识相邻的字符串,基于所述相邻的字符串生成所述待转换内容;将所述待转换内容中的每个字符串作为所述待替换 内容。
根据本公开的一个或多个实施例,【示例九】提供了一种搜索目标内容的方法,还包括:
可选的,所述基于所述待替换内容以及所述待处理搜索内容,生成所述目标搜索内容,包括:
将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到至少一个目标子搜索内容;基于所述至少一个目标子搜索内容,生成所述目标搜索内容。
根据本公开的一个或多个实施例,【示例十】提供了一种搜索目标内容的方法,还包括:
可选的,所述从文本信息中搜索出与所述目标搜索内容相对应的目标内容,包括:
根据所述目标搜索内容,从所述文本信息中搜索出与所述目标搜索内容完全相同的内容,得到所述目标内容。
根据本公开的一个或多个实施例,【示例十一】提供了一种搜索目标内容的方法,还包括:
可选的,将所述目标内容区别显示在所述文本信息中。
根据本公开的一个或多个实施例,【示例十二】提供了一种搜索目标内容的方法,其中,所述文本信息基于多媒体数据流的音频信息确定得到。
根据本公开的一个或多个实施例,【示例十三】提供了一种搜索目标内容的方法,其中,所述文本信息包括翻译文本信息;所述翻译文本信息根据所述音频信息、与所述音频信息对应的原始语种类型以及目标翻译语种类型生成。
根据本公开的一个或多个实施例,【示例十四】提供了一种搜索目标内容的方法,还包括:
可选的,在与所述多媒体数据流相对应的时间轴上显示所述目标内容的标识。
根据本公开的一个或多个实施例,【示例十五】提供了一种搜索目标内容的方法,还包括:
可选的,在得到所述文本信息之后,还包括:
建立所述文本信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述文本信息和所述多媒体数据流显示在所述目标页面上,以在检测到一目标 内容被触发时,将所述多媒体数据流跳转到与所述一目标内容对应的视频帧。
根据本公开的一个或多个实施例,【示例十六】提供了一种搜索目标内容的方法,还包括:
可选的,所述在检测到一目标内容被触发的情况下,将所述多媒体数据流跳转到与所述一目标内容对应的视频帧,包括:
若检测到一目标内容被触发,确定所述一目标内容的当前时间戳;基于预先建立的所述时间戳的同步关联关系以及所述当前时间戳,将多媒体数据流跳转至所述当前时间戳对应的视频帧。
根据本公开的一个或多个实施例,【示例十七】提供了一种搜索目标内容的方法,还包括:
可选的,所述在与所述多媒体数据流相对应的时间轴上显示所述目标内容的标识,包括:
确定每个目标内容在所述多媒体数据流相对应的时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置处进行标记。
根据本公开的一个或多个实施例,【示例十八】提供了一种搜索目标内容的方法,还包括:
可选的,所述在所述时间轴上与所述时间戳对应的位置处进行标记,包括:
在与所述时间轴中的所有时间戳一一对应的多个控制控件中,确定与所述时间戳对应的控制控件的位置,并在所述位置进行标记;其中,所述控制控件用于调整所述多媒体数据流的音视频帧。
根据本公开的一个或多个实施例,【示例十九】提供了一种搜索目标内容的方法,还包括:
可选的,当检测到一目标内容被触发时,确定与所述一目标内容相对应的目标时间戳;将与所述目标时间戳对应的标记进行区别显示。
根据本公开的一个或多个实施例,【示例二十】提供了一种搜索目标内容的装置,该装置包括:
目标搜索策略确定模块,设置为确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略;目标搜索内容确定模块,设置为根据所述目标搜索标识、所述目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容;目标内容确定模块,设置为从文本信息中搜索出与所述目标搜索内容相对应的目标内容。
此外,虽然采用特定次序描绘了多个操作,但是这不应当理解为要求这些 操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了多个实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的一些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的多种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。

Claims (22)

  1. 一种搜索目标内容的方法,包括:
    确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略;
    根据所述目标搜索标识、所述目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容;
    从文本信息中搜索出与所述目标搜索内容相对应的目标内容。
  2. 根据权利要求1所述的方法,其中,所述确定待处理搜索内容中的目标搜索标识,并确定与所述目标搜索标识对应的目标搜索策略,包括:
    获取所述待处理搜索内容中的至少一个目标搜索标识;
    确定与每个目标搜索标识对应的待组合搜索策略;
    基于每个待组合搜索策略,生成与所述待处理搜索内容相对应的目标搜索策略。
  3. 根据权利要求1-2中任一项所述的方法,其中,所述目标搜索策略包括以下至少一种:
    将所述待处理搜索内容中的部分字符串进行合并搜索的搜索策略;
    确定所述待处理搜索内容中部分字符串的关联内容,并基于所述关联内容进行搜索的搜索策略;
    将所述待处理搜索内容中的部分字符串拆分搜索的搜索策略;
    其中,所述部分字符串是基于所述待处理搜索内容中的目标搜索标识来确定的。
  4. 根据权利要求2所述的方法,其中,所述根据所述目标搜索标识、所述目标搜索策略以及所述待处理搜索内容,生成与所述待处理搜索内容相对应的目标搜索内容,包括:
    根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容;
    基于所述待替换内容以及所述待处理搜索内容,生成所述目标搜索内容。
  5. 根据权利要求4所述的方法,其中,所述目标搜索策略包括将所述待处理搜索内容中的部分字符串进行合并搜索的搜索策略;
    所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容, 依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
    在所述至少一个目标搜索标识包括第一目标搜索标识的情况下,根据所述第一目标搜索标识,确定与所述第一目标搜索标识相邻的字符串,基于所述相邻的字符串生成所述待转换内容;
    基于所述待转换内容以及所述第一目标搜索标识,生成所述待替换内容。
  6. 根据权利要求4所述的方法,其中,所述目标搜索策略包括确定所述待处理搜索内容中部分字符串的关联内容,并基于所述关联内容进行搜索的搜索策略;
    所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
    在所述至少一个目标搜索标识包括第二目标搜索标识的情况下,获取所述待处理搜索内容中的所述第二目标搜索标识之前并与所述第二目标搜索标识相邻的目标字符串,并将所述目标字符串作为所述待转换内容;
    确定与所述待转换内容相对应的至少一个关联词,基于所述至少一个关联词确定所述待替换内容。
  7. 根据权利要求6所述的方法,其中,所述至少一个关联词包括所述目标字符串的衍生词、与所述目标字符串对应的多种时态下的字符串、所述目标字符串的单数形式以及复数形式中的至少一种。
  8. 根据权利要求4所述的方法,其中,所述目标搜索策略包括将所述待处理搜索内容中的部分字符串拆分搜索的搜索策略;
    所述根据每个目标搜索标识,确定所述待处理搜索内容中的待转换内容,依据所述目标搜索策略对所述待转换内容进行处理,确定与所述待转换内容相对应的待替换内容,包括:
    在所述至少一个目标搜索标识中包括第三目标搜索标识的情况下,根据所述第三目标搜索标识,确定与所述第三目标搜索标识相邻的字符串,基于所述相邻的字符串生成所述待转换内容;
    将所述待转换内容中的每个字符串作为所述待替换内容。
  9. 根据权利要求4所述的方法,其中,所述基于所述待替换内容以及所述待处理搜索内容,生成所述目标搜索内容,包括:
    将所述待处理搜索内容中的待转换内容替换为所述待替换内容,得到至少 一个目标子搜索内容;
    基于所述至少一个目标子搜索内容,生成所述目标搜索内容。
  10. 根据权利要求1所述的方法,其中,所述从文本信息中搜索出与所述目标搜索内容相对应的目标内容,包括:
    根据所述目标搜索内容,从所述文本信息中搜索出与所述目标搜索内容完全相同的内容,得到所述目标内容。
  11. 根据权利要求1所述的方法,还包括:
    将所述目标内容区别显示在所述文本信息中。
  12. 根据权利要求1所述的方法,其中,所述文本信息基于多媒体数据流的音频信息确定得到。
  13. 根据权利要求12所述的方法,其中,所述文本信息包括翻译文本信息;
    所述翻译文本信息根据所述音频信息、与所述音频信息对应的原始语种类型以及目标翻译语种类型生成。
  14. 根据权利要求12所述的方法,还包括:
    在与所述多媒体数据流相对应的时间轴上显示所述目标内容的标识。
  15. 根据权利要求12所述的方法,在得到所述文本信息之后,还包括:
    建立所述文本信息与所述多媒体数据流之间的时间戳同步关联关系,并将所述文本信息和所述多媒体数据流显示在所述目标页面上,以在检测到一目标内容被触发的情况下,将所述多媒体数据流跳转到与所述一目标内容对应的视频帧。
  16. 根据权利要求15所述的方法,其中,所述在检测到一目标内容被触发的情况下,将所述多媒体数据流跳转到与所述一目标内容对应的视频帧,包括:
    在检测到一目标内容被触发的情况下,确定所述一目标内容的当前时间戳;
    基于预先建立的所述时间戳的同步关联关系以及所述当前时间戳,将所述多媒体数据流跳转至所述当前时间戳对应的视频帧。
  17. 根据权利要求14所述的方法,其中,所述在与所述多媒体数据流相对应的时间轴上显示所述目标内容的标识,包括:
    确定每个目标内容在所述多媒体数据流相对应的时间轴中的时间戳,并在所述时间轴上与所述时间戳对应的位置处进行标记。
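  18. The method according to claim 17, wherein placing the mark at the position on the timeline corresponding to the timestamp comprises:
    determining, among a plurality of controls in one-to-one correspondence with all timestamps on the timeline, the position of the control corresponding to the timestamp, and placing the mark at that position;
    wherein the controls are used to adjust the audio and video frames of the multimedia data stream.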
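  19. The method according to claim 18, further comprising:
    in a case where it is detected that a piece of target content is triggered, determining a target timestamp corresponding to that piece of target content; and
    displaying the mark corresponding to the target timestamp distinctively.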
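  20. An apparatus for searching target content, comprising:
    a target search strategy determination module, configured to determine a target search identifier in search content to be processed, and to determine a target search strategy corresponding to the target search identifier;
    a target search content determination module, configured to generate target search content corresponding to the search content to be processed according to the target search identifier, the target search strategy, and the search content to be processed; and
    a target content determination module, configured to search text information for target content corresponding to the target search content.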
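  21. An electronic device, comprising:
    at least one processor; and
    a storage apparatus configured to store at least one program;
    wherein the at least one program, when executed by the at least one processor, causes the at least one processor to implement the method for searching target content according to any one of claims 1-19.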
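  22. A storage medium containing computer-executable instructions, wherein the computer-executable instructions, when executed by a computer processor, are used to perform the method for searching target content according to any one of claims 1-19.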
PCT/CN2021/115283 2020-09-29 2021-08-30 Method and apparatus for searching target content, electronic device, and storage medium WO2022068496A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP21874157.7A EP4206953A4 (en) 2020-09-29 2021-08-30 METHOD AND APPARATUS FOR SEARCHING TARGET CONTENT, ELECTRONIC DEVICE AND RECORDING MEDIUM
JP2023507867A JP2023536992A (ja) 2020-09-29 2021-08-30 Target content search method, apparatus, electronic device, and storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011056294.1 2020-09-29
CN202011056294.1A CN112163104B (zh) 2020-09-29 2020-09-29 Method and apparatus for searching target content, electronic device, and storage medium

Publications (1)

Publication Number Publication Date
WO2022068496A1 (zh) 2022-04-07

Family

ID=73861525

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/115283 WO2022068496A1 (zh) 2020-09-29 2021-08-30 Method and apparatus for searching target content, electronic device, and storage medium

Country Status (4)

Country Link
EP (1) EP4206953A4 (zh)
JP (1) JP2023536992A (zh)
CN (1) CN112163104B (zh)
WO (1) WO2022068496A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112163104B (zh) * 2020-09-29 2022-04-15 北京字跳网络技术有限公司 Method and apparatus for searching target content, electronic device, and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
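WO2012128419A1 (ko) * 2011-03-21 2012-09-27 주식회사 코난테크놀로지 Search system and search method for providing integrated multimedia content
CN103106220A (zh) * 2011-11-15 2013-05-15 阿里巴巴集团控股有限公司 Search method, search apparatus, and search engine system
CN108829765A (zh) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 Information query method and apparatus, computer device, and storage medium
CN110737677A (zh) * 2018-07-20 2020-01-31 武汉烽火众智智慧之星科技有限公司 Data search system and method
CN112163104A (zh) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 Method and apparatus for searching target content, electronic device, and storage medium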

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7680853B2 (en) * 2006-04-10 2010-03-16 Microsoft Corporation Clickable snippets in audio/video search results
CN102323937A (zh) * 2011-08-31 2012-01-18 百度在线网络技术(北京)有限公司 Method and device for providing search results
CN104572774B (zh) * 2013-10-28 2019-03-15 腾讯科技(深圳)有限公司 Search method and apparatus
US20170092277A1 (en) * 2015-09-30 2017-03-30 Seagate Technology Llc Search and Access System for Media Content Files
CN110019903A (zh) * 2017-10-10 2019-07-16 阿里巴巴集团控股有限公司 Method for generating image processing engine component, search method, terminal, and system
CN109246472A (zh) * 2018-08-01 2019-01-18 平安科技(深圳)有限公司 Video playback method and apparatus, terminal device, and storage medium
CN110362714B (zh) * 2019-07-25 2023-05-02 腾讯科技(深圳)有限公司 Video content search method and apparatus
CN111368178A (zh) * 2020-03-05 2020-07-03 北京云族佳科技有限公司 Information processing method and apparatus, and readable storage medium


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4206953A4

Also Published As

Publication number Publication date
CN112163104B (zh) 2022-04-15
CN112163104A (zh) 2021-01-01
EP4206953A4 (en) 2024-01-10
JP2023536992A (ja) 2023-08-30
EP4206953A1 (en) 2023-07-05


Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 21874157

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2023507867

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2021874157

Country of ref document: EP

Effective date: 20230328

NENP Non-entry into the national phase

Ref country code: DE