CN111930970A - News storage and search method based on video and voice recognition - Google Patents

News storage and search method based on video and voice recognition Download PDF

Info

Publication number
CN111930970A
CN111930970A CN202010782002.6A CN202010782002A CN111930970A CN 111930970 A CN111930970 A CN 111930970A CN 202010782002 A CN202010782002 A CN 202010782002A CN 111930970 A CN111930970 A CN 111930970A
Authority
CN
China
Prior art keywords
news
video
content
voice
storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010782002.6A
Other languages
Chinese (zh)
Inventor
王纲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongwei Digital Technology Shanghai Co ltd
Original Assignee
Tongwei Digital Technology Shanghai Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tongwei Digital Technology Shanghai Co ltd filed Critical Tongwei Digital Technology Shanghai Co ltd
Priority to CN202010782002.6A priority Critical patent/CN111930970A/en
Publication of CN111930970A publication Critical patent/CN111930970A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/41Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/432Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/45Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention discloses a news storage and search method based on video and voice recognition; the method comprises the following steps: s1, intercepting video and voice fragments; s2, storing the news content; s3, storing and returning; s4, searching the stored news; s5, returning the search content; according to the invention, news contents are respectively stored, so that files, videos, voices and the like are independently stored, storage disorder is avoided, and a one-to-many mapping mode is realized through addresses of news brief introduction, so that the searching speed is accelerated; the news content can be rapidly searched through various searching modes, videos and voice segments can be directly searched through the videos and the voice content when the news content is stored, the searching modes are various, the searching can be rapidly realized, and the mapping relation can prevent the news from being confused during searching.

Description

News storage and search method based on video and voice recognition
Technical Field
The invention belongs to the technical field of news storage and search, and particularly relates to a news storage and search method based on video and voice recognition.
Background
News, also called messages, is a term for information that is broadcast through media paths such as newspapers, radio stations, radio, television stations, and the like. It is a cultural relic for recording society, transmitting information and reflecting the era. The concept of news is broadly and narrowly defined, and in the broad sense, except for the comments and the texts published in newspapers, broadcasts, the internet, televisions, etc., the common texts belong to news columns including messages, communications, features, shorthand (some shorthand are included in the list of features), etc., the narrow news specifically refers to messages, and the messages are summarized and narrated to quickly and timely report the valuable facts newly occurring at home and abroad and to let others understand the facts. Each news item generally includes five parts, a title, a subject, a background, and a final. The first three are the main parts and the second two are the auxiliary parts. The description is mainly related to the writing, and sometimes includes discussion, description, and comment. The news is a news service platform containing mass information and truly reflects the important events at every moment. The latest progress of news events, hot topics, character dynamics, product information and the like can be quickly known by looking at the news events, the hot topics, the character dynamics, the product information and the like, and in the existing internet era, news needs to be stored and searched, however, various news storage numbers and search modes on the market still have various problems.
Although the method and the device for searching news videos disclosed by the authorized bulletin number CN101944111B effectively solve the problems of automatic, accurate and timely searching and integration of internet news videos, can quickly and accurately identify news video websites, and can automatically and timely find and integrate news videos, the method and the device do not solve the problems that the existing storage method is troublesome, a large number of files, videos, voices and the like are stored in the same place, huge burden is caused to searching, searching is too slow, storage is easy to be disordered, the existing searching method is too complicated, news contents need to be traversed, searching is slow, and the like.
Disclosure of Invention
The present invention is directed to a news storage and search method based on video and voice recognition to solve the problems set forth in the background art described above.
In order to achieve the purpose, the invention provides the following technical scheme: the news storage and search method based on video and voice recognition comprises the following steps:
s1, intercepting video and voice fragments: the method comprises the steps of carrying out fragment interception on video and voice in news and using the video and voice in the news as URL (uniform resource locator) for news storage;
s2, storing the news content: storing the news into a database by taking the intercepted video and voice segments in the S1 as a type of URL, and classifying and storing the news;
s3, storage return: after the storage of S2, the front page skips to display that the storage is successful, and automatically calls out the stored content for display;
s4, searching the stored news: inputting retrieval contents in a search box, and retrieving the storage contents in the database by a retrieval engine;
s5, returning search content: and displaying the detected tag content on a front-end page, clicking a tag of the retrieved news after the determined retrieved content is correct, sequentially and respectively extracting news text content, video content, voice content and picture content through a URL (uniform resource locator) corresponding to the news tag content, and displaying complete news content on the front-end page.
Preferably, the URL in S2 further includes a news brief, a news tag, a news title, and a news time, wherein the news brief is an introduction of news in a simple language by a news editor, and the news tag is a simple summary of news contents by the news editor.
Preferably, the classification in S2 includes positive news, neutral news, and negative news articles, which include super news, big news, and general news, respectively.
Preferably, the database in S2 includes a local database and a cloud database, and the cloud database realizes transmission connection through a data interface during storage and search.
Preferably, the search box in S4 includes a video file input box, a voice file input box, a news tag input box, a news name input box, a news time input box, and a news keyword input box in sequence.
Preferably, the steps of processing the video and the voice transmitted in the video file input box and the voice file input box are as follows:
s41: decomposing a video into pictures according to each frame, then carrying out gray processing on the pictures, carrying out edge processing on the pictures, and then carrying out feature extraction on the pictures;
s42: processing a voice file, converting the voice into characters, and analyzing the semantic meaning and the sound domain of the voice;
s43: news selected according to the news label, the news name, the news time and the news keyword is extracted and stored in a cache memory, then the news is respectively compared with files stored in a database through a video file and a voice file, news content is determined after comparison, and then the news content is displayed on a front-end page.
Preferably, the search mode of the search engine in S4 is one of an enumeration algorithm, a depth-first search, a breadth-first search, an a algorithm, a backtracking algorithm, a monte carlo tree search, a hash function, and the like.
Preferably, the URLs in S5 are addresses of news profiles, respectively, and the URLs correspond to news text content, video content, voice content, and picture content, respectively, through a one-to-many mapping relationship.
Preferably, the news text content, the video content, the voice content and the picture content are stored through a one-to-many mapping relation respectively, so that storage disorder and slow retrieval are avoided.
Compared with the prior art, the invention has the beneficial effects that:
(1) according to the invention, news contents are respectively stored, so that files, videos, voices and the like are independently stored, storage disorder is avoided, and a one-to-many mapping mode is realized through addresses of news brief introduction, so that the searching speed is accelerated;
(2) the invention realizes the rapid search of news contents through various search modes, intercepts the video and voice fragments during storage, and can directly search the news through the video and voice contents, so that the search modes are various, the search can be rapidly realized, the mapping relation can be used, and the news can be prevented from being disordered during the search.
Drawings
FIG. 1 is a schematic view of the step structure of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention provides a technical solution:
the first embodiment is as follows:
the news storage and search method based on video and voice recognition comprises the following steps:
s1, intercepting video and voice fragments: the method comprises the steps of carrying out fragment interception on video and voice in news and using the video and voice in the news as URL (uniform resource locator) for news storage;
s2, storing the news content: storing the news into a database by taking the intercepted video and voice segments in the S1 as a type of URL, and classifying and storing the news;
s3, storage return: after the storage of S2, the front page skips to display that the storage is successful, and automatically calls out the stored content for display;
s4, searching the stored news: inputting retrieval contents in a search box, and retrieving the storage contents in the database by a retrieval engine;
s5, returning search content: and displaying the detected tag content on a front-end page, clicking a tag of the retrieved news after the determined retrieved content is correct, sequentially and respectively extracting news text content, video content, voice content and picture content through a URL (uniform resource locator) corresponding to the news tag content, and displaying complete news content on the front-end page.
In order to implement the multiple ways of storing news for the subsequent search, in this embodiment, it is preferable that the URL in S2 further includes a news brief, a news tag, a news title, and a news time, the news brief is a news editor introducing news in a simple language, and the news tag is a news editor simply summarizing news contents.
In order to prevent the news from being confused during storage and cause the searching process to be too slow, in this embodiment, it is preferable that the classification in S2 includes positive news, neutral news, and negative news articles, which include super news, big news, and general news, respectively.
In order to implement storage and facilitate searching, in this embodiment, it is preferable that the database in S2 includes a local database and a cloud database, and the cloud database implements transmission connection through a data interface during storage and searching.
In order to perform rapid and simple various searches on news through multiple channels, in this embodiment, it is preferable that the search box in S4 sequentially include a video file input box, a voice file input box, a news tag input box, a news name input box, a news time input box, and a news keyword input box.
In order to implement processing of video and voice and further implement comparison and search of video and voice in news, in this embodiment, preferably, the steps of processing the video and voice transmitted in the video file input box and the voice file input box are as follows:
s41: decomposing a video into pictures according to each frame, then carrying out gray processing on the pictures, carrying out edge processing on the pictures, and then carrying out feature extraction on the pictures;
s42: processing a voice file, converting the voice into characters, and analyzing the semantic meaning and the sound domain of the voice;
s43: news selected according to the news label, the news name, the news time and the news keyword is extracted and stored in a cache memory, then the news is respectively compared with files stored in a database through a video file and a voice file, news content is determined after comparison, and then the news content is displayed on a front-end page.
In order to retrieve news content, in this embodiment, preferably, the search mode of the search engine in S4 is one of an enumeration algorithm, a depth-first search, a breadth-first search, an a algorithm, a backtracking algorithm, a monte carlo tree search, a hash function, and the like.
In order to realize fast detection, the addresses of the news content and the news brief are combined into a one-to-many mapping relationship, and the news content is classified and stored, in this embodiment, it is preferable that the URLs in S5 are the addresses of the news brief, respectively, and the URLs correspond to the news text content, the video content, the voice content, and the picture content through the one-to-many mapping relationship, respectively, and the storage of the news text content, the video content, the voice content, and the picture content through the one-to-many mapping relationship does not generate storage confusion and retrieval slowness.
Example two:
the news storage and search method based on video and voice recognition comprises the following steps:
s1, intercepting video and voice fragments: the method comprises the steps of carrying out fragment interception on video and voice in news and using the video and voice in the news as URL (uniform resource locator) for news storage;
s2, storing the news content: storing the news into a database by taking the intercepted video and voice segments in the S1 as a type of URL, and classifying and storing the news;
s3, storage return: after the storage of S2, the front page skips to display that the storage is successful, and automatically calls out the stored content for display;
s4, searching the stored news: inputting retrieval contents in a search box, and retrieving the storage contents in the database by a retrieval engine;
s5, returning search content: and displaying the detected tag content on a front-end page, clicking a tag of the retrieved news after the determined retrieved content is correct, sequentially and respectively extracting news text content, video content, voice content and picture content through a URL (uniform resource locator) corresponding to the news tag content, and displaying complete news content on the front-end page.
In order to implement the multiple ways of storing news for the subsequent search, in this embodiment, it is preferable that the URL in S2 further includes a news brief, a news tag, a news title, and a news time, the news brief is a news editor introducing news in a simple language, and the news tag is a news editor simply summarizing news contents.
In order to perform rapid and simple various searches on news through multiple channels, in this embodiment, it is preferable that the search box in S4 sequentially include a video file input box, a voice file input box, a news tag input box, a news name input box, a news time input box, and a news keyword input box.
In order to implement processing of video and voice and further implement comparison and search of video and voice in news, in this embodiment, preferably, the steps of processing the video and voice transmitted in the video file input box and the voice file input box are as follows:
s41: decomposing a video into pictures according to each frame, then carrying out gray processing on the pictures, carrying out edge processing on the pictures, and then carrying out feature extraction on the pictures;
s42: processing a voice file, converting the voice into characters, and analyzing the semantic meaning and the sound domain of the voice;
s43: news selected according to the news label, the news name, the news time and the news keyword is extracted and stored in a cache memory, then the news is respectively compared with files stored in a database through a video file and a voice file, news content is determined after comparison, and then the news content is displayed on a front-end page.
In order to retrieve news content, in this embodiment, preferably, the search mode of the search engine in S4 is one of an enumeration algorithm, a depth-first search, a breadth-first search, an a algorithm, a backtracking algorithm, a monte carlo tree search, a hash function, and the like.
In order to realize fast detection, the addresses of the news content and the news brief are combined into a one-to-many mapping relationship, and the news content is classified and stored, in this embodiment, it is preferable that the URLs in S5 are the addresses of the news brief, respectively, and the URLs correspond to the news text content, the video content, the voice content, and the picture content through the one-to-many mapping relationship, respectively, and the storage of the news text content, the video content, the voice content, and the picture content through the one-to-many mapping relationship does not generate storage confusion and retrieval slowness.
The working principle and the using process of the invention are as follows:
the first step, intercepting video and voice segments: the method comprises the steps of carrying out fragment interception on video and voice in news and using the video and voice in the news as URL (uniform resource locator) for news storage;
secondly, storing news contents: storing the news into a database by taking the intercepted video and voice segments in the S1 as a type of URL, and classifying and storing the news;
and step three, storage and return: after the storage of S2, the front page skips to display that the storage is successful, and automatically calls out the stored content for display;
fourthly, searching the stored news: inputting retrieval contents in a search box, and retrieving the storage contents in the database by a retrieval engine;
and step five, returning retrieval contents: and displaying the detected tag content on a front-end page, clicking a tag of the retrieved news after the determined retrieved content is correct, sequentially and respectively extracting news text content, video content, voice content and picture content through a URL (uniform resource locator) corresponding to the news tag content, and displaying complete news content on the front-end page.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (9)

1. The news storage and search method based on video and voice recognition is characterized in that: the method comprises the following steps:
s1, intercepting video and voice fragments: the method comprises the steps of carrying out fragment interception on video and voice in news and using the video and voice in the news as URL (uniform resource locator) for news storage;
s2, storing the news content: storing the news into a database by taking the intercepted video and voice segments in the S1 as a type of URL, and classifying and storing the news;
s3, storage return: after the storage in S2, the front page jumps to display that the storage is successful, and automatically retrieves the storage content for display.
S4, searching the stored news: inputting retrieval contents in a search box, and retrieving the storage contents in the database by a retrieval engine;
s5, returning search content: and displaying the detected tag content on a front-end page, clicking a tag of the retrieved news after the determined retrieved content is correct, sequentially and respectively extracting news text content, video content, voice content and picture content through a URL (uniform resource locator) corresponding to the news tag content, and displaying complete news content on the front-end page.
2. The video and speech recognition-based news storage and search method of claim 1, wherein: the URL in S2 further includes a news brief that the news editor introduces to the news in a simple language, a news tag that the news editor simply summarizes the news content, a news title, and a news time.
3. The video and speech recognition-based news storage and search method of claim 1, wherein: the classification in S2 includes positive news, neutral news, and negative news articles, which include super news, big news, and general news, respectively.
4. The video and speech recognition-based news storage and search method of claim 1, wherein: the database in S2 includes a local database and a cloud database, and the cloud database realizes transmission connection through a data interface during storage and search.
5. The video and speech recognition-based news storage and search method of claim 1, wherein: the search box in S4 includes a video file input box, a voice file input box, a news tag input box, a news name input box, a news time input box, and a news keyword input box in sequence.
6. The video and speech recognition-based news storage and search method of claim 5, wherein: the video and voice processing steps transmitted in the video file input box and the voice file input box are as follows:
s41: decomposing a video into pictures according to each frame, then carrying out gray processing on the pictures, carrying out edge processing on the pictures, and then carrying out feature extraction on the pictures;
s42: processing a voice file, converting the voice into characters, and analyzing the semantic meaning and the sound domain of the voice;
s43: news selected according to the news label, the news name, the news time and the news keyword is extracted and stored in a cache memory, then the news is respectively compared with files stored in a database through a video file and a voice file, news content is determined after comparison, and then the news content is displayed on a front-end page.
7. The video and speech recognition-based news storage and search method of claim 1, wherein: the search mode of the search engine in S4 is one of an enumeration algorithm, a depth-first search, a breadth-first search, an a algorithm, a backtracking algorithm, a monte carlo tree search, a hash function, and the like.
8. The video and speech recognition-based news storage and search method of claim 1, wherein: the URLs in S5 are addresses of news profiles, respectively, and correspond to news text contents, video contents, voice contents, and picture contents, respectively, through a one-to-many mapping relationship.
9. The video and speech recognition-based news storage and search method of claim 8, wherein: the news text content, the video content, the voice content and the picture content are stored through the one-to-many mapping relation respectively, so that storage disorder and slow retrieval are avoided.
CN202010782002.6A 2020-08-06 2020-08-06 News storage and search method based on video and voice recognition Pending CN111930970A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010782002.6A CN111930970A (en) 2020-08-06 2020-08-06 News storage and search method based on video and voice recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010782002.6A CN111930970A (en) 2020-08-06 2020-08-06 News storage and search method based on video and voice recognition

Publications (1)

Publication Number Publication Date
CN111930970A true CN111930970A (en) 2020-11-13

Family

ID=73307927

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010782002.6A Pending CN111930970A (en) 2020-08-06 2020-08-06 News storage and search method based on video and voice recognition

Country Status (1)

Country Link
CN (1) CN111930970A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101021855A (en) * 2006-10-11 2007-08-22 鲍东山 Video searching system based on content
US20080086476A1 (en) * 2006-10-04 2008-04-10 Theodore Jack London Shrader Method for providing news syndication discovery and competitive awareness
US20110202844A1 (en) * 2010-02-16 2011-08-18 Msnbc Interactive News, L.L.C. Identification of video segments
US9342599B2 (en) * 2011-05-25 2016-05-17 Thomas Stetson Elliott Methods and systems for centralized audio and video news product collection, optimization, storage, and distribution

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080086476A1 (en) * 2006-10-04 2008-04-10 Theodore Jack London Shrader Method for providing news syndication discovery and competitive awareness
CN101021855A (en) * 2006-10-11 2007-08-22 鲍东山 Video searching system based on content
US20110202844A1 (en) * 2010-02-16 2011-08-18 Msnbc Interactive News, L.L.C. Identification of video segments
CN102163212A (en) * 2010-02-16 2011-08-24 微软公司 Identification of video segments
US9342599B2 (en) * 2011-05-25 2016-05-17 Thomas Stetson Elliott Methods and systems for centralized audio and video news product collection, optimization, storage, and distribution

Similar Documents

Publication Publication Date Title
US8656264B2 (en) Dynamic aggregation and display of contextually relevant content
US11580181B1 (en) Query modification based on non-textual resource context
US8200649B2 (en) Image search engine using context screening parameters
US8713002B1 (en) Identifying media content in queries
WO2016150083A1 (en) Information input method and apparatus
CN106682147A (en) Mass data based query method and device
US20090144240A1 (en) Method and systems for using community bookmark data to supplement internet search results
US20040199495A1 (en) Name browsing systems and methods
US20030055810A1 (en) Front-end weight factor search criteria
US10839013B1 (en) Generating a graphical representation of relationships among a set of articles and information associated with the set of articles
US8965916B2 (en) Method and apparatus for providing media content
WO2010106642A1 (en) Search processing method and apparatus
US8086953B1 (en) Identifying transient portions of web pages
CN104025077A (en) Real-Time Natural Language Processing Of Datastreams
CN110888990A (en) Text recommendation method, device, equipment and medium
JP2011529600A (en) Method and apparatus for relating datasets by using semantic vector and keyword analysis
JP2015525929A (en) Weight-based stemming to improve search quality
US20150206101A1 (en) System for determining infringement of copyright based on the text reference point and method thereof
CN113297457B (en) High-precision intelligent information resource pushing system and pushing method
WO2014000130A1 (en) Method or system for automated extraction of hyper-local events from one or more web pages
CN109783599A (en) Knowledge mapping search method and system based on multi storage
US20090313558A1 (en) Semantic Image Collection Visualization
US20230090601A1 (en) System and method for polarity analysis
CN111930970A (en) News storage and search method based on video and voice recognition
Waitelonis et al. Use what you have: Yovisto video search engine takes a semantic turn

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination