CN109657181B - Internet information chain storage method, device, computer equipment and storage medium - Google Patents

Internet information chain storage method, device, computer equipment and storage medium Download PDF

Info

Publication number
CN109657181B
CN109657181B CN201811526834.0A CN201811526834A CN109657181B CN 109657181 B CN109657181 B CN 109657181B CN 201811526834 A CN201811526834 A CN 201811526834A CN 109657181 B CN109657181 B CN 109657181B
Authority
CN
China
Prior art keywords
information
file
data information
text
text file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811526834.0A
Other languages
Chinese (zh)
Other versions
CN109657181A (en
Inventor
吴壮伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201811526834.0A priority Critical patent/CN109657181B/en
Publication of CN109657181A publication Critical patent/CN109657181A/en
Priority to PCT/CN2019/092551 priority patent/WO2020119064A1/en
Application granted granted Critical
Publication of CN109657181B publication Critical patent/CN109657181B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an internet information chained storage method, an internet information chained storage device, computer equipment and a storage medium. The method comprises the following steps: acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information; judging whether the file in the newly added data information is a text file or not; if the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model; and storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list. The invention is based on the data compression storage technology, can ensure that the information stored in the data linked list cannot be deleted and modified, can facilitate the user to acquire the deleted data information on the Internet so as to assist the user to obtain evidence of the related data information, and has great practical value.

Description

Internet information chain storage method, device, computer equipment and storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to an internet information chain storage method, an internet information chain storage device, a computer device, and a storage medium.
Background
The method has the advantages that massive data information is stored on each webpage in the Internet, the data information stored in the webpage is gradually replaced by newly added data information, and the situation that the data information in the webpage changes is caused, so that the deleted or modified data information on the Internet cannot be acquired by the existing method for storing the data information in the Internet, and the evidence collection of the related data information published on the Internet is extremely difficult in judicial practice. Therefore, the existing data information storage method cannot acquire deleted data information.
Disclosure of Invention
The embodiment of the invention provides an internet information chain type storage method, an internet information chain type storage device, computer equipment and a storage medium, and aims to solve the problem that a data information storage method in the prior art cannot acquire deleted data information.
In a first aspect, an embodiment of the present invention provides an internet information chained storage method, which includes:
acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information;
Judging whether the file in the newly added data information is a text file or not;
If the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model;
And storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list.
In a second aspect, an embodiment of the present invention provides an internet information chain storage device, including:
The webpage monitoring unit is used for acquiring website information of the webpage to be monitored, and real-time monitoring is carried out on data information issued in the webpage to be monitored according to the website information of the webpage to be monitored so as to acquire newly-added data information;
The judging unit is used for judging whether the file in the newly added data information is a text file or not;
The information conversion unit is used for converting the non-text file into a text file through a preset information identification model if the file in the newly added data information is the non-text file;
the information storage unit is used for storing the text files in the newly added data information and/or the text files obtained through conversion into a preset data linked list.
In a third aspect, an embodiment of the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and capable of running on the processor, where the processor implements the internet information chained storage method in the first aspect when executing the computer program.
In a fourth aspect, an embodiment of the present invention further provides a computer readable storage medium, where the computer readable storage medium stores a computer program, where the computer program when executed by a processor causes the processor to perform the internet information chained storage method in the first aspect.
The embodiment of the invention provides an internet information chain type storage method, an internet information chain type storage device, computer equipment and a storage medium. The method and the device have the advantages that the data information released in the webpage is monitored, whether the file in the webpage is a text file is judged, the file of the non-text file is converted into the text file, all the text files are stored in the data linked list to realize chain storage of the internet information, the stored text files can be ensured to be incapable of being deleted and modified, the deleted data information on the internet can be conveniently acquired by a user to assist the user in evidence collection of the related data information, and the method and the device have great practical value.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flow chart of an internet information chain storage method according to an embodiment of the present invention;
fig. 2 is a schematic sub-flowchart of an internet information chain storage method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of another sub-flowchart of the method for storing information in a chain manner according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of another sub-flowchart of the method for storing information in a chain manner according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of another sub-flowchart of an Internet information chain storage method according to an embodiment of the present invention;
FIG. 6 is a schematic block diagram of an Internet information chain storage device according to an embodiment of the present invention;
FIG. 7 is a schematic block diagram of a subunit of an Internet information chain storage device according to an embodiment of the present invention;
FIG. 8 is a schematic block diagram of another subunit of an Internet information chain storage device according to an embodiment of the present invention;
FIG. 9 is a schematic block diagram of another subunit of an Internet information chain storage device according to an embodiment of the present invention;
FIG. 10 is a schematic block diagram of another subunit of an Internet information chain storage device according to an embodiment of the present invention;
fig. 11 is a schematic block diagram of a computer device according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
It should be understood that the terms "comprises" and "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in the present specification and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes such combinations.
Referring to fig. 1, fig. 1 is a flowchart of an internet information chained storage method according to an embodiment of the invention. The internet information chain storage method is applied to terminal equipment with an information storage function, such as a desktop computer, a notebook computer, a tablet computer or a mobile phone.
As shown in fig. 1, the method includes steps S110 to S140.
S110, acquiring website information of the webpage to be monitored, and monitoring the data information issued in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly-added data information.
Acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information. The web page information to be monitored is web site information of a web page to be monitored, which is input by a user, and the web page to be monitored can be all data information published on the internet, such as microblogs, weChats, enterprise web sites, government websites and the like, and publishers can be individuals, enterprises, organizations or government departments, for example, information published by a celebrity in the microblogs of the celebrity is monitored, and the web page information to be monitored is web site information of the celebrity microblog web page.
The data information released in the web page to be monitored can comprise files in various formats, such as text format information, video format information, audio format information, picture format information and the like. Because the data information released in the webpage to be monitored is released in real time, the webpage to be monitored needs to be monitored to acquire the latest data information released in the webpage in real time. In one embodiment, as shown in FIG. 2, step S110 includes sub-steps S111, S112, and S113.
And S111, if the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information.
If the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information. In order to obtain the publisher of the newly added data information, corresponding publishing source information is generated according to the website information of the webpage to be monitored and the publisher of the data information. The publishing source information comprises the website information of the webpage to be monitored and the publisher of the data information, wherein the website information of the webpage to be monitored is the webpage information to be monitored input by the user; the publisher is a publishing subject for publishing the newly added data information, and the publisher can be a person, an enterprise, an organization or a government department.
S112, generating a release time stamp according to the release time of the data information.
Generating a release time stamp of the newly added data information according to the release time of the data information, and generating a corresponding release time stamp according to the release time of the data information in order to record the release time of the newly added data information, wherein the release time stamp cannot be changed after being generated, namely, the release time of the newly added data information is ensured to be recorded in time and cannot be changed.
For example, the web page to be monitored is a microblog web page of a person, each piece of microblog information is issued with an issue time, and the issue time for acquiring the microblog information is the issue time stamp of the corresponding newly added data information.
S113, acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information.
And acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information. And acquiring all files in the published data information as newly-added data information, and acquiring the obtained published source information and the published time stamp to obtain the newly-added data information, wherein the newly-added data information can comprise one or more files.
For example, the web page to be monitored is a microblog web page of a celebrity, a section of text information and a video information are released in the celebrity microblog web page, and then the released text information and video information, release source information and release time stamp are obtained to obtain newly added data information comprising a text file and a video file.
S120, judging whether the file in the newly added data information is a text file.
Judging whether the file in the newly added data information is a text file or not, and judging whether the file in the newly added data information is a text file or not in order to save the files in various formats in the newly added data information. Specifically, the format information of each file in the newly added data information is obtained to judge whether the file is a text file or not.
And obtaining format information of each file in the newly added data information. Each file has respective format information, the format information of different files is matched with the corresponding types, and the specific types of the files can be judged through the format information. Judging whether each file is a text file according to the format information of each file, and judging the specific type of the file according to the format information of each file.
For example, if the format information of a certain file is txt and string, the file is a text file; if the format information of a certain file is wav, mp3 and wma, the file is an audio file; if the format information of a certain file is avi, flv, rmvb, the file is a video file.
S130, if the file in the newly added data information is a non-text file, converting the non-text file into a text file through an information identification model.
If the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model. Specifically, the non-text file is also one of the files, and the non-text file comprises an audio file, a video file and a picture. The information recognition model is a model for recognizing and converting the non-text file, wherein the information recognition model comprises an audio recognition model and a picture recognition model.
In one embodiment, as shown in FIG. 3, step S130 includes sub-steps S131, S132, and S133.
S131, acquiring format information of the non-text file and judging whether the file is an audio file, and if the file is the audio file, identifying the file through an audio identification model in an information identification model to obtain a corresponding text file.
And acquiring format information of the file, judging whether the file is an audio file, and if the file is the audio file, identifying the file through an audio identification model in the information identification model to obtain a corresponding text file. The voice information in the audio files can be identified and converted through the audio identification model, so that corresponding text files containing text information are obtained, and each audio file is correspondingly obtained after conversion. The audio recognition model comprises an acoustic model, a voice characteristic dictionary and a semantic analysis model.
In one embodiment, as shown in FIG. 4, step S131 includes sub-steps S1311 and S1312.
S1311, slicing the voice information in the audio file according to the acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information.
And segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information. Specifically, the speech information is composed of phonemes of a plurality of character utterances, and the phonemes of one character include the frequency and timbre of the character utterances. The acoustic model comprises phonemes of all character pronunciations, the phonemes of a single character in the voice information can be segmented by matching the voice information with all the phonemes in the acoustic model, and a plurality of phonemes contained in the voice information are finally obtained through segmentation.
S1312, matching the obtained phonemes according to a voice feature dictionary in the audio recognition model so as to convert all the phonemes into pinyin information.
And matching the obtained phonemes according to a voice feature dictionary in the audio recognition model, and converting all the phonemes into pinyin information. The phonetic feature dictionary contains the phonetic information corresponding to all the character phonetic alphabets, and the obtained phonetic information is matched with the phonetic information corresponding to the character phonetic alphabets, so that the phonetic of a single character can be converted into the character phonetic alphabets matched with the phonetic alphabets in the phonetic feature dictionary, and all the phonetic alphabets contained in the phonetic information are converted into phonetic information.
S1313, carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model to obtain a text file containing text information.
And carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model so as to convert the pinyin information into a corresponding text file. The semantic analysis model comprises a mapping relation corresponding to the Pinyin information and the text information, and the obtained Pinyin information can be subjected to semantic analysis through the mapping relation contained in the semantic analysis model so as to convert the Pinyin information into a text file containing the text information.
S132, acquiring format information of the non-text file and judging whether the file is a picture, and if the file is the picture, identifying the file through a picture identification model in an information identification model to obtain a corresponding text file.
And acquiring format information of the non-text file, judging whether the file is a picture, and if the file is the picture, identifying the text contained in the file through a picture identification model in an information identification model to obtain a corresponding text file. Specifically, the text templates are template information for identifying the text in the picture, one text template corresponds to one text in the picture, one text template comprises multiple fonts corresponding to the corresponding text, the text in the picture can be matched with one of the fonts in the corresponding text template, and the text contained in the picture can be identified by matching the text template in the picture identification model with the picture so as to obtain the corresponding text file.
S133, acquiring format information of the non-text file and judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file.
And acquiring format information of the non-text file, judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file. If the file is a video file, firstly acquiring voice information in the video file, and identifying and converting the voice information in the video file through an audio identification model to obtain text information corresponding to the voice information, wherein the specific identification and conversion method is the same as that of the step S131; and obtaining each frame of picture contained in the video file, and identifying each frame of picture contained in the video file through a picture identification model to obtain text information contained in each frame of picture, wherein the specific identification method is the same as that of the step S132. The text information corresponding to the voice information of the video file and the text information contained in each frame of picture in the video file are obtained, so that the text file corresponding to the video file can be finally obtained, namely, each video file is converted to obtain a text file correspondingly.
In addition, if the non-text file is not any of the video file, the audio file and the picture, the information recognition model cannot process the file, and then alarm prompt information is generated to prompt the user that the file cannot be processed.
And S140, storing the text file in the newly added data information and/or the converted text file into a preset data linked list.
And storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list so as to store the newly added data information. The data link table is a database preset in the terminal device and used for storing information, specifically, the data link table is a database for storing text files contained in newly added data information according to a time axis, the logic sequence of the data information stored in the data link table is realized through a pointer link sequence in the data link table, in this embodiment, the issuing time stamp of the newly added data information is used as the logic sequence of the data link table, that is, the text files in the newly added data information are stored in the data link table through the time information as the pointer link sequence. By storing newly added data information in a logical sequence taking the time sequence as a linked list, a user can acquire a text file list taking the time information as the sequence through the data linked list, and the information stored in the data linked list has the characteristic of incapability of deleting.
In addition, other non-text files are converted into text files containing text information for storage, so that the storage space required for storing corresponding data information can be greatly compressed, and the use of users is facilitated.
In one embodiment, as shown in FIG. 5, step S140 includes sub-steps S141, S142, and S143.
S141, acquiring release source information and release time stamps of newly added data information in the webpage to be monitored.
And acquiring release source information and release time stamps of the newly added data information in the webpage to be monitored. In order to facilitate the storage of the newly added data information and the later retrieval of the stored information data information, the release source information and the release time stamp of the newly added data information need to be acquired, and specifically, the release source information comprises the website information of the webpage to be monitored and the release person of the data information.
S142, classifying the text files contained in the newly added data information and/or the converted text files according to the release source information.
And classifying the text files contained in the newly added data information and/or the converted text files according to the release source information. Specifically, in order to implement classification storage of newly-added data information, the newly-added data information needs to be classified according to publishers in the publishing source information, wherein one publisher corresponds to one class, each class corresponds to one sub-linked list in the data linked list, the newly-added data information published by the same publisher is separated into the sub-linked list corresponding to the publishing source information for storage, and text files contained in the newly-added data information and/or text files obtained through conversion can be classified by the publishers in the publishing source information for classification storage of the newly-added data information.
And S143, storing the text file contained in the newly added data information and/or the text file obtained by conversion into a sub-linked list corresponding to the release source information in a preset data linked list according to the release time stamp.
And storing the text files contained in the newly added data information and/or the text files obtained by conversion into corresponding sub-linked lists in a preset data linked list according to the release time stamp. One publisher corresponds to one category, each category corresponds to one sub-linked list in the data linked list, and because the data information in the data linked list is stored according to the time axis, the text file corresponding to the newly added data information is stored in the sub-linked list corresponding to the publisher category in the data linked list according to the release time stamp of the newly added data information, so that the newly added data information can be stored.
The text files stored in the data linked list cannot be deleted and modified, so that the historical data information released in the webpage to be monitored can be stored, and the historical data information released by the corresponding publisher can be conveniently and later obtained.
The method and the device have the advantages that the data information released in the webpage is monitored, whether the file in the webpage is a text file is judged, the file of the non-text file is converted into the text file, all the text files are stored in the data linked list to realize chain storage of the internet information, the stored text files can be ensured to be incapable of being deleted and modified, the deleted data information on the internet can be conveniently acquired by a user to assist the user in evidence collection of the related data information, and the method and the device have great practical value.
The embodiment of the invention also provides an internet information chain storage device which is used for executing any embodiment of the internet information chain storage method. In particular, referring to fig. 6, fig. 6 is a schematic block diagram of an internet information chain storage device according to an embodiment of the present invention. The internet information chain storage device can be configured in terminal equipment such as a desktop computer, a notebook computer, a tablet computer or a mobile phone.
As shown in fig. 6, the internet information chain storage device 100 includes a web page monitoring unit 110, a judging unit 120, an information converting unit 130, and an information storage unit 140.
The web page monitoring unit 110 is configured to obtain web address information of a web page to be monitored, and monitor, in real time, data information published in the web page to be monitored according to the web address information of the web page to be monitored to obtain newly added data information.
Acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information. The web page information to be monitored is web site information of a web page to be monitored, which is input by a user, and the web page to be monitored can be all data information published on the internet, such as microblogs, weChats, enterprise web sites, government websites and the like, and publishers can be individuals, enterprises, organizations or government departments, for example, information published by a celebrity in the microblogs of the celebrity is monitored, and the web page information to be monitored is web site information of the celebrity microblog web page.
The data information released in the web page to be monitored can comprise files in various formats, such as text format information, video format information, audio format information, picture format information and the like. Because the data information released in the webpage to be monitored is released in real time, the webpage to be monitored needs to be monitored to acquire the latest data information released in the webpage in real time.
In other embodiments of the present invention, as shown in fig. 7, the web page monitoring unit 110 includes a subunit: a distribution source information generating unit 111, a distribution time stamp generating unit 112, and a newly added data information acquiring unit 113.
The publishing source information generating unit 111 is configured to generate publishing source information according to web address information of a web page to be monitored and a publisher of the data information if the data information is published in the web page to be monitored.
If the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information. In order to obtain the publisher of the newly added data information, corresponding publishing source information is generated according to the website information of the webpage to be monitored and the publisher of the data information. The publishing source information comprises the website information of the webpage to be monitored and the publisher of the data information, wherein the website information of the webpage to be monitored is the webpage information to be monitored input by the user; the publisher is a publishing subject for publishing the newly added data information, and the publisher can be a person, an enterprise, an organization or a government department.
The distribution time stamp generating unit 112 is configured to generate a distribution time stamp according to the distribution time of the data information.
Generating a release time stamp of the newly added data information according to the release time of the data information, and generating a corresponding release time stamp according to the release time of the data information in order to record the release time of the newly added data information, wherein the release time stamp cannot be changed after being generated, namely, the release time of the newly added data information is ensured to be recorded in time and cannot be changed.
And the newly added data information obtaining unit 113 is configured to obtain all files in the published data information, the published source information, and the published time stamp, so as to obtain newly added data information.
And acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information. And acquiring all files in the published data information as newly-added data information, and acquiring the obtained published source information and the published time stamp to obtain the newly-added data information, wherein the newly-added data information can comprise one or more files.
The judging unit 120 is configured to judge whether the file in the newly added data information is a text file.
Judging whether the file in the newly added data information is a text file or not, and judging whether the file in the newly added data information is a text file or not in order to save the files in various formats in the newly added data information. Specifically, the format information of each file in the newly added data information is obtained to judge whether the file is a text file or not.
And obtaining format information of each file in the newly added data information. Each file has respective format information, the format information of different files is matched with the corresponding types, and the specific types of the files can be judged through the format information. Judging whether each file is a text file according to the format information of each file, and judging the specific type of the file according to the format information of each file.
The information conversion unit 130 is configured to convert the non-text file into a text file through a preset information identification model if the file in the newly added data information is a non-text file.
If the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model. Specifically, the non-text file is also one of the files, and the non-text file comprises an audio file, a video file and a picture. The information recognition model is a model for recognizing and converting the non-text file, wherein the information recognition model comprises an audio recognition model and a picture recognition model.
In other embodiments of the invention, as shown in fig. 8, the information converting unit 130 includes a subunit: a first text file acquiring unit 131, a second text file acquiring unit 132, and a third text file acquiring unit 133.
The first text file obtaining unit 131 is configured to obtain format information of the non-text file and determine whether the file is an audio file, and if the file is an audio file, identify the file through an audio identification model in the information identification model to obtain a corresponding text file.
And acquiring format information of the file, judging whether the file is an audio file, and if the file is the audio file, identifying the file through an audio identification model in the information identification model to obtain a corresponding text file. The voice information in the audio files can be identified and converted through the audio identification model, so that corresponding text files containing text information are obtained, and each audio file is correspondingly obtained after conversion. The audio recognition model comprises an acoustic model, a voice characteristic dictionary and a semantic analysis model.
In other embodiments of the present invention, as shown in fig. 9, the first text file obtaining unit 131 includes a subunit: a phoneme segmentation unit 1311, a phoneme conversion unit 1312, and a speech analysis unit 1313.
The phoneme slicing unit 1311 is configured to slice the speech information in the audio file according to the acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the speech information.
And segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information. Specifically, the speech information is composed of phonemes of a plurality of character utterances, and the phonemes of one character include the frequency and timbre of the character utterances. The acoustic model comprises phonemes of all character pronunciations, the phonemes of a single character in the voice information can be segmented by matching the voice information with all the phonemes in the acoustic model, and a plurality of phonemes contained in the voice information are finally obtained through segmentation.
A phoneme conversion unit 1312 for matching the resultant phonemes according to a speech feature dictionary in the audio recognition model to convert all the phonemes into pinyin information.
And matching the obtained phonemes according to a voice feature dictionary in the audio recognition model, and converting all the phonemes into pinyin information. The phonetic feature dictionary contains the phonetic information corresponding to all the character phonetic alphabets, and the obtained phonetic information is matched with the phonetic information corresponding to the character phonetic alphabets, so that the phonetic of a single character can be converted into the character phonetic alphabets matched with the phonetic alphabets in the phonetic feature dictionary, and all the phonetic alphabets contained in the phonetic information are converted into phonetic information.
The voice parsing unit 1313 is configured to perform semantic parsing on the obtained pinyin information according to a semantic parsing model in the audio recognition model, so as to obtain a text file containing text information.
And carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model so as to convert the pinyin information into a corresponding text file. The semantic analysis model comprises a mapping relation corresponding to the Pinyin information and the text information, and the obtained Pinyin information can be subjected to semantic analysis through the mapping relation contained in the semantic analysis model so as to convert the Pinyin information into a text file containing the text information.
The second text file obtaining unit 132 is configured to obtain format information of the non-text file and determine whether the file is a picture, and if the file is a picture, identify the file through a picture identification model in the information identification model to obtain a corresponding text file.
And acquiring format information of the non-text file, judging whether the file is a picture, and if the file is the picture, identifying the text contained in the file through a picture identification model in an information identification model to obtain a corresponding text file. Specifically, the text templates are template information for identifying the text in the picture, one text template corresponds to one text in the picture, one text template comprises multiple fonts corresponding to the corresponding text, the text in the picture can be matched with one of the fonts in the corresponding text template, and the text contained in the picture can be identified by matching the text template in the picture identification model with the picture so as to obtain the corresponding text file.
And a third text file obtaining unit 133, configured to obtain format information of the non-text file and determine whether the file is a video file, and if the file is a video file, identify the file through an audio identification model and a picture identification model in the information identification model to obtain a corresponding text file.
And acquiring format information of the non-text file, judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file. If the file is a video file, the voice information in the video file is acquired first, and the voice information in the video file can be identified and converted through an audio identification model to obtain text information corresponding to the voice information, wherein the specific identification and conversion method is the same as the method executed in the first text file acquiring unit 131; each frame of picture contained in the video file is acquired, and each frame of picture contained in the video file is identified by a picture identification model, so as to obtain text information contained in each frame of picture, and a specific identification method is the same as that executed in the second text file acquisition unit 132. The text information corresponding to the voice information of the video file and the text information contained in each frame of picture in the video file are obtained, so that the text file corresponding to the video file can be finally obtained, namely, each video file is converted to obtain a text file correspondingly.
In addition, if the non-text file is not any of the video file, the audio file and the picture, the information recognition model cannot process the file, and then alarm prompt information is generated to prompt the user that the file cannot be processed.
The information storage unit 140 is configured to store the text file in the newly added data information and/or the converted text file in the preset data link list.
And storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list so as to store the newly added data information. The data link table is a database preset in the terminal device and used for storing information, specifically, the data link table is a database for storing text files contained in newly added data information according to a time axis, the logic sequence of the data information stored in the data link table is realized through a pointer link sequence in the data link table, in this embodiment, the issuing time stamp of the newly added data information is used as the logic sequence of the data link table, that is, the text files in the newly added data information are stored in the data link table through the time information as the pointer link sequence. By storing newly added data information in a logical sequence taking the time sequence as a linked list, a user can acquire a text file list taking the time information as the sequence through the data linked list, and the information stored in the data linked list has the characteristic of incapability of deleting.
In addition, other non-text files are converted into text files containing text information for storage, so that the storage space required for storing corresponding data information can be greatly compressed, and the use of users is facilitated.
In other embodiments of the invention, as shown in fig. 10, the information storage unit 140 includes a subunit: an information acquisition unit 141, a file classification unit 142, and a file storage unit 143.
The information obtaining unit 141 is configured to obtain the publishing source information and the publishing timestamp of the newly added data information in the web page to be monitored.
And acquiring release source information and release time stamps of the newly added data information in the webpage to be monitored. In order to facilitate the storage of the newly added data information and the later retrieval of the stored information data information, the release source information and the release time stamp of the newly added data information need to be acquired, and specifically, the release source information comprises the website information of the webpage to be monitored and the release person of the data information.
The file classifying unit 142 is configured to classify the text file and/or the converted text file included in the newly added data information according to the distribution source information.
And classifying the text files contained in the newly added data information and/or the converted text files according to the release source information. Specifically, in order to implement classification storage of newly-added data information, the newly-added data information needs to be classified according to publishers in the publishing source information, wherein one publisher corresponds to one class, each class corresponds to one sub-linked list in the data linked list, the newly-added data information published by the same publisher is separated into the sub-linked list corresponding to the publishing source information for storage, and text files contained in the newly-added data information and/or text files obtained through conversion can be classified by the publishers in the publishing source information for classification storage of the newly-added data information.
The file storage unit 143 is configured to store, according to the release timestamp, a text file included in the newly added data information and/or a text file obtained by conversion to a sub-linked list corresponding to the release source information in the preset data linked list.
And storing the text files contained in the newly added data information and/or the text files obtained by conversion into corresponding sub-linked lists in a preset data linked list according to the release time stamp. One publisher corresponds to one category, each category corresponds to one sub-linked list in the data linked list, and because the data information in the data linked list is stored according to the time axis, the text file corresponding to the newly added data information is stored in the sub-linked list corresponding to the publisher category in the data linked list according to the release time stamp of the newly added data information, so that the newly added data information can be stored.
The text files stored in the data linked list cannot be deleted and modified, so that the historical data information released in the webpage to be monitored can be stored, and the historical data information released by the corresponding publisher can be conveniently and later obtained.
The method and the device have the advantages that the data information released in the webpage is monitored, whether the file in the webpage is a text file is judged, the file of the non-text file is converted into the text file, all the text files are stored in the data linked list to realize chain storage of the internet information, the stored text files can be ensured to be incapable of being deleted and modified, the deleted data information on the internet can be conveniently acquired by a user to assist the user in evidence collection of the related data information, and the method and the device have great practical value.
The above-described internet information chain storage device may be implemented in the form of a computer program which can be run on a computer apparatus as shown in fig. 11.
Referring to fig. 11, fig. 11 is a schematic block diagram of a computer device according to an embodiment of the present invention.
With reference to FIG. 11, the computer device 500 includes a processor 502, memory, and a network interface 505 connected by a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
The non-volatile storage medium 503 may store an operating system 5031 and a computer program 5032. The computer program 5032, when executed, causes the processor 502 to perform an internet information chained storage method.
The processor 502 is used to provide computing and control capabilities to support the operation of the overall computer device 500.
The internal memory 504 provides an environment for the execution of a computer program 5032 in the non-volatile storage medium 503, which computer program 5032, when executed by the processor 502, causes the processor 502 to perform an internet information chained storage method.
The network interface 505 is used for network communication, such as providing for transmission of data information, etc. It will be appreciated by those skilled in the art that the structure shown in FIG. 11 is merely a block diagram of some of the structures associated with the present inventive arrangements and does not constitute a limitation of the computer device 500 to which the present inventive arrangements may be applied, and that a particular computer device 500 may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.
Wherein the processor 502 is configured to execute a computer program 5032 stored in a memory to perform the following functions: acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information; judging whether the file in the newly added data information is a text file or not; if the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model; and storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list.
In one embodiment, when the processor 502 performs the step of acquiring the website information of the to-be-monitored webpage, and monitoring the data information published in the to-be-monitored webpage in real time according to the website information of the to-be-monitored webpage to acquire the newly added data information, the following operations are performed: if the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information; generating a release time stamp according to the release time of the data information; and acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information.
In one embodiment, when executing the step of converting the non-text file into the text file through the preset information identification model if the file in the newly added data information is the non-text file, the processor 502 executes the following operations: acquiring format information of the non-text file and judging whether the file is an audio file, if so, identifying the file through an audio identification model in an information identification model to obtain a corresponding text file; acquiring format information of the non-text file and judging whether the file is a picture, if so, identifying the file through a picture identification model in an information identification model to obtain a corresponding text file; and acquiring format information of the non-text file, judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file.
In one embodiment, when the processor 502 performs the steps of obtaining the format information of the non-text file and determining whether the file is an audio file, if the file is an audio file, identifying the file by using an audio identification model in the information identification model to obtain a corresponding text file, the following operations are performed: segmenting voice information in an audio file according to an acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information; matching the obtained phonemes according to a voice feature dictionary in the audio recognition model so as to convert all the phonemes into pinyin information; and carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model to obtain a text file containing text information.
In one embodiment, the processor 502 performs the following operations when executing the step of saving the text file in the newly added data information and/or the converted text file in the preset data link list: acquiring release source information and release time stamps of newly added data information in a webpage to be monitored; classifying the text files contained in the newly added data information and/or the converted text files according to the release source information; and storing the text file contained in the newly added data information and/or the text file obtained by conversion into a sub-linked list corresponding to the release source information in a preset data linked list according to the release time stamp.
Those skilled in the art will appreciate that the embodiment of the computer device shown in fig. 11 is not limiting of the specific construction of the computer device, and in other embodiments, the computer device may include more or less components than those shown, or certain components may be combined, or a different arrangement of components. For example, in some embodiments, the computer device may include only a memory and a processor, and in such embodiments, the structure and function of the memory and the processor are consistent with the embodiment shown in fig. 11, and will not be described again.
It should be appreciated that in embodiments of the present invention, the Processor 502 may be a central processing unit (Central Processing Unit, CPU), the Processor 502 may also be other general purpose processors, digital signal processors (DIGITAL SIGNAL processors, DSPs), application SPECIFIC INTEGRATED Circuits (ASICs), off-the-shelf Programmable gate arrays (Field-Programmable GATE ARRAY, FPGA) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. Wherein the general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
In another embodiment of the invention, a computer-readable storage medium is provided. The computer readable storage medium may be a non-volatile computer readable storage medium. The computer readable storage medium stores a computer program, wherein the computer program when executed by a processor performs the steps of: acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information; judging whether the file in the newly added data information is a text file or not; if the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model; and storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list.
In an embodiment, the step of obtaining the website information of the to-be-monitored webpage, and monitoring the data information published in the to-be-monitored webpage in real time according to the website information of the to-be-monitored webpage to obtain the newly added data information includes: if the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information; generating a release time stamp according to the release time of the data information; and acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information.
In an embodiment, the step of converting the non-text file into the text file through the preset information identification model if the file in the newly added data information is the non-text file includes: acquiring format information of the non-text file and judging whether the file is an audio file, if so, identifying the file through an audio identification model in an information identification model to obtain a corresponding text file; acquiring format information of the non-text file and judging whether the file is a picture, if so, identifying the file through a picture identification model in an information identification model to obtain a corresponding text file; and acquiring format information of the non-text file, judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file.
In an embodiment, the step of obtaining the format information of the non-text file and determining whether the file is an audio file, and if the file is an audio file, identifying the file through an audio identification model in an information identification model to obtain a corresponding text file includes: segmenting voice information in an audio file according to an acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information; matching the obtained phonemes according to a voice feature dictionary in the audio recognition model so as to convert all the phonemes into pinyin information; and carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model to obtain a text file containing text information.
In an embodiment, the step of storing the text file in the newly added data information and/or the text file obtained by conversion in the preset data link list includes: acquiring release source information and release time stamps of newly added data information in a webpage to be monitored; classifying the text files contained in the newly added data information and/or the converted text files according to the release source information; and storing the text file contained in the newly added data information and/or the text file obtained by conversion into a sub-linked list corresponding to the release source information in a preset data linked list according to the release time stamp.
It will be clearly understood by those skilled in the art that, for convenience and brevity of description, specific working procedures of the apparatus, device and unit described above may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein. Those of ordinary skill in the art will appreciate that the elements and algorithm steps described in connection with the embodiments disclosed herein may be embodied in electronic hardware, in computer software, or in a combination of the two, and that the elements and steps of the examples have been generally described in terms of function in the foregoing description to clearly illustrate the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus, device and method may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, and for example, the division of the units is merely a logical function division, there may be another division manner in actual implementation, or units having the same function may be integrated into one unit, for example, multiple units or components may be combined or may be integrated into another system, or some features may be omitted, or not performed. In addition, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices, or elements, or may be an electrical, mechanical, or other form of connection.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the embodiment of the present invention.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention is essentially or part of what contributes to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a computer-readable storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned computer-readable storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a magnetic disk, an optical disk, or other various media capable of storing program codes.
While the invention has been described with reference to certain preferred embodiments, it will be understood by those skilled in the art that various changes and substitutions of equivalents may be made and equivalents will be apparent to those skilled in the art without departing from the scope of the invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.

Claims (7)

1. An internet information chain storage method, comprising:
acquiring website information of a webpage to be monitored, and monitoring data information published in the webpage to be monitored in real time according to the website information of the webpage to be monitored to acquire newly added data information;
Judging whether the file in the newly added data information is a text file or not;
If the file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information identification model;
the text file in the newly added data information and/or the text file obtained by conversion are stored in a preset data linked list;
the real-time monitoring of the data information issued in the web page to be monitored according to the web site information of the web page to be monitored to obtain newly added data information comprises the following steps:
if the data information is published in the webpage to be monitored, generating publishing source information according to the website information of the webpage to be monitored and the publisher of the data information; the publishing source information comprises web site information of a web page to be monitored and a publisher of the data information, and the publisher is a publishing main body for publishing the data information;
generating a release time stamp according to the release time of the data information;
acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information;
The step of storing the text file in the newly added data information and/or the text file obtained by conversion into a preset data linked list comprises the following steps:
Acquiring release source information and release time stamps of newly added data information in a webpage to be monitored;
Classifying the text files contained in the newly added data information and/or the converted text files according to the release source information;
According to the release time stamp, the text file contained in the newly added data information and/or the text file obtained through conversion are stored in a sub-linked list corresponding to the release source information in a preset data linked list; one publisher corresponds to one category, and each category corresponds to one sub-linked list in the data linked list; and the data information in the data link list is stored according to the time axis.
2. The method for storing internet information in a chained manner according to claim 1, wherein the converting the non-text file into the text file by the preset information recognition model comprises:
Acquiring format information of the non-text file and judging whether the file is an audio file, if so, identifying the file through an audio identification model in an information identification model to obtain a corresponding text file;
acquiring format information of the non-text file and judging whether the file is a picture, if so, identifying the file through a picture identification model in an information identification model to obtain a corresponding text file;
And acquiring format information of the non-text file, judging whether the file is a video file, and if the file is the video file, identifying the file through an audio identification model and a picture identification model in an information identification model to obtain a corresponding text file.
3. The method for storing internet information in a chained manner according to claim 2, wherein if the file is an audio file, the method for identifying the file by using an audio identification model in the information identification model to obtain a corresponding text file comprises:
Segmenting voice information in an audio file according to an acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information;
matching the obtained phonemes according to a voice feature dictionary in the audio recognition model so as to convert all the phonemes into pinyin information;
And carrying out semantic analysis on the obtained pinyin information according to a semantic analysis model in the audio recognition model to obtain a text file containing text information.
4. An internet information chain storage device, comprising:
The webpage monitoring unit is used for acquiring website information of the webpage to be monitored, and real-time monitoring is carried out on data information issued in the webpage to be monitored according to the website information of the webpage to be monitored so as to acquire newly-added data information;
The judging unit is used for judging whether the file in the newly added data information is a text file or not;
The information conversion unit is used for converting the non-text file into a text file through a preset information identification model if the file in the newly added data information is the non-text file;
the information storage unit is used for storing the text files in the newly added data information and/or the text files obtained by conversion into a preset data linked list;
the web page monitoring unit comprises:
The system comprises a release source information generation unit, a data information generation unit and a data information generation unit, wherein the release source information generation unit is used for generating release source information according to the website information of the webpage to be monitored and the release person of the data information if the data information is released in the webpage to be monitored;
The issuing time stamp generating unit is used for generating an issuing time stamp according to the issuing time of the data information; the publishing source information comprises web site information of a web page to be monitored and a publisher of the data information, and the publisher is a publishing main body for publishing the data information;
The newly added data information acquisition unit is used for acquiring all files in the release data information, release source information and release time stamps to obtain newly added data information;
The information storage unit includes: the information acquisition unit is used for acquiring release source information and release time stamps of newly added data information in the webpage to be monitored; the file classification unit is used for classifying the text files contained in the newly added data information and/or the text files obtained by conversion according to the release source information; the file storage unit is used for storing the text files contained in the newly added data information and/or the text files obtained by conversion into a sub-linked list corresponding to the release source information in a preset data linked list according to the release time stamp; one publisher corresponds to one category, and each category corresponds to one sub-linked list in the data linked list; and the data information in the data link list is stored according to the time axis.
5. The internet-information chain storage device according to claim 4, wherein the information converting unit comprises:
The first text file acquisition unit is used for acquiring the format information of the non-text file and judging whether the file is an audio file, and if the file is the audio file, the file is identified through an audio identification model in the information identification model so as to obtain a corresponding text file;
the second text file obtaining unit is used for obtaining the format information of the non-text file and judging whether the file is a picture, if the file is the picture, the file is identified through a picture identification model in the information identification model so as to obtain a corresponding text file;
and the third text file acquisition unit is used for acquiring the format information of the non-text file and judging whether the file is a video file, and if the file is the video file, the file is identified through an audio identification model and a picture identification model in the information identification model so as to obtain a corresponding text file.
6. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the internet information chain storage method according to any of claims 1 to 3 when executing the computer program.
7. A computer readable storage medium, characterized in that the computer readable storage medium stores a computer program which, when executed by a processor, causes the processor to perform the internet information chain storage method according to any one of claims 1 to 3.
CN201811526834.0A 2018-12-13 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium Active CN109657181B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201811526834.0A CN109657181B (en) 2018-12-13 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium
PCT/CN2019/092551 WO2020119064A1 (en) 2018-12-13 2019-06-24 Method and device for storing internet information in linked manner, computer apparatus and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811526834.0A CN109657181B (en) 2018-12-13 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109657181A CN109657181A (en) 2019-04-19
CN109657181B true CN109657181B (en) 2024-05-14

Family

ID=66113068

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811526834.0A Active CN109657181B (en) 2018-12-13 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium

Country Status (2)

Country Link
CN (1) CN109657181B (en)
WO (1) WO2020119064A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657181B (en) * 2018-12-13 2024-05-14 平安科技(深圳)有限公司 Internet information chain storage method, device, computer equipment and storage medium
CN110490538B (en) * 2019-07-04 2023-08-22 平安科技(深圳)有限公司 Information chain generation method, device, computer equipment and storage medium
CN111125345B (en) * 2019-12-24 2024-04-16 南京三百云信息科技有限公司 Data application method and device
CN112104747B (en) * 2020-10-30 2021-02-26 广州市玄武无线科技股份有限公司 Request response system based on chain processing
CN112784077A (en) * 2021-03-17 2021-05-11 陕西省大数据集团有限公司 Method and device for classified extraction of data asset value

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101364955A (en) * 2008-09-28 2009-02-11 杭州电子科技大学 Method for analyzing and extracting evidence of e-mail customer terminal
CN101882162A (en) * 2010-06-29 2010-11-10 北京搜狗科技发展有限公司 Method and system for transmitting network information
CN103942639A (en) * 2014-03-21 2014-07-23 宁波中小在线信息服务有限公司 Policy management system and method for policy consultative service system
CN106412678A (en) * 2016-09-14 2017-02-15 安徽声讯信息技术有限公司 Method and system for transcribing and storing video news in real time
CN107680602A (en) * 2017-08-24 2018-02-09 平安科技(深圳)有限公司 Voice fraud recognition methods, device, terminal device and storage medium
CN108829765A (en) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 A kind of information query method, device, computer equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090157407A1 (en) * 2007-12-12 2009-06-18 Nokia Corporation Methods, Apparatuses, and Computer Program Products for Semantic Media Conversion From Source Files to Audio/Video Files
US10332506B2 (en) * 2015-09-02 2019-06-25 Oath Inc. Computerized system and method for formatted transcription of multimedia content
CN107450900B (en) * 2016-06-01 2021-01-29 阿里巴巴(中国)有限公司 Screenshot method and device
CN109657181B (en) * 2018-12-13 2024-05-14 平安科技(深圳)有限公司 Internet information chain storage method, device, computer equipment and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101364955A (en) * 2008-09-28 2009-02-11 杭州电子科技大学 Method for analyzing and extracting evidence of e-mail customer terminal
CN101882162A (en) * 2010-06-29 2010-11-10 北京搜狗科技发展有限公司 Method and system for transmitting network information
CN103942639A (en) * 2014-03-21 2014-07-23 宁波中小在线信息服务有限公司 Policy management system and method for policy consultative service system
CN106412678A (en) * 2016-09-14 2017-02-15 安徽声讯信息技术有限公司 Method and system for transcribing and storing video news in real time
CN107680602A (en) * 2017-08-24 2018-02-09 平安科技(深圳)有限公司 Voice fraud recognition methods, device, terminal device and storage medium
CN108829765A (en) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 A kind of information query method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN109657181A (en) 2019-04-19
WO2020119064A1 (en) 2020-06-18

Similar Documents

Publication Publication Date Title
CN109657181B (en) Internet information chain storage method, device, computer equipment and storage medium
WO2019095586A1 (en) Meeting minutes generation method, application server, and computer readable storage medium
US9514741B2 (en) Data shredding for speech recognition acoustic model training under data retention restrictions
US9697819B2 (en) Method for building a speech feature library, and method, apparatus, device, and computer readable storage media for speech synthesis
WO2020253399A1 (en) Log classification rule generation method, device, apparatus, and readable storage medium
US9514740B2 (en) Data shredding for speech recognition language model training under data retention restrictions
US10108698B2 (en) Common data repository for improving transactional efficiencies of user interactions with a computing device
US20160012082A1 (en) Content-based revision history timelines
CN109299227B (en) Information query method and device based on voice recognition
US20140195532A1 (en) Collecting digital assets to form a searchable repository
CN111159987A (en) Data chart drawing method, device, equipment and computer readable storage medium
CN111400361A (en) Data real-time storage method and device, computer equipment and storage medium
CN111639157B (en) Audio marking method, device, equipment and readable storage medium
US20210133212A1 (en) Data archiving method and computing device implementing same
CN111128160B (en) Receipt modification method and device based on voice recognition and computer equipment
WO2016188334A1 (en) Method and device for processing application access data
US9747891B1 (en) Name pronunciation recommendation
CN108846098B (en) Information flow abstract generating and displaying method
CN117275466A (en) Business intention recognition method, device, equipment and storage medium thereof
CN109524009B (en) Policy entry method and related device based on voice recognition
CN116010545A (en) Data processing method, device and equipment
CN109493868B (en) Policy entry method and related device based on voice recognition
CN113761206A (en) Intelligent information query method, device, equipment and medium based on intention recognition
CN108595470B (en) Audio paragraph collection method, device and system and computer equipment
CN113032515A (en) Method, system, device and storage medium for generating chart based on multiple data sources

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant