CN109657181A - Internet information chain type storage method, device, computer equipment and storage medium - Google Patents

Info

Publication number
CN109657181A
Authority
CN
China
Prior art keywords
information
file
data information
newly
text file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811526834.0A
Other languages
Chinese (zh)
Other versions
CN109657181B (en)
Inventor
吴壮伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201811526834.0A priority Critical patent/CN109657181B/en
Priority claimed from CN201811526834.0A external-priority patent/CN109657181B/en
Publication of CN109657181A publication Critical patent/CN109657181A/en
Priority to PCT/CN2019/092551 priority patent/WO2020119064A1/en
Application granted granted Critical
Publication of CN109657181B publication Critical patent/CN109657181B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention discloses an internet information chain-type storage method, device, computer equipment and storage medium. The method includes: obtaining the URL information of a webpage to be monitored, and monitoring in real time the data information published on that webpage according to the URL information, so as to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information recognition model; and saving the text files in the newly added data information and/or the converted text files into a preset data linked list. Based on compressed data storage technology, the invention ensures that information stored in the data linked list cannot be deleted or modified, and makes it convenient for users to obtain data information that has been deleted from the internet. This helps users collect evidence from the relevant data information and is of considerable practical value.

Description

Internet information chain type storage method, device, computer equipment and storage medium
Technical field
The present invention relates to the field of computer technology, and in particular to an internet information chain-type storage method, device, computer equipment and storage medium.
Background art
Each webpage on the internet holds a massive amount of data information, and newly added data information gradually replaces the data information previously saved in a webpage, so the data information in a webpage keeps changing. As a result, existing methods for storing internet data information cannot obtain data information that has been deleted or modified on the internet, and in judicial practice it is very difficult to collect evidence from relevant data information published on the internet. In short, existing data information storage methods cannot obtain deleted data information.
Summary of the invention
The embodiments of the present invention provide an internet information chain-type storage method, device, computer equipment and storage medium, which aim to solve the problem that data information storage methods in the prior art cannot obtain deleted data information.
In a first aspect, an embodiment of the present invention provides an internet information chain-type storage method, which includes:
obtaining the URL information of a webpage to be monitored, and monitoring in real time the data information published on the webpage to be monitored according to that URL information, so as to obtain newly added data information;
judging whether each file in the newly added data information is a text file;
if a file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information recognition model;
saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
In a second aspect, an embodiment of the present invention provides an internet information chain-type storage device, which includes:
a webpage monitoring unit, configured to obtain the URL information of a webpage to be monitored, and to monitor in real time the data information published on the webpage to be monitored according to that URL information, so as to obtain newly added data information;
a judging unit, configured to judge whether each file in the newly added data information is a text file;
an information conversion unit, configured to convert a non-text file into a text file through a preset information recognition model if a file in the newly added data information is a non-text file;
an information storage unit, configured to save the text files in the newly added data information and/or the converted text files into a preset data linked list.
In a third aspect, an embodiment of the present invention further provides computer equipment, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor; when the processor executes the computer program, it implements the internet information chain-type storage method described in the first aspect.
In a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to perform the internet information chain-type storage method described in the first aspect.
The embodiments of the present invention provide an internet information chain-type storage method, device, computer equipment and storage medium. The data information published on a webpage is monitored, each file in it is judged to be a text file or not, non-text files are converted into text files, and all text files are stored in a data linked list, thereby realizing chain-type storage of internet information. This ensures that the stored text files cannot be deleted or modified, and makes it convenient for users to obtain data information that has been deleted from the internet, which helps users collect evidence from the relevant data information and is of considerable practical value.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings needed in the description of the embodiments are briefly introduced below. Obviously, the drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of the internet information chain-type storage method provided by an embodiment of the present invention;
Fig. 2 is a schematic sub-flowchart of the internet information chain-type storage method provided by an embodiment of the present invention;
Fig. 3 is another schematic sub-flowchart of the internet information chain-type storage method provided by an embodiment of the present invention;
Fig. 4 is another schematic sub-flowchart of the internet information chain-type storage method provided by an embodiment of the present invention;
Fig. 5 is another schematic sub-flowchart of the internet information chain-type storage method provided by an embodiment of the present invention;
Fig. 6 is a schematic block diagram of the internet information chain-type storage device provided by an embodiment of the present invention;
Fig. 7 is a schematic block diagram of a subunit of the internet information chain-type storage device provided by an embodiment of the present invention;
Fig. 8 is a schematic block diagram of another subunit of the internet information chain-type storage device provided by an embodiment of the present invention;
Fig. 9 is a schematic block diagram of another subunit of the internet information chain-type storage device provided by an embodiment of the present invention;
Fig. 10 is a schematic block diagram of another subunit of the internet information chain-type storage device provided by an embodiment of the present invention;
Fig. 11 is a schematic block diagram of the computer equipment provided by an embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
It should be understood that, when used in this specification and the appended claims, the terms "includes" and "comprises" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or sets thereof.
It should also be understood that the terms used in this description of the invention are for the purpose of describing specific embodiments only and are not intended to limit the invention. As used in the description of the invention and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" used in the description of the invention and the appended claims refers to any and all possible combinations of one or more of the associated listed items, and includes these combinations.
Referring to Fig. 1, Fig. 1 is a schematic flowchart of the internet information chain-type storage method provided by an embodiment of the present invention. The internet information chain-type storage method is applied in a terminal device with an information storage function, such as a desktop computer, laptop, tablet computer or mobile phone.
As shown in Fig. 1, the method includes steps S110 to S140.
S110. Obtain the URL information of the webpage to be monitored, and monitor in real time the data information published on the webpage to be monitored according to that URL information, so as to obtain newly added data information.
The information on the webpage to be monitored is the URL information of that webpage as entered by the user. The webpage to be monitored may be Weibo, WeChat, a company website, a government website, or any other place where data information is published on the internet, and the publisher may be an individual, an enterprise, an organization or a government department. For example, to monitor the information that a certain celebrity publishes on their Weibo account, the webpage information to be monitored is the URL information of that celebrity's Weibo page.
The data information published on the webpage to be monitored may include files in multiple formats, such as information in text format, video format, audio format and picture format. Because data information on the webpage to be monitored is published in real time, the webpage must be monitored in real time to obtain the latest data information published on it. In one embodiment, as shown in Fig. 2, step S110 includes sub-steps S111, S112 and S113.
S111. If it is detected that data information has been published on the webpage to be monitored, generate publication source information according to the URL information of the webpage to be monitored and the publisher of the data information.
To identify the publisher of the newly added data information, corresponding publication source information is generated from the URL information of the webpage to be monitored and the publisher of the data information. The publication source information includes the URL information of the webpage to be monitored and the publisher of the data information. The URL information of the webpage to be monitored is the webpage information entered by the user; the publisher is the entity that published the newly added data information, and may be an individual, an enterprise, an organization or a government department.
S112. Generate a publication timestamp according to the publication time of the data information.
To record the publication time of the newly added data information, a corresponding publication timestamp is generated from the publication time of the data information. Once generated, the publication timestamp cannot be modified, which ensures that the publication time of the newly added data information is recorded promptly and cannot be changed.
For example, if the webpage to be monitored is the Weibo page of a certain celebrity, every Weibo post carries a publication time, and the publication time obtained for a post is the publication timestamp of the corresponding newly added data information.
S113. Obtain all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
All files in the published data information are obtained as the newly added data information, together with the publication source information and publication timestamp generated above. The newly added data information may contain one or more files.
For example, if the webpage to be monitored is the Weibo page of a certain celebrity, and a passage of text and a video have been published on that page, then the published text and video, the publication source information and the publication timestamp are obtained, yielding newly added data information that contains one text file and one video file.
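Sub-steps S111 to S113 can be pictured with a minimal Python sketch. It is only an illustration under stated assumptions: the names `PublicationSource`, `NewDataInfo` and `collect_new_data`, and the shape of the page snapshot (a list of dictionaries with the keys `id`, `publisher`, `time` and `files`), are hypothetical and do not come from the patent, which does not specify how the page is read.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Tuple


@dataclass(frozen=True)
class PublicationSource:
    url: str        # URL information of the webpage to be monitored (S111)
    publisher: str  # individual, enterprise, organization or department


@dataclass(frozen=True)  # frozen: fields cannot be reassigned after creation
class NewDataInfo:
    source: PublicationSource
    published_at: datetime   # publication timestamp (S112)
    files: Tuple[str, ...]   # all files in the published data (S113)


def collect_new_data(url, posts, seen_ids):
    """Return a NewDataInfo for every post that has not been seen before.

    `posts` is a hypothetical snapshot of the monitored page; `seen_ids`
    is a set of post ids that is updated in place.
    """
    new_items = []
    for post in posts:
        if post["id"] in seen_ids:
            continue  # already recorded in an earlier monitoring pass
        seen_ids.add(post["id"])
        new_items.append(
            NewDataInfo(
                source=PublicationSource(url=url, publisher=post["publisher"]),
                published_at=datetime.fromisoformat(post["time"]),
                files=tuple(post["files"]),
            )
        )
    return new_items
```

Calling `collect_new_data` repeatedly with fresh snapshots and the same `seen_ids` set approximates real-time monitoring; the frozen dataclasses mirror the requirement that a publication timestamp is not changed once generated.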
S120. Judge whether each file in the newly added data information is a text file.
To decide how files of various formats in the newly added data information should be saved, it must first be judged whether each file in the newly added data information is a text file. Specifically, the format information of each file in the newly added data information is obtained and used to judge whether the file is a text file.
Each file has its own format information, and the format information of a file corresponds to a file type, so the specific type of a file can be determined from its format information.
For example, if the format information of a file is txt or string, the file is a text file; if the format information is wav, mp3 or wma, the file is an audio file; if the format information is avi, flv or rmvb, the file is a video file.
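The format check in step S120 can be sketched in a few lines of Python. The text, audio and video formats below are the ones listed in the example above; the picture formats are an assumption, since the description names none, and the function name `classify_file` is hypothetical.

```python
from pathlib import PurePath

# Formats taken from the examples in the description.
TEXT_FORMATS = {"txt", "string"}
AUDIO_FORMATS = {"wav", "mp3", "wma"}
VIDEO_FORMATS = {"avi", "flv", "rmvb"}
# Assumed for illustration only; the description lists no picture formats.
PICTURE_FORMATS = {"jpg", "png"}


def classify_file(filename):
    """Map a file name to a file type using its format information."""
    fmt = PurePath(filename).suffix.lstrip(".").lower()
    if fmt in TEXT_FORMATS:
        return "text"
    if fmt in AUDIO_FORMATS:
        return "audio"
    if fmt in VIDEO_FORMATS:
        return "video"
    if fmt in PICTURE_FORMATS:
        return "picture"
    return "unknown"
```

A file classified as anything other than `"text"` would then be passed on to the conversion step.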
S130. If a file in the newly added data information is a non-text file, convert the non-text file into a text file through the preset information recognition model.
Specifically, a non-text file is also a kind of file; non-text files include audio files, video files and pictures. The information recognition model is a model for recognizing and converting non-text files, and it includes an audio recognition model and a picture recognition model.
In one embodiment, as shown in Fig. 3, step S130 includes sub-steps S131, S132 and S133.
S131. Obtain the format information of the non-text file and judge whether the file is an audio file; if the file is an audio file, recognize it through the audio recognition model in the information recognition model to obtain the corresponding text file.
The audio recognition model recognizes and converts the voice information in an audio file to obtain a text file containing the corresponding text information; each audio file yields one text file after conversion. The audio recognition model includes an acoustic model, a speech feature dictionary and a semantic parsing model.
In one embodiment, as shown in Fig. 4, step S131 includes sub-steps S1311, S1312 and S1313.
S1311. Segment the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information.
Specifically, voice information is composed of the phonemes of multiple character pronunciations, and the phoneme of a character includes the frequency and timbre of that character's pronunciation. The acoustic model contains the phonemes of all character pronunciations. By matching the voice information against all phonemes in the acoustic model, the phonemes of the individual characters in the voice information can be segmented, finally yielding the multiple phonemes contained in the voice information.
S1312. Match the obtained phonemes according to the speech feature dictionary in the audio recognition model to convert all phonemes into pinyin information.
The speech feature dictionary contains the phoneme information corresponding to the pinyin of every character. By matching each obtained phoneme against the phoneme information in the dictionary, the phoneme of a single character can be converted into the character pinyin whose phoneme information it matches, so that all phonemes contained in the voice information are converted into pinyin information.
S1313. Perform semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
The semantic parsing model contains the mapping between pinyin information and text information. Using this mapping, semantic parsing of the obtained pinyin information converts it into a text file containing the text information.
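The three stages S1311 to S1313 form a pipeline: sound to phonemes, phonemes to pinyin, pinyin to text. The Python sketch below shows only that data flow. Every table entry is invented for illustration, the voice information is assumed to arrive already split into labelled sound units rather than as a waveform, and plain dictionary lookups stand in for the acoustic model, speech feature dictionary and semantic parsing model, whose internals the description does not specify.

```python
# Toy stand-ins for the three components; all entries are hypothetical.
ACOUSTIC_MODEL = {"n-i": "ph_ni", "h-ao": "ph_hao"}            # sound unit -> phoneme
SPEECH_FEATURE_DICTIONARY = {"ph_ni": "ni3", "ph_hao": "hao3"}  # phoneme -> pinyin
SEMANTIC_PARSING_MODEL = {("ni3", "hao3"): "你好"}               # pinyin sequence -> text


def segment(voice_units):
    """S1311: match each sound unit against the acoustic model."""
    return [ACOUSTIC_MODEL[unit] for unit in voice_units]


def to_pinyin(phonemes):
    """S1312: look up each phoneme in the speech feature dictionary."""
    return [SPEECH_FEATURE_DICTIONARY[phoneme] for phoneme in phonemes]


def parse(pinyin):
    """S1313: map the pinyin sequence to text via the semantic parsing model."""
    return SEMANTIC_PARSING_MODEL[tuple(pinyin)]


def audio_to_text(voice_units):
    """Run the three stages in order and return the recognized text."""
    return parse(to_pinyin(segment(voice_units)))
```

A production recognizer would replace each lookup with a statistical or neural model; the point here is only the order of the three conversions.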
S132. Obtain the format information of the non-text file and judge whether the file is a picture; if the file is a picture, recognize the text contained in it through the picture recognition model in the information recognition model to obtain the corresponding text file.
Specifically, a text template is template information used to recognize text in a picture. Each text template corresponds to one character and contains multiple fonts of that character, so any character in a picture can be matched with some font in the corresponding text template. By matching the text templates in the picture recognition model against the picture, the text contained in the picture is recognized and the corresponding text file is obtained.
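The idea of one text template per character, each holding several fonts, can be illustrated with a toy Python sketch. The 3x3 bitmaps, the template table and the function names are all hypothetical, the picture is assumed to be already cut into individual glyphs, and exact equality is used in place of the similarity scoring a real recognizer would need.

```python
# Each glyph is a 3x3 bitmap written as three strings; "#" marks ink.
# One template per character, each listing several fonts (all invented).
TEXT_TEMPLATES = {
    "T": [("###", ".#.", ".#.")],
    "L": [("#..", "#..", "###"), ("#..", "#..", "##.")],
}


def recognize_glyph(glyph):
    """Return the character whose template contains a font equal to `glyph`."""
    for char, fonts in TEXT_TEMPLATES.items():
        if glyph in fonts:
            return char
    return "?"  # no template matched


def picture_to_text(glyphs):
    """Recognize every glyph cut from a picture and join the results."""
    return "".join(recognize_glyph(glyph) for glyph in glyphs)
```

Both fonts listed for `L` resolve to the same character, which is the property the description relies on when it says a character in a picture matches some font in its template.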
S133. Obtain the format information of the non-text file and judge whether the file is a video file; if the file is a video file, recognize it through the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
If the file is a video file, the voice information in the video file is obtained first, and the audio recognition model recognizes and converts that voice information to obtain the corresponding text information; the recognition and conversion method is the same as in step S131. Each frame contained in the video file is then obtained and recognized through the picture recognition model to obtain the text information contained in each frame; the recognition method is the same as in step S132. Combining the text information corresponding to the voice information with the text information contained in each frame gives the text file corresponding to the video file; that is, each video file yields one text file after conversion.
In addition, if a non-text file is not a video file, an audio file or a picture, the information recognition model cannot process it, and alert information is generated to notify the user that the file cannot be processed.
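The dispatch across steps S131 to S133, together with the alert for unsupported files, might be sketched as follows in Python. The dictionary representation of a file, the exception `UnsupportedFileAlert` and the function `convert_to_text` are hypothetical; the two recognizers are passed in as parameters because the description does not fix their implementation.

```python
class UnsupportedFileAlert(Exception):
    """Signals that the information recognition model cannot process a file."""


def convert_to_text(file, recognize_audio, recognize_picture):
    """Convert one non-text file into text.

    `file` is a hypothetical dict with a "kind" key plus "audio",
    "picture" or "frames" payloads, depending on the kind.
    """
    kind = file["kind"]
    if kind == "audio":    # S131
        return recognize_audio(file["audio"])
    if kind == "picture":  # S132
        return recognize_picture(file["picture"])
    if kind == "video":    # S133: voice track first, then every frame
        speech_text = recognize_audio(file["audio"])
        frame_texts = [recognize_picture(frame) for frame in file["frames"]]
        return "\n".join([speech_text, *frame_texts])
    # Neither video, audio nor picture: raise the alert for the user.
    raise UnsupportedFileAlert(f"cannot process file of kind {kind!r}")
```

Joining the speech text and the per-frame text with newlines is one possible way to end up with a single text file per video; the description does not say how the two sources are merged.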
S140. Save the text files in the newly added data information and/or the converted text files into the preset data linked list.
The data linked list is a database preset in the terminal device for storing information. Specifically, it stores the text files contained in newly added data information along a time axis. The logical order of the data information stored in the data linked list is realized by the order of the pointer links in the list. In this embodiment, the publication timestamp of the newly added data information serves as the logical order of the data linked list; that is, the text files in the newly added data information are stored in the data linked list with time information determining the pointer link order. Because newly added data information is stored with chronological order as the logical order of the list, the user can obtain from the data linked list a list of text files ordered by time, and the information stored in the data linked list cannot be deleted.
In addition, because other non-text files are converted into text files containing text information before being stored, the storage space required for the corresponding data information is greatly reduced, which is convenient for the user.
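A linked list whose pointer order follows the publication timestamp can be sketched in Python as below. The class names `Node` and `DataLinkedList` are hypothetical, and the description does not say how deletion and modification are prevented; this sketch only approximates that property by exposing insertion and iteration and nothing else.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    timestamp: float  # publication timestamp, used as the ordering key
    text: str         # content of the stored text file
    next: Optional["Node"] = None


class DataLinkedList:
    """Singly linked list kept in publication-timestamp order.

    Only insertion and reading are offered; there is deliberately no
    method for deleting or editing a node.
    """

    def __init__(self):
        self._head = None

    def insert(self, timestamp, text):
        node = Node(timestamp, text)
        # New earliest entry: link it in front of the current head.
        if self._head is None or timestamp < self._head.timestamp:
            node.next = self._head
            self._head = node
            return
        # Otherwise walk to the last node that is not later than the new one.
        current = self._head
        while current.next is not None and current.next.timestamp <= timestamp:
            current = current.next
        node.next = current.next
        current.next = node

    def __iter__(self):
        current = self._head
        while current is not None:
            yield current.timestamp, current.text
            current = current.next
```

Iterating over the list yields the text files in chronological order regardless of the order in which they were inserted.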
In one embodiment, as shown in Fig. 5, step S140 includes sub-steps S141, S142 and S143.
S141. Obtain the publication source information and the publication timestamp of the newly added data information on the webpage to be monitored.
To make it convenient to store the newly added data information and later retrieve the stored data information, the publication source information and publication timestamp of the newly added data information must be obtained. Specifically, the publication source information includes the URL information of the webpage to be monitored and the publisher of the data information.
S142. Classify the text files contained in the newly added data information and/or the converted text files according to the publication source information.
Specifically, to store newly added data information by category, it is classified according to the publisher in the publication source information. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list, so newly added data information published by the same publisher is placed in the sub-list corresponding to its publication source information. Classifying the text files contained in the newly added data information and/or the converted text files by the publisher in the publication source information thus allows the newly added data information to be saved by category.
S143. Save the text files contained in the newly added data information and/or the converted text files, according to the publication timestamp, into the sub-list of the preset data linked list that corresponds to the publication source information.
Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list. Because the data information in the data linked list is stored along a time axis, the text files in the newly added data information must be stored, according to the publication timestamp of the newly added data information, in the sub-list of the data linked list that corresponds to the publisher's category. This completes the saving of the newly added data information.
Because the text files stored in the data linked list cannot be deleted or modified, the historical data information published on the webpage to be monitored is preserved, which makes it convenient to later collect evidence from the historical data information published by the corresponding publisher.
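Sub-steps S141 to S143 amount to keeping one time-ordered sub-list per publisher. A minimal Python sketch, assuming the hypothetical class name `ClassifiedStore` and using an ordinary sorted Python list in place of each linked sub-list for brevity:

```python
import bisect
from collections import defaultdict


class ClassifiedStore:
    """One time-ordered sub-list per publisher (S142 and S143)."""

    def __init__(self):
        # publisher -> list of (publication timestamp, text), kept sorted
        self._sub_lists = defaultdict(list)

    def save(self, publisher, timestamp, text):
        """Insert a text file into its publisher's sub-list in time order."""
        bisect.insort(self._sub_lists[publisher], (timestamp, text))

    def history(self, publisher):
        """Return a read-only snapshot of everything saved for a publisher."""
        return tuple(self._sub_lists.get(publisher, ()))
```

For example, after saving entries for publisher `"a"` with timestamps 2 and then 1, `history("a")` returns them in timestamp order, which is the per-publisher history that would later be consulted for evidence collection.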
The data information published on a webpage is monitored, each file in it is judged to be a text file or not, non-text files are converted into text files, and all text files are stored in a data linked list, thereby realizing chain-type storage of internet information. This ensures that the stored text files cannot be deleted or modified, and makes it convenient for users to obtain data information that has been deleted from the internet, which helps users collect evidence from the relevant data information and is of considerable practical value.
An embodiment of the present invention further provides an internet information chain-type storage device, which is used to carry out any embodiment of the foregoing internet information chain-type storage method. Specifically, referring to Fig. 6, Fig. 6 is a schematic block diagram of the internet information chain-type storage device provided by an embodiment of the present invention. The internet information chain-type storage device can be configured in a terminal device such as a desktop computer, laptop, tablet computer or mobile phone.
As shown in Fig. 6, the internet information chain-type storage device 100 includes a webpage monitoring unit 110, a judging unit 120, an information conversion unit 130 and an information storage unit 140.
The web page monitoring unit 110 is configured to obtain the URL information of the web page to be monitored and, according to that URL information, monitor in real time the data information published on the web page to be monitored so as to obtain newly added data information.
The URL information of the web page to be monitored is obtained, and the data information published on that page is monitored in real time according to the URL information to obtain newly added data information. The web page information to be monitored is the URL information of the web page to be monitored as entered by the user. The web page to be monitored may be a Weibo page, a WeChat page, an enterprise website, a government website, or any other page on which data information is published on the Internet, and the publisher may be an individual, an enterprise, an organization, or a government department. For example, to monitor the information that a certain celebrity publishes on Weibo, the web page information to be monitored is the URL information of that celebrity's Weibo page.
The data information published on the web page to be monitored may include files in multiple formats, such as text, video, audio, and pictures. Because data information is published on the web page in real time, the page must be monitored in real time to obtain the latest data information published on it.
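The text does not say how real-time monitoring is implemented. One plausible reading is periodic polling with a comparison against what has already been seen. The sketch below illustrates only that comparison step. The function name `find_new_items` and the idea that each published item carries a stable identifier are assumptions made for illustration, not details from the patent.

```python
from typing import Dict, Set


def find_new_items(seen_ids: Set[str], current_items: Dict[str, str]) -> Dict[str, str]:
    """Return the items whose identifiers have not been seen before.

    `current_items` maps a hypothetical item identifier to its content, as it
    would be obtained by fetching the monitored page. `seen_ids` is updated in
    place so that the next poll only reports later publications.
    """
    new_items = {key: value for key, value in current_items.items()
                 if key not in seen_ids}
    seen_ids.update(new_items)
    return new_items


seen: Set[str] = set()
first_poll = find_new_items(seen, {"post-1": "hello"})
second_poll = find_new_items(seen, {"post-1": "hello", "post-2": "world"})
```

In this sketch the second poll reports only `post-2`, because `post-1` was already recorded by the first poll.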
In other embodiments of the invention, as shown in Fig. 7, the web page monitoring unit 110 includes the following subunits: a publication source information generation unit 111, a publication timestamp generation unit 112, and a newly added data information acquisition unit 113.
The publication source information generation unit 111 is configured to, if data information is detected to have been published on the web page to be monitored, generate publication source information according to the URL information of the web page to be monitored and the publisher of the data information.
If data information is detected to have been published on the web page to be monitored, publication source information is generated according to the URL information of the web page to be monitored and the publisher of the data information. To identify the publisher of newly added data information, the corresponding publication source information must be generated from the URL information of the web page to be monitored and the publisher of the data information. The publication source information includes the URL information of the web page to be monitored and the publisher of the data information. The URL information of the web page to be monitored is the web page information to be monitored that the user entered. The publisher is the entity that published the newly added data information, and may be an individual, an enterprise, an organization, or a government department.
The publication timestamp generation unit 112 is configured to generate a publication timestamp according to the publication time of the data information.
The publication timestamp of the newly added data information is generated according to the publication time of the data information. To record the publication time of the newly added data information, a corresponding publication timestamp must be generated from the publication time of the data information. Once generated, the timestamp cannot be modified, which ensures that the publication time of the newly added data information is recorded promptly and cannot be changed.
The newly added data information acquisition unit 113 is configured to obtain all files in the published data information, together with the publication source information and the publication timestamp, so as to obtain the newly added data information.
All files in the published data information, together with the publication source information and the publication timestamp, are obtained to form the newly added data information. That is, all files in the published data information are taken as the newly added data information, and combining them with the generated publication source information and publication timestamp yields the newly added data information, which may contain one or more files.
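The composition of newly added data information described above (files, publication source information, publication timestamp) can be sketched as a small record type. The class and field names below are hypothetical. Using a frozen dataclass is only one way to approximate the requirement that the timestamp not be modified after generation, and it protects the in-memory object only.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List, Tuple


@dataclass(frozen=True)
class PublicationSource:
    url: str        # URL information of the monitored web page
    publisher: str  # individual, enterprise, organization or government department


@dataclass(frozen=True)
class NewDataInfo:
    source: PublicationSource
    timestamp: float        # publication timestamp, seconds since the epoch
    files: Tuple[str, ...]  # names of all files in the published item


def build_new_data_info(url: str, publisher: str,
                        published_at: datetime, files: List[str]) -> NewDataInfo:
    """Combine the files, publication source and timestamp into one record."""
    source = PublicationSource(url=url, publisher=publisher)
    return NewDataInfo(source=source,
                       timestamp=published_at.timestamp(),
                       files=tuple(files))


published = datetime(2018, 12, 13, 8, 0, tzinfo=timezone.utc)
info = build_new_data_info("https://example.com/u/alice", "alice",
                           published, ["post.txt", "clip.mp4"])
```

Assigning to `info.timestamp` after construction raises an error, which mirrors the "cannot be modified after generation" property at the level of this sketch.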
The judging unit 120 is configured to judge whether each file in the newly added data information is a text file.
Whether each file in the newly added data information is a text file is judged. So that files of every format in the newly added data information can be saved, it must first be judged whether each file is a text file. Specifically, the format information of each file in the newly added data information is obtained to judge whether the file is a text file.
The format information of each file in the newly added data information is obtained. Every file has its own format information, and the format information of a file corresponds to a file type, so the specific type of a file can be determined from its format information. Whether each file is a text file is then judged according to its format information.
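As an illustration of judging the file type from format information, the sketch below uses the file name extension as the format information. That choice, the extension sets, and the function names are assumptions. The patent does not specify which format information is used, and a real system might inspect MIME types or file headers instead.

```python
import os

# Hypothetical extension sets; extend them as needed.
TEXT_EXTENSIONS = {".txt", ".md", ".html"}
AUDIO_EXTENSIONS = {".mp3", ".wav"}
PICTURE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".gif"}
VIDEO_EXTENSIONS = {".mp4", ".avi"}


def classify_file(name: str) -> str:
    """Map a file name to 'text', 'audio', 'picture', 'video' or 'unknown'."""
    extension = os.path.splitext(name)[1].lower()
    if extension in TEXT_EXTENSIONS:
        return "text"
    if extension in AUDIO_EXTENSIONS:
        return "audio"
    if extension in PICTURE_EXTENSIONS:
        return "picture"
    if extension in VIDEO_EXTENSIONS:
        return "video"
    return "unknown"


def is_text_file(name: str) -> bool:
    """Judge whether a file is a text file from its format information."""
    return classify_file(name) == "text"
```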
Information conversion unit 130 is known if the file for increasing newly in data information is non-legible file by presupposed information Non-legible file is converted to text file by other model.
If the file in newly-increased data information is non-legible file, non-legible file is turned by presupposed information identification model It is changed to text file.Specifically, non-legible file is also one of file, non-legible file includes audio file, video text Part, picture.Information identification model is the model for non-legible file to be identified and converted, wherein information identifies mould It include audio identification model and picture recognition model in type.
In other inventive embodiments, as shown in figure 8, the information conversion unit 130 includes subelement: the first text file Acquiring unit 131, the second text file acquiring unit 132 and third text file acquiring unit 133.
The first text file acquisition unit 131 is configured to obtain the format information of the non-text file and judge whether the file is an audio file and, if it is, recognize the file with the audio recognition model in the information recognition model to obtain a corresponding text file.
The format information of the file is obtained and it is judged whether the file is an audio file. If it is, the file is recognized by the audio recognition model in the information recognition model to obtain a corresponding text file. The audio recognition model can recognize and convert the voice information in an audio file to obtain a text file containing the corresponding text information, and each audio file yields one text file after conversion. The audio recognition model includes an acoustic model, a speech feature dictionary, and a semantic parsing model.
In other embodiments of the invention, as shown in Fig. 9, the first text file acquisition unit 131 includes the following subunits: a phoneme segmentation unit 1311, a phoneme conversion unit 1312, and a semantic parsing unit 1313.
The phoneme segmentation unit 1311 is configured to segment the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the phonemes contained in the voice information.
The voice information in the audio file is segmented according to the acoustic model in the audio recognition model to obtain the phonemes contained in the voice information. Specifically, voice information consists of the phonemes of multiple character pronunciations, and the phoneme of a character includes the frequency and timbre of that character's pronunciation. The acoustic model contains the phonemes of all character pronunciations. By matching the voice information against all phonemes in the acoustic model, the phoneme of each individual character in the voice information can be segmented, finally yielding the phonemes contained in the voice information.
The phoneme conversion unit 1312 is configured to match the obtained phonemes according to the speech feature dictionary in the audio recognition model so as to convert all phonemes into pinyin information.
The obtained phonemes are matched according to the speech feature dictionary in the audio recognition model, so that all phonemes can be converted into pinyin information. The speech feature dictionary contains the phoneme information corresponding to the pinyin of every character. By matching each obtained phoneme against the phoneme information corresponding to character pinyin, the phoneme of a single character can be converted into the character pinyin in the speech feature dictionary that matches it, so that all phonemes contained in the voice information are converted into pinyin information.
The semantic parsing unit 1313 is configured to perform semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
Semantic parsing is performed on the obtained pinyin information according to the semantic parsing model in the audio recognition model, so that the pinyin information is converted into a corresponding text file. The semantic parsing model contains the mapping relationship between pinyin information and text information. Using this mapping relationship, the obtained pinyin information can be semantically parsed and converted into a text file containing text information.
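The three stages above (phoneme segmentation, phoneme-to-pinyin conversion, semantic parsing) can be sketched as a toy pipeline. Everything concrete in it is a stand-in chosen for illustration. The voice information is represented as a string of phoneme labels rather than an audio signal, and the three tables are tiny hypothetical dictionaries, not trained models. A real acoustic model would match on frequency and timbre features.

```python
from typing import List

# Hypothetical stand-ins for the three parts of the audio recognition model.
ACOUSTIC_MODEL = {"P1", "P2", "P3"}                           # known phoneme labels
SPEECH_FEATURE_DICT = {"P1": "ni", "P2": "hao", "P3": "ma"}   # phoneme -> pinyin
SEMANTIC_MODEL = {("ni", "hao"): "你好", ("ma",): "吗"}        # pinyin sequence -> text


def split_phonemes(voice: str) -> List[str]:
    """Segment the voice information by matching it against known phonemes."""
    candidates = sorted(ACOUSTIC_MODEL, key=len, reverse=True)
    phonemes, position = [], 0
    while position < len(voice):
        match = next((p for p in candidates if voice.startswith(p, position)), None)
        if match is None:
            raise ValueError(f"unrecognized sound at offset {position}")
        phonemes.append(match)
        position += len(match)
    return phonemes


def to_pinyin(phonemes: List[str]) -> List[str]:
    """Convert each phoneme to pinyin using the speech feature dictionary."""
    return [SPEECH_FEATURE_DICT[p] for p in phonemes]


def parse_semantics(pinyin: List[str]) -> str:
    """Map pinyin to text, preferring the longest pinyin sequence that matches."""
    longest = max(len(key) for key in SEMANTIC_MODEL)
    text, position = [], 0
    while position < len(pinyin):
        for size in range(min(longest, len(pinyin) - position), 0, -1):
            key = tuple(pinyin[position:position + size])
            if key in SEMANTIC_MODEL:
                text.append(SEMANTIC_MODEL[key])
                position += size
                break
        else:
            raise ValueError(f"no mapping for pinyin {pinyin[position]!r}")
    return "".join(text)


def audio_to_text(voice: str) -> str:
    """Run the three stages in order to turn voice information into text."""
    return parse_semantics(to_pinyin(split_phonemes(voice)))
```

Calling `audio_to_text("P1P2P3")` walks through all three stages. The greedy longest-match strategy in `parse_semantics` is a simplification and does not resolve homophones the way a real semantic model would need to.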
The second text file acquisition unit 132 is configured to obtain the format information of the non-text file and judge whether the file is a picture and, if it is, recognize the file with the picture recognition model in the information recognition model to obtain a corresponding text file.
The format information of the non-text file is obtained and it is judged whether the file is a picture. If it is, the text contained in the file is recognized by the picture recognition model in the information recognition model to obtain a corresponding text file. Specifically, the picture recognition model contains character templates, which are template information used to recognize the text in a picture. Each character template corresponds to one character that may appear in a picture and includes multiple fonts of that character, so that any character in the picture can be matched to one of the fonts in the corresponding character template. By matching the character templates in the picture recognition model against the picture, the text contained in the picture can be recognized to obtain a corresponding text file.
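The template matching idea can be sketched with a deliberately simplified representation. Each glyph in the picture is assumed to be already cut out as a small bitmap, written here as a tuple of strings, and each character template lists several font variants of one character. The names, the bitmaps, and the exact-equality match are all assumptions. Practical character recognition uses tolerant matching and also has to locate the glyphs first.

```python
from typing import Dict, List, Optional, Tuple

Glyph = Tuple[str, ...]  # one bitmap row per string, '#' marks ink

# Hypothetical character templates: character -> bitmaps in several fonts.
CHARACTER_TEMPLATES: Dict[str, List[Glyph]] = {
    "I": [("#", "#", "#"), ("###", " # ", "###")],
    "L": [("# ", "# ", "##")],
}


def recognize_glyph(glyph: Glyph) -> Optional[str]:
    """Return the character whose template contains this glyph, if any."""
    for character, fonts in CHARACTER_TEMPLATES.items():
        if glyph in fonts:
            return character
    return None


def picture_to_text(glyphs: List[Glyph]) -> str:
    """Recognize every glyph in reading order and skip unmatched ones."""
    recognized = (recognize_glyph(glyph) for glyph in glyphs)
    return "".join(character for character in recognized if character is not None)


sample_picture = [("###", " # ", "###"), ("# ", "# ", "##"), ("??",)]
sample_text = picture_to_text(sample_picture)
```

Here the first two glyphs match a template and the third does not, so it is skipped rather than guessed.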
The third text file acquisition unit 133 is configured to obtain the format information of the non-text file and judge whether the file is a video file and, if it is, recognize the file with the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
The format information of the non-text file is obtained and it is judged whether the file is a video file. If it is, the file is recognized by the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file. For a video file, the voice information in the video file is first extracted, then recognized and converted by the audio recognition model to obtain the text information corresponding to the voice information. The specific recognition and conversion method is the same as the method executed in the first text file acquisition unit 131. Each frame contained in the video file is then obtained and recognized by the picture recognition model to obtain the text information contained in each frame. The specific recognition method is the same as the method executed in the second text file acquisition unit 132. Combining the text information corresponding to the voice information of the video file with the text information contained in each frame of the video file finally yields the text file corresponding to the video file. That is, each video file yields one text file after conversion.
In addition, if a non-text file is not a video file, an audio file, or a picture, the information recognition model cannot process it, and alarm prompt information is generated to notify the user that the file cannot be processed.
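The dispatch logic across the three acquisition units, including the fallback for unsupported files, can be sketched as follows. The two recognizer functions are placeholder stubs that only tag their input, and the `payload` dictionary layout is a hypothetical convention for this example. Raising `ValueError` stands in for generating the alarm prompt information.

```python
from typing import Dict, List, Union

Payload = Dict[str, Union[str, List[str]]]


def recognize_audio(audio: str) -> str:
    """Placeholder for the audio recognition model (not a real recognizer)."""
    return f"[speech:{audio}]"


def recognize_picture(picture: str) -> str:
    """Placeholder for the picture recognition model (not a real recognizer)."""
    return f"[text-in:{picture}]"


def convert_non_text(kind: str, payload: Payload) -> str:
    """Convert one non-text file to text according to its type."""
    if kind == "audio":
        return recognize_audio(payload["audio"])
    if kind == "picture":
        return recognize_picture(payload["picture"])
    if kind == "video":
        # A video yields a single text: its speech plus the text of every frame.
        parts = [recognize_audio(payload["audio"])]
        parts += [recognize_picture(frame) for frame in payload["frames"]]
        return "\n".join(parts)
    # Neither audio, picture nor video: report that the file cannot be processed.
    raise ValueError(f"cannot process file of type {kind!r}")


video_text = convert_non_text("video", {"audio": "track", "frames": ["f1", "f2"]})
```

A caller would catch the `ValueError` and show the message to the user, which corresponds to the alarm prompt described above.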
The information storage unit 140 is configured to save the text files in the newly added data information and/or the text files obtained by conversion into a preset data linked list.
The text files in the newly added data information and/or the text files obtained by conversion are saved into the preset data linked list so that the newly added data information is preserved. The data linked list is a database preset in the terminal device for storing information. Specifically, it stores the text files contained in newly added data information along a time axis. The logical order of the data information stored in the data linked list is realized by the order in which pointers link its nodes. In this embodiment, the publication timestamp of the newly added data information serves as the logical order of the data linked list. That is, the text files in the newly added data information are stored in the data linked list with pointers linked in time order. Because newly added data information is stored with time order as the logical order of the list, the user can obtain from the data linked list a list of text files ordered by time, and the information stored in the data linked list cannot be deleted.
Further, since other non-text files are converted into text files containing text information before being stored, the storage space required for the corresponding data information can be greatly reduced, which is convenient for users.
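A minimal sketch of a linked list whose pointer order follows the publication timestamp is shown below. The class names are hypothetical. The patent states that stored information cannot be deleted but does not describe the mechanism, so this sketch merely omits any delete or update method. That is an interface-level restriction, not tamper-proof storage.

```python
from dataclasses import dataclass
from typing import Iterator, Optional, Tuple


@dataclass
class Node:
    timestamp: float               # publication timestamp of the text file
    text: str                      # content of the stored text file
    next: Optional["Node"] = None  # pointer to the next node in time order


class DataLinkedList:
    """Singly linked list kept in ascending timestamp order, insert-only."""

    def __init__(self) -> None:
        self._head: Optional[Node] = None

    def insert(self, timestamp: float, text: str) -> None:
        node = Node(timestamp, text)
        if self._head is None or timestamp < self._head.timestamp:
            node.next = self._head
            self._head = node
            return
        current = self._head
        # Walk to the last node whose timestamp does not exceed the new one.
        while current.next is not None and current.next.timestamp <= timestamp:
            current = current.next
        node.next = current.next
        current.next = node

    def __iter__(self) -> Iterator[Tuple[float, str]]:
        current = self._head
        while current is not None:
            yield current.timestamp, current.text
            current = current.next


chain = DataLinkedList()
chain.insert(3.0, "third post")
chain.insert(1.0, "first post")
chain.insert(2.0, "second post")
ordered = list(chain)
```

Iterating over `chain` returns the text files in time order regardless of the order in which they were inserted.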
In other embodiments of the invention, as shown in Fig. 10, the information storage unit 140 includes the following subunits: an information acquisition unit 141, a file classification unit 142, and a file storage unit 143.
The information acquisition unit 141 is configured to obtain the publication source information and publication timestamp of the newly added data information on the web page to be monitored.
The publication source information and publication timestamp of the newly added data information on the web page to be monitored are obtained. To make it convenient to store the newly added data information and to retrieve the stored data information later, the publication source information and publication timestamp of the newly added data information must be obtained. Specifically, the publication source information includes the URL information of the web page to be monitored and the publisher of the data information.
The file classification unit 142 is configured to classify the text files contained in the newly added data information and/or the text files obtained by conversion according to the publication source information.
The text files contained in the newly added data information and/or the text files obtained by conversion are classified according to the publication source information. Specifically, to store newly added data information by category, the newly added data information must be classified according to the publisher in the publication source information. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list, so the newly added data information published by the same publisher is saved in the sub-list corresponding to its publication source information. Classifying the text files contained in the newly added data information and/or the text files obtained by conversion by the publisher in the publication source information thus allows the newly added data information to be saved by category.
The file storage unit 143 is configured to save, according to the publication timestamp, the text files contained in the newly added data information and/or the text files obtained by conversion into the sub-list of the preset data linked list that corresponds to the publication source information.
According to the publication timestamp, the text files contained in the newly added data information and/or the text files obtained by conversion are saved into the corresponding sub-list of the preset data linked list. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list. Because the data information in the data linked list is stored along a time axis, the text files corresponding to the newly added data information must be stored, according to the publication timestamp of the newly added data information, in the sub-list of the data linked list that corresponds to the publisher's category, which completes the saving of the newly added data information.
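Classification by publisher followed by time-ordered storage can be sketched as one sub-list per publisher. For brevity the sub-lists here are sorted Python lists maintained with `bisect.insort`, used as a stand-in for linked sub-lists. The class and method names are hypothetical.

```python
import bisect
from collections import defaultdict
from typing import DefaultDict, List, Tuple

Entry = Tuple[float, str]  # (publication timestamp, text file content)


class ClassifiedStore:
    """One time-ordered sub-list per publisher, insert-only."""

    def __init__(self) -> None:
        self._sublists: DefaultDict[str, List[Entry]] = defaultdict(list)

    def save(self, publisher: str, timestamp: float, text: str) -> None:
        # Pick the sub-list by publisher, then insert at the position that
        # keeps the sub-list in ascending timestamp order.
        bisect.insort(self._sublists[publisher], (timestamp, text))

    def history(self, publisher: str) -> List[Entry]:
        # Return a copy so that callers cannot alter the stored sub-list.
        return list(self._sublists.get(publisher, []))


store = ClassifiedStore()
store.save("alice", 2.0, "second")
store.save("bob", 1.0, "only")
store.save("alice", 1.0, "first")
alice_history = store.history("alice")
```

Retrieving `history("alice")` returns only that publisher's entries in time order, which is the retrieval pattern the classification is meant to support when collecting evidence on one publisher.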
Since the text files stored in the data linked list cannot be deleted or modified, the historical data information published on the web page to be monitored can be preserved, making it convenient to later collect evidence on the historical data information published by the corresponding publisher.
By monitoring the data information published on a web page, judging whether each file in it is a text file, converting non-text files into text files, and storing all text files in a data linked list, chained storage of Internet information is realized. This ensures that the stored text files cannot be deleted or modified, makes it easy for users to retrieve data information that has since been deleted from the Internet, and thereby helps users collect evidence on the relevant data information, which is of great practical value.
The above Internet information chained storage device may be implemented in the form of a computer program, which can run on a computer device as shown in Fig. 11.
Referring to Fig. 11, Fig. 11 is a schematic block diagram of the computer device provided by an embodiment of the present invention.
Referring to Fig. 11, the computer device 500 includes a processor 502, a memory, and a network interface 505 connected by a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. When executed, the computer program 5032 can cause the processor 502 to execute the Internet information chained storage method.
The processor 502 provides computing and control capabilities and supports the operation of the entire computer device 500.
The internal memory 504 provides an environment for running the computer program 5032 stored in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, it can cause the processor 502 to execute the Internet information chained storage method.
The network interface 505 is used for network communication, such as transmitting data information. Those skilled in the art will understand that the structure shown in Fig. 11 is only a block diagram of the part of the structure relevant to the solution of the present invention and does not limit the computer device 500 to which the solution is applied. A specific computer device 500 may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.
The processor 502 is configured to run the computer program 5032 stored in the memory to implement the following functions: obtaining the URL information of the web page to be monitored, and monitoring in real time, according to that URL information, the data information published on the web page to be monitored to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file into a text file by means of a preset information recognition model; and saving the text files in the newly added data information and/or the text files obtained by conversion into a preset data linked list.
In one embodiment, when executing the step of obtaining the URL information of the web page to be monitored and monitoring in real time, according to that URL information, the data information published on the web page to be monitored to obtain newly added data information, the processor 502 performs the following operations: if data information is detected to have been published on the web page to be monitored, generating publication source information according to the URL information of the web page to be monitored and the publisher of the data information; generating a publication timestamp according to the publication time of the data information; and obtaining all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
In one embodiment, when executing the step of converting a non-text file into a text file by means of the preset information recognition model if a file in the newly added data information is a non-text file, the processor 502 performs the following operations: obtaining the format information of the non-text file and judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain a corresponding text file; obtaining the format information of the non-text file and judging whether the file is a picture and, if it is, recognizing the file with the picture recognition model in the information recognition model to obtain a corresponding text file; and obtaining the format information of the non-text file and judging whether the file is a video file and, if it is, recognizing the file with the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
In one embodiment, when executing the step of obtaining the format information of the non-text file, judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain a corresponding text file, the processor 502 performs the following operations: segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the phonemes contained in the voice information; matching the obtained phonemes according to the speech feature dictionary in the audio recognition model to convert all phonemes into pinyin information; and performing semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
In one embodiment, when executing the step of saving the text files in the newly added data information and/or the text files obtained by conversion into the preset data linked list, the processor 502 performs the following operations: obtaining the publication source information and publication timestamp of the newly added data information on the web page to be monitored; classifying the text files contained in the newly added data information and/or the text files obtained by conversion according to the publication source information; and saving, according to the publication timestamp, the text files contained in the newly added data information and/or the text files obtained by conversion into the sub-list of the preset data linked list that corresponds to the publication source information.
Those skilled in the art will understand that the embodiment of the computer device shown in Fig. 11 does not limit the specific composition of the computer device. In other embodiments, the computer device may include more or fewer components than illustrated, combine certain components, or have a different arrangement of components. For example, in some embodiments the computer device may include only a memory and a processor. In such embodiments the structure and function of the memory and the processor are consistent with the embodiment shown in Fig. 11 and are not described again here.
It should be understood that, in the embodiments of the present invention, the processor 502 may be a central processing unit (CPU), or it may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
Another embodiment of the present invention provides a computer-readable storage medium. The computer-readable storage medium may be a non-volatile computer-readable storage medium. The computer-readable storage medium stores a computer program which, when executed by a processor, implements the following steps: obtaining the URL information of the web page to be monitored, and monitoring in real time, according to that URL information, the data information published on the web page to be monitored to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file into a text file by means of a preset information recognition model; and saving the text files in the newly added data information and/or the text files obtained by conversion into a preset data linked list.
In one embodiment, the step of obtaining the URL information of the web page to be monitored and monitoring in real time, according to that URL information, the data information published on the web page to be monitored to obtain newly added data information includes: if data information is detected to have been published on the web page to be monitored, generating publication source information according to the URL information of the web page to be monitored and the publisher of the data information; generating a publication timestamp according to the publication time of the data information; and obtaining all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
In one embodiment, the step of converting a non-text file into a text file by means of the preset information recognition model if a file in the newly added data information is a non-text file includes: obtaining the format information of the non-text file and judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain a corresponding text file; obtaining the format information of the non-text file and judging whether the file is a picture and, if it is, recognizing the file with the picture recognition model in the information recognition model to obtain a corresponding text file; and obtaining the format information of the non-text file and judging whether the file is a video file and, if it is, recognizing the file with the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
In one embodiment, the step of obtaining the format information of the non-text file, judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain a corresponding text file includes: segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the phonemes contained in the voice information; matching the obtained phonemes according to the speech feature dictionary in the audio recognition model to convert all phonemes into pinyin information; and performing semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
In one embodiment, the step of saving the text files in the newly added data information and/or the text files obtained by conversion into the preset data linked list includes: obtaining the publication source information and publication timestamp of the newly added data information on the web page to be monitored; classifying the text files contained in the newly added data information and/or the text files obtained by conversion according to the publication source information; and saving, according to the publication timestamp, the text files contained in the newly added data information and/or the text files obtained by conversion into the sub-list of the preset data linked list that corresponds to the publication source information.
It is apparent to those skilled in the art that for convenience of description and succinctly, foregoing description is set The specific work process of standby, device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein. Those of ordinary skill in the art may be aware that unit described in conjunction with the examples disclosed in the embodiments of the present disclosure and algorithm Step can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and software Interchangeability generally describes each exemplary composition and step according to function in the above description.These functions are studied carefully Unexpectedly the specific application and design constraint depending on technical solution are implemented in hardware or software.Professional technician Each specific application can be used different methods to achieve the described function, but this realization is it is not considered that exceed The scope of the present invention.
In several embodiments provided by the present invention, it should be understood that disclosed unit and method, it can be with It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit It divides, only logical function partition, there may be another division manner in actual implementation, can also will be with the same function Unit set is at a unit, such as multiple units or components can be combined or can be integrated into another system or some Feature can be ignored, or not execute.In addition, shown or discussed mutual coupling, direct-coupling or communication connection can Be through some interfaces, the indirect coupling or communication connection of device or unit, be also possible to electricity, mechanical or other shapes Formula connection.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.Some or all of unit therein can be selected to realize the embodiment of the present invention according to the actual needs Purpose.
It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, is also possible to two or more units and is integrated in one unit.It is above-mentioned integrated Unit both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product When, it can store in a computer readable storage medium.Based on this understanding, technical solution of the present invention substantially or Person says that all or part of the part that contributes to existing technology or the technical solution can body in the form of software products Reveal and, which is stored in a computer readable storage medium, including some instructions are used so that one Platform computer equipment (can be personal computer, server or the network equipment etc.) executes described in each embodiment of the present invention The all or part of the steps of method.And computer readable storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory The various media that can store program code such as (ROM, Read-Only Memory), magnetic or disk.
The above is merely a specific embodiment of the present invention, but the protection scope of the present invention is not limited thereto. Any person skilled in the art can readily conceive of various equivalent modifications or replacements within the technical scope disclosed by the present invention, and such modifications or replacements shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. An Internet information chain storage method, characterized by comprising:
obtaining website information of a webpage to be monitored, and monitoring in real time, according to the website information, the data information published on the webpage to be monitored to obtain newly added data information;
determining whether each file in the newly added data information is a text file;
if a file in the newly added data information is a non-text file, converting the non-text file into a text file by a preset information recognition model;
saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
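By way of a non-limiting sketch that forms no part of the claims, the judge, convert, and save steps of the method might be arranged as follows. The names `FileItem`, `is_text_file`, and `process_new_files`, the extension-based text test, and the use of a plain Python list in place of the preset data linked list are all assumptions made for illustration; the recognition model is passed in as a hypothetical `convert` callable.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Assumed definition of "text file": decided by extension only.
TEXT_EXTENSIONS = (".txt", ".html", ".md")


@dataclass
class FileItem:
    """One file found in newly added data information (hypothetical type)."""
    name: str
    content: bytes


def is_text_file(item: FileItem) -> bool:
    """Judge whether a file is a text file (assumed extension check)."""
    return item.name.lower().endswith(TEXT_EXTENSIONS)


def process_new_files(
    files: Iterable[FileItem],
    convert: Callable[[FileItem], str],
    store: List[str],
) -> List[str]:
    """Save text files directly; convert non-text files first, then save."""
    for item in files:
        if is_text_file(item):
            text = item.content.decode("utf-8", errors="replace")
        else:
            # Stand-in for the preset information recognition model.
            text = convert(item)
        store.append(text)
    return store
```

A caller would supply its own recognition function as `convert`; the sketch only fixes the order of the steps.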
2. The Internet information chain storage method according to claim 1, characterized in that monitoring in real time, according to the website information of the webpage to be monitored, the data information published on the webpage to be monitored to obtain newly added data information comprises:
if it is detected that data information is published on the webpage to be monitored, generating publication source information according to the website information of the webpage to be monitored and the publisher of the data information;
generating a publication timestamp according to the publication time of the data information;
obtaining all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
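As a non-limiting illustration outside the claims, newly added data information could be represented as a record that bundles the files with a publication source and a publication timestamp. The record type `NewDataInfo`, the function `build_new_data_info`, and the `site/publisher` string format for the source are assumptions; the source text does not specify how the source information is encoded.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Tuple


@dataclass(frozen=True)
class NewDataInfo:
    """Newly added data information (hypothetical record layout)."""
    source: str              # publication source information
    timestamp: float         # publication timestamp, seconds since the epoch
    files: Tuple[str, ...]   # all files in the published data information


def build_new_data_info(
    site: str, publisher: str, published_at: datetime, files: List[str]
) -> NewDataInfo:
    # Assumed encoding: website information and publisher joined by "/".
    source = f"{site}/{publisher}"
    # The timestamp is derived from the publication time of the data.
    timestamp = published_at.timestamp()
    return NewDataInfo(source, timestamp, tuple(files))
```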
3. The Internet information chain storage method according to claim 1, characterized in that converting the non-text file into a text file by the preset information recognition model comprises:
obtaining format information of the non-text file and determining whether the file is an audio file, and if the file is an audio file, recognizing the file by an audio recognition model in the information recognition model to obtain a corresponding text file;
obtaining format information of the non-text file and determining whether the file is a picture, and if the file is a picture, recognizing the file by a picture recognition model in the information recognition model to obtain a corresponding text file;
obtaining format information of the non-text file and determining whether the file is a video file, and if the file is a video file, recognizing the file by the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
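A non-limiting sketch of the format-based dispatch, outside the claims, is given below. The extension sets, the function name `convert_non_text`, and the choice to join audio and picture results with a newline for video are assumptions; the two recognizers are hypothetical callables supplied by the caller.

```python
import os
from typing import Callable

# Assumed mapping from file extension to media type.
AUDIO_EXT = {".mp3", ".wav"}
IMAGE_EXT = {".png", ".jpg", ".jpeg"}
VIDEO_EXT = {".mp4", ".avi"}


def convert_non_text(
    path: str,
    recognize_audio: Callable[[str], str],
    recognize_picture: Callable[[str], str],
) -> str:
    """Pick the recognition model according to the file's format information."""
    ext = os.path.splitext(path)[1].lower()
    if ext in AUDIO_EXT:
        return recognize_audio(path)
    if ext in IMAGE_EXT:
        return recognize_picture(path)
    if ext in VIDEO_EXT:
        # A video file goes through both models; joining the two results
        # with a newline is an assumption of this sketch.
        return recognize_audio(path) + "\n" + recognize_picture(path)
    raise ValueError(f"unsupported format: {ext!r}")
```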
4. The Internet information chain storage method according to claim 3, characterized in that, if the file is an audio file, recognizing the file by the audio recognition model in the information recognition model to obtain a corresponding text file comprises:
segmenting the voice information in the audio file according to an acoustic model in the audio recognition model to obtain a plurality of phonemes contained in the voice information;
matching the obtained phonemes against a phonetic feature dictionary in the audio recognition model to convert all the phonemes into pinyin information;
performing semantic parsing on the obtained pinyin information according to a semantic parsing model in the audio recognition model to obtain a text file containing text information.
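The last two steps can be illustrated, in a non-limiting way and outside the claims, with toy lookup tables. The sketch assumes the acoustic model has already segmented the audio into a phoneme list; the tables `PHONEME_TO_PINYIN` and `PINYIN_TO_TEXT`, the greedy longest-match strategy, and the table lookup standing in for a semantic parsing model are all hypothetical simplifications.

```python
from typing import Dict, List, Tuple

# Hypothetical toy phonetic feature dictionary: phoneme runs -> pinyin.
PHONEME_TO_PINYIN: Dict[Tuple[str, ...], str] = {
    ("n", "i"): "ni",
    ("h", "ao"): "hao",
}

# Hypothetical stand-in for the semantic parsing model: pinyin -> text.
PINYIN_TO_TEXT: Dict[Tuple[str, ...], str] = {
    ("ni", "hao"): "你好",
}


def phonemes_to_pinyin(phonemes: List[str]) -> List[str]:
    """Match phoneme runs against the dictionary, longest run first."""
    result: List[str] = []
    max_len = max(len(key) for key in PHONEME_TO_PINYIN)
    i = 0
    while i < len(phonemes):
        for size in range(min(max_len, len(phonemes) - i), 0, -1):
            key = tuple(phonemes[i:i + size])
            if key in PHONEME_TO_PINYIN:
                result.append(PHONEME_TO_PINYIN[key])
                i += size
                break
        else:
            # No dictionary entry starts at this phoneme.
            raise KeyError(f"no pinyin for phoneme {phonemes[i]!r}")
    return result


def pinyin_to_text(pinyin: List[str]) -> str:
    """Resolve a pinyin sequence to text by table lookup."""
    return PINYIN_TO_TEXT[tuple(pinyin)]
```

A real system would replace both tables with trained models; the sketch only shows how the output of each stage feeds the next.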
5. The Internet information chain storage method according to claim 2, characterized in that saving the text files in the newly added data information and/or the converted text files into the preset data linked list comprises:
obtaining the publication source information and the publication timestamp of the newly added data information on the webpage to be monitored;
classifying the text files contained in the newly added data information and/or the converted text files according to the publication source information;
saving the text files contained in the newly added data information and/or the converted text files, according to the publication timestamp, into the sub-list of the preset data linked list that corresponds to the publication source information.
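One possible arrangement of the storage step, offered as a non-limiting sketch outside the claims, keeps one singly linked sub-list per publication source and inserts each text in timestamp order. The class names `Node` and `SourceLinkedStore`, the use of a dictionary to index the sub-lists by source, and the ascending timestamp order are assumptions not stated in the source text.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Node:
    """One stored text in a sub-list (hypothetical node layout)."""
    timestamp: float
    text: str
    next: Optional["Node"] = None


class SourceLinkedStore:
    """Sub-lists keyed by publication source, each sorted by timestamp."""

    def __init__(self) -> None:
        # Assumption: a dict indexes the head node of each sub-list.
        self.heads: Dict[str, Optional[Node]] = {}

    def save(self, source: str, timestamp: float, text: str) -> None:
        node = Node(timestamp, text)
        head = self.heads.get(source)
        if head is None or timestamp < head.timestamp:
            # New sub-list, or the new node becomes the earliest entry.
            node.next = head
            self.heads[source] = node
            return
        cur = head
        # Walk to the last node whose timestamp is not later than the new one.
        while cur.next is not None and cur.next.timestamp <= timestamp:
            cur = cur.next
        node.next = cur.next
        cur.next = node

    def texts(self, source: str) -> List[str]:
        """Return the texts of one sub-list in stored order."""
        out: List[str] = []
        cur = self.heads.get(source)
        while cur is not None:
            out.append(cur.text)
            cur = cur.next
        return out
```

Classification by source falls out of the per-source sub-lists, so a lookup for one publisher never has to scan the entries of another.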
6. An Internet information chain storage device, characterized by comprising:
a webpage monitoring unit, configured to obtain website information of a webpage to be monitored, and to monitor in real time, according to the website information, the data information published on the webpage to be monitored to obtain newly added data information;
a judging unit, configured to determine whether each file in the newly added data information is a text file;
an information conversion unit, configured to convert, if a file in the newly added data information is a non-text file, the non-text file into a text file by a preset information recognition model;
an information storage unit, configured to save the text files in the newly added data information and/or the converted text files into a preset data linked list.
7. The Internet information chain storage device according to claim 6, characterized in that the webpage monitoring unit comprises:
a publication source information generating unit, configured to generate, if it is detected that data information is published on the webpage to be monitored, publication source information according to the website information of the webpage to be monitored and the publisher of the data information;
a publication timestamp generating unit, configured to generate a publication timestamp according to the publication time of the data information;
a newly added data information obtaining unit, configured to obtain all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
8. The Internet information chain storage device according to claim 6, characterized in that the information conversion unit comprises:
a first text file obtaining unit, configured to obtain format information of the non-text file and determine whether the file is an audio file, and if the file is an audio file, to recognize the file by an audio recognition model in the information recognition model to obtain a corresponding text file;
a second text file obtaining unit, configured to obtain format information of the non-text file and determine whether the file is a picture, and if the file is a picture, to recognize the file by a picture recognition model in the information recognition model to obtain a corresponding text file;
a third text file obtaining unit, configured to obtain format information of the non-text file and determine whether the file is a video file, and if the file is a video file, to recognize the file by the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
9. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the computer program, implements the Internet information chain storage method according to any one of claims 1 to 5.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to execute the Internet information chain storage method according to any one of claims 1 to 5.
CN201811526834.0A 2018-12-13 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium Active CN109657181B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201811526834.0A CN109657181B (en) 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium
PCT/CN2019/092551 WO2020119064A1 (en) 2018-12-13 2019-06-24 Method and device for storing internet information in linked manner, computer apparatus and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811526834.0A CN109657181B (en) 2018-12-13 Internet information chain storage method, device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109657181A (en) 2019-04-19
CN109657181B (en) 2024-05-14

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125345A (en) * 2019-12-24 2020-05-08 南京三百云信息科技有限公司 Data application method and device
WO2020119064A1 (en) * 2018-12-13 2020-06-18 平安科技(深圳)有限公司 Method and device for storing internet information in linked manner, computer apparatus and storage medium
CN112104747A (en) * 2020-10-30 2020-12-18 广州市玄武无线科技股份有限公司 Request response system based on chain processing
WO2021000496A1 (en) * 2019-07-04 2021-01-07 平安科技(深圳)有限公司 Information chain generation method and apparatus, and computer device and storage medium
CN112784077A (en) * 2021-03-17 2021-05-11 陕西省大数据集团有限公司 Method and device for classified extraction of data asset value

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101364955A (en) * 2008-09-28 2009-02-11 杭州电子科技大学 Method for analyzing and extracting evidence of e-mail customer terminal
US20090157407A1 (en) * 2007-12-12 2009-06-18 Nokia Corporation Methods, Apparatuses, and Computer Program Products for Semantic Media Conversion From Source Files to Audio/Video Files
CN101882162A (en) * 2010-06-29 2010-11-10 北京搜狗科技发展有限公司 Method and system for transmitting network information
CN103942639A (en) * 2014-03-21 2014-07-23 宁波中小在线信息服务有限公司 Policy management system and method for policy consultative service system
CN106412678A (en) * 2016-09-14 2017-02-15 安徽声讯信息技术有限公司 Method and system for transcribing and storing video news in real time
US20170062010A1 (en) * 2015-09-02 2017-03-02 Yahoo! Inc. Computerized system and method for formatted transcription of multimedia content
CN107680602A (en) * 2017-08-24 2018-02-09 平安科技(深圳)有限公司 Voice fraud recognition methods, device, terminal device and storage medium
CN108829765A (en) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 A kind of information query method, device, computer equipment and storage medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020119064A1 (en) * 2018-12-13 2020-06-18 平安科技(深圳)有限公司 Method and device for storing internet information in linked manner, computer apparatus and storage medium
WO2021000496A1 (en) * 2019-07-04 2021-01-07 平安科技(深圳)有限公司 Information chain generation method and apparatus, and computer device and storage medium
CN111125345A (en) * 2019-12-24 2020-05-08 南京三百云信息科技有限公司 Data application method and device
CN111125345B (en) * 2019-12-24 2024-04-16 南京三百云信息科技有限公司 Data application method and device
CN112104747A (en) * 2020-10-30 2020-12-18 广州市玄武无线科技股份有限公司 Request response system based on chain processing
CN112784077A (en) * 2021-03-17 2021-05-11 陕西省大数据集团有限公司 Method and device for classified extraction of data asset value

Also Published As

Publication number Publication date
WO2020119064A1 (en) 2020-06-18

Similar Documents

Publication Publication Date Title
US20140278426A1 (en) Data shredding for speech recognition acoustic model training under data retention restrictions
US10832803B2 (en) Automated system and method for improving healthcare communication
WO2019095586A1 (en) Meeting minutes generation method, application server, and computer readable storage medium
CN104485105B (en) A kind of electronic health record generation method and electronic medical record system
US9514740B2 (en) Data shredding for speech recognition language model training under data retention restrictions
US11580951B2 (en) Speaker identity and content de-identification
CN110334110A (en) Natural language classification method, device, computer equipment and storage medium
JP2019061662A (en) Method and apparatus for extracting information
JP6019604B2 (en) Speech recognition apparatus, speech recognition method, and program
US20160189107A1 (en) Apparatus and method for automatically creating and recording minutes of meeting
US11114113B2 (en) Multilingual system for early detection of neurodegenerative and psychiatric disorders
CN111353065A (en) Voice archive storage method, device, equipment and computer readable storage medium
WO2019227629A1 (en) Text information generation method and apparatus, computer device and storage medium
CN113190675A (en) Text abstract generation method and device, computer equipment and storage medium
JP6179971B2 (en) Information providing apparatus and information providing method
CN113326696B (en) Text generation method and device
CN109243549B (en) Intelligent follow-up method and device and server
US10825558B2 (en) Method for improving healthcare
CN112309372B (en) Intent recognition method, device, equipment and storage medium based on intonation
EP3809411A1 (en) Multi-lingual system for early detection of alzheimer's disease
CN108962228A (en) model training method and device
CN109524009B (en) Policy entry method and related device based on voice recognition
JP2019520614A (en) Risk event recognition system based on SNS information, method, electronic device and storage medium
CN109657181A (en) Internet information chain type storage method, device, computer equipment and storage medium
US11431472B1 (en) Automated domain language parsing and data extraction

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant