CN109657181A - Internet information chain type storage method, device, computer equipment and storage medium - Google Patents
Internet information chain type storage method, device, computer equipment and storage medium
- Publication number
- CN109657181A (CN201811526834.0A)
- Authority
- CN
- China
- Prior art keywords
- information
- file
- data information
- newly
- text file
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Abstract
The invention discloses an internet information chain type storage method, a device, computer equipment and a storage medium. The method includes: obtaining the URL information of a webpage to be monitored, and monitoring in real time the data information published on that webpage according to the URL information to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information recognition model; and saving the text files in the newly added data information and/or the converted text files into a preset data linked list. Based on data storage and compression technology, the invention ensures that information stored in the data linked list cannot be deleted or modified and makes it convenient for a user to obtain data information that has been deleted from the internet. This helps the user collect evidence from the relevant data information and gives the invention considerable practical value.
Description
Technical field
The present invention relates to the field of computer technology, and in particular to an internet information chain type storage method, device, computer equipment and storage medium.
Background technique
Each webpage on the internet holds a massive amount of data information, and newly added data information gradually replaces the data information already saved on a webpage, so the data information on a webpage changes over time. Existing methods of storing internet data information cannot obtain data information that has been deleted or modified on the internet, which makes it very difficult in judicial practice to collect evidence from relevant data information published on the internet. In short, existing data information storage methods cannot obtain deleted data information.
Summary of the invention
The embodiments of the present invention provide an internet information chain type storage method, device, computer equipment and storage medium, intended to solve the problem that data information storage methods in the prior art cannot obtain deleted data information.
In a first aspect, an embodiment of the present invention provides an internet information chain type storage method, comprising:
obtaining the URL information of a webpage to be monitored, and monitoring in real time the data information published on the webpage to be monitored according to that URL information to obtain newly added data information;
judging whether each file in the newly added data information is a text file;
if a file in the newly added data information is a non-text file, converting the non-text file into a text file through a preset information recognition model;
saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
In a second aspect, an embodiment of the present invention provides an internet information chain type storage device, comprising:
a webpage monitoring unit, configured to obtain the URL information of a webpage to be monitored, and to monitor in real time the data information published on the webpage to be monitored according to that URL information to obtain newly added data information;
a judging unit, configured to judge whether each file in the newly added data information is a text file;
an information conversion unit, configured to convert a non-text file into a text file through a preset information recognition model if a file in the newly added data information is a non-text file;
an information storage unit, configured to save the text files in the newly added data information and/or the converted text files into a preset data linked list.
In a third aspect, an embodiment of the present invention further provides computer equipment comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the internet information chain type storage method of the first aspect when executing the computer program.
In a fourth aspect, an embodiment of the present invention also provides a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to execute the internet information chain type storage method of the first aspect.
The embodiments of the present invention provide an internet information chain type storage method, device, computer equipment and storage medium. The data information published on a webpage is monitored, each file in it is judged to be a text file or not, non-text files are converted into text files, and all text files are stored in a data linked list, which realizes chain type storage of internet information. This ensures that the stored text files cannot be deleted or modified and makes it convenient for a user to obtain data information that has been deleted from the internet. It thereby helps the user collect evidence from the relevant data information and has considerable practical value.
Detailed description of the invention
To describe the technical solutions of the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. Obviously, the drawings in the following description show some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow diagram of the internet information chain type storage method provided by an embodiment of the present invention;
Fig. 2 is a sub-flow diagram of the internet information chain type storage method provided by an embodiment of the present invention;
Fig. 3 is another sub-flow diagram of the internet information chain type storage method provided by an embodiment of the present invention;
Fig. 4 is another sub-flow diagram of the internet information chain type storage method provided by an embodiment of the present invention;
Fig. 5 is another sub-flow diagram of the internet information chain type storage method provided by an embodiment of the present invention;
Fig. 6 is a schematic block diagram of the internet information chain type storage device provided by an embodiment of the present invention;
Fig. 7 is a schematic block diagram of a subunit of the internet information chain type storage device provided by an embodiment of the present invention;
Fig. 8 is a schematic block diagram of another subunit of the internet information chain type storage device provided by an embodiment of the present invention;
Fig. 9 is a schematic block diagram of another subunit of the internet information chain type storage device provided by an embodiment of the present invention;
Fig. 10 is a schematic block diagram of another subunit of the internet information chain type storage device provided by an embodiment of the present invention;
Fig. 11 is a schematic block diagram of the computer equipment provided by an embodiment of the present invention.
Specific embodiment
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art from the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
It should be understood that, when used in this specification and the appended claims, the terms "includes" and "comprises" indicate the presence of the described features, wholes, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, wholes, steps, operations, elements, components and/or sets thereof.
It should also be understood that the terms used in this specification are only for the purpose of describing specific embodiments and are not intended to limit the present invention. As used in this specification and the appended claims, the singular forms "a", "an" and "the" are intended to include the plural forms unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" used in this specification and the appended claims refers to any combination and all possible combinations of one or more of the associated listed items, and includes these combinations.
Referring to Fig. 1, Fig. 1 is a flow diagram of the internet information chain type storage method provided by an embodiment of the present invention. The method is applied in a terminal device with an information storage function, such as a desktop computer, laptop computer, tablet computer or mobile phone.
As shown in Fig. 1, the method includes steps S110 to S140.
S110, obtaining the URL information of the webpage to be monitored, and monitoring in real time the data information published on the webpage to be monitored according to that URL information to obtain newly added data information.
The information on the webpage to be monitored is the URL information of that webpage, entered by the user. The webpage to be monitored can be a microblog, WeChat, an enterprise website, a government website, or any other page on which data information is published on the internet, and the publisher can be a person, an enterprise, an organization or a government department. For example, to monitor the information a celebrity publishes on his or her microblog, the information on the webpage to be monitored is the URL information of the celebrity's microblog page.
The data information published on the webpage to be monitored may include files in multiple formats, such as information in text format, video format, audio format and picture format. Because data information is published on the webpage in real time, the webpage must be monitored in real time to obtain the latest data information published on it. In one embodiment, as shown in Fig. 2, step S110 includes sub-steps S111, S112 and S113.
S111, if it is detected that data information has been published on the webpage to be monitored, generating publication source information according to the URL information of the webpage to be monitored and the publisher of the data information.
To identify the publisher of the newly added data information, corresponding publication source information is generated from the URL information of the webpage to be monitored and the publisher of the data information. The publication source information includes the URL information of the webpage to be monitored, that is, the webpage information entered by the user, and the publisher of the data information, that is, the party that published the newly added data information. The publisher can be a person, an enterprise, an organization or a government department.
S112, generating a publication timestamp according to the publication time of the data information.
To record the publication time of the newly added data information, a corresponding publication timestamp is generated from the publication time of the data information. The publication timestamp cannot be modified once generated, which ensures that the publication time of the newly added data information is recorded promptly and cannot be changed.
For example, if the webpage to be monitored is the microblog page of a celebrity, each microblog post carries a publication time, and that publication time is taken as the publication timestamp of the corresponding newly added data information.
S113, obtaining all files in the published data information together with the publication source information and the publication timestamp to obtain the newly added data information.
All files in the published data information are obtained and combined with the publication source information and publication timestamp generated above to form the newly added data information. The newly added data information may include one or more files.
For example, if the webpage to be monitored is the microblog page of a celebrity, and a passage of text and a video have been published on that page, then the published text and video, the publication source information and the publication timestamp are obtained, giving newly added data information that contains one text file and one video file.
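The record assembled in sub-steps S111 to S113 can be sketched as a small immutable data structure. This is only an illustration under stated assumptions: the class names, field names, the placeholder URL and the placeholder date are all hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Tuple


@dataclass(frozen=True)
class PublicationSource:
    """S111: URL of the monitored webpage plus the publisher (hypothetical layout)."""
    url: str
    publisher: str


@dataclass(frozen=True)
class NewDataInfo:
    """S113: all published files bundled with source and timestamp."""
    source: PublicationSource
    published_at: datetime   # S112: frozen, so the field cannot be reassigned
    files: Tuple[str, ...]   # names of every file in the published data


def build_new_data_info(url, publisher, published_at, files):
    """Combine the outputs of S111 and S112 with the published files."""
    source = PublicationSource(url=url, publisher=publisher)
    return NewDataInfo(source=source, published_at=published_at, files=tuple(files))


# Placeholder values, chosen only for illustration.
info = build_new_data_info(
    "https://example.com/celebrity",
    "celebrity",
    datetime(2018, 12, 13, 9, 30, tzinfo=timezone.utc),
    ["post.txt", "clip.avi"],
)
```

Freezing the dataclasses mirrors the requirement that the publication timestamp cannot be changed after it is generated, although it only prevents reassignment inside the running program and is not a tamper-proof guarantee.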
S120, judging whether each file in the newly added data information is a text file.
Before files of the various formats in the newly added data information can be saved, each file must first be judged to be a text file or not. Specifically, the format information of each file in the newly added data information is obtained and used to judge whether the file is a text file.
Every file has its own format information, and the format information of a file corresponds to a file type, so the specific type of a file can be determined from its format information.
For example, if the format information of a file is txt or string, the file is a text file; if it is wav, mp3 or wma, the file is an audio file; if it is avi, flv or rmvb, the file is a video file.
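A minimal sketch of the format check in step S120 follows, assuming the format information is the file name extension. The text, audio and video formats are the ones named above; the picture formats are an assumption, since the description lists none, and the function names are hypothetical.

```python
import os

TEXT_FORMATS = {"txt", "string"}
AUDIO_FORMATS = {"wav", "mp3", "wma"}
VIDEO_FORMATS = {"avi", "flv", "rmvb"}
PICTURE_FORMATS = {"jpg", "png"}  # assumption: not listed in the description


def file_kind(filename):
    """Map a file name to 'text', 'audio', 'video', 'picture' or 'unknown'."""
    extension = os.path.splitext(filename)[1].lstrip(".").lower()
    if extension in TEXT_FORMATS:
        return "text"
    if extension in AUDIO_FORMATS:
        return "audio"
    if extension in VIDEO_FORMATS:
        return "video"
    if extension in PICTURE_FORMATS:
        return "picture"
    return "unknown"


def is_text_file(filename):
    """S120: judge whether a file is a text file."""
    return file_kind(filename) == "text"
```

Files for which `is_text_file` returns false would then be handed to the conversion step.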
S130, if a file in the newly added data information is a non-text file, converting the non-text file into a text file through the preset information recognition model.
Non-text files include audio files, video files and pictures. The information recognition model is a model for recognizing and converting non-text files, and it includes an audio recognition model and a picture recognition model.
In one embodiment, as shown in Fig. 3, step S130 includes sub-steps S131, S132 and S133.
S131, obtaining the format information of the non-text file and judging whether the file is an audio file; if it is, recognizing the file through the audio recognition model in the information recognition model to obtain a corresponding text file.
The audio recognition model recognizes and converts the voice information in an audio file to obtain a text file containing the corresponding text information; each audio file yields one text file after conversion. The audio recognition model includes an acoustic model, a phonetic feature dictionary and a semantic parsing model.
In one embodiment, as shown in Fig. 4, step S131 includes sub-steps S1311, S1312 and S1313.
S1311, segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information.
Specifically, voice information is made up of the phonemes of multiple character pronunciations, and the phoneme of a character includes the frequency and timbre of that character's pronunciation. The acoustic model contains the phonemes of all character pronunciations. By matching the voice information against all the phonemes in the acoustic model, the phoneme of each individual character in the voice information can be segmented out, which finally gives the multiple phonemes contained in the voice information.
S1312, matching the obtained phonemes against the phonetic feature dictionary in the audio recognition model to convert all the phonemes into Pinyin information.
The phonetic feature dictionary contains the phoneme information corresponding to the Pinyin of every character. By matching each obtained phoneme against the phoneme information in the dictionary, the phoneme of a single character is converted into the character Pinyin that matches it, so that all phonemes contained in the voice information are converted into Pinyin information.
S1313, performing semantic parsing on the obtained Pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
The semantic parsing model contains the mapping relationships between Pinyin information and text information. Using these mappings, the obtained Pinyin information is semantically parsed and converted into a text file containing text information.
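The three sub-steps S1311 to S1313 form a pipeline from voice information to phonemes, to Pinyin, to text. The toy sketch below shows only that data flow. It is not a speech recognizer: the voice information is stood in for by a plain string, greedy longest-match lookup replaces real acoustic matching, and every table entry and function name is a hypothetical assumption.

```python
# Toy stand-ins for the three parts of the audio recognition model.
ACOUSTIC_MODEL = {"n", "i", "h", "ao"}                          # known phonemes
PHONETIC_DICTIONARY = {("n", "i"): "ni", ("h", "ao"): "hao"}    # phonemes -> Pinyin
SEMANTIC_MODEL = {("ni", "hao"): "你好"}                         # Pinyin -> text


def segment(voice, phonemes):
    """S1311: split the voice string into phonemes by greedy longest match."""
    result, position = [], 0
    longest = max(len(phoneme) for phoneme in phonemes)
    while position < len(voice):
        for size in range(longest, 0, -1):
            piece = voice[position:position + size]
            if piece in phonemes:
                result.append(piece)
                position += len(piece)
                break
        else:
            raise ValueError(f"no phoneme matches at position {position}")
    return result


def to_pinyin(phoneme_list, dictionary):
    """S1312: group phonemes into Pinyin syllables using the dictionary."""
    result, position = [], 0
    longest = max(len(key) for key in dictionary)
    while position < len(phoneme_list):
        for size in range(longest, 0, -1):
            key = tuple(phoneme_list[position:position + size])
            if key in dictionary:
                result.append(dictionary[key])
                position += len(key)
                break
        else:
            raise ValueError(f"no Pinyin matches at phoneme {position}")
    return result


def to_text(pinyin_list, semantic_model):
    """S1313: map the Pinyin sequence to text through the stored mapping."""
    return semantic_model[tuple(pinyin_list)]


phonemes = segment("nihao", ACOUSTIC_MODEL)
pinyin = to_pinyin(phonemes, PHONETIC_DICTIONARY)
text = to_text(pinyin, SEMANTIC_MODEL)
```

A real system would operate on audio features rather than strings, but the staging of the three lookups would be the same.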
S132, obtaining the format information of the non-text file and judging whether the file is a picture; if it is, recognizing the file through the picture recognition model in the information recognition model to obtain a corresponding text file.
If the file is a picture, the characters contained in it are recognized through the picture recognition model to obtain a corresponding text file. Specifically, the picture recognition model contains character templates, which are template information used to recognize characters in a picture. Each character template corresponds to one character and includes the multiple fonts in which that character can appear, so any character in a picture can be matched to one of the fonts in the corresponding character template. By matching the character templates in the picture recognition model against the picture, the characters contained in the picture are recognized and a corresponding text file is obtained.
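The template matching described for step S132 can be illustrated with a toy example. The sketch assumes the picture has already been cut into individual glyph bitmaps and uses exact equality in place of real image matching; the template table, the bitmap encoding and the function names are all hypothetical.

```python
# Hypothetical character templates: each character maps to several font variants,
# and each variant is a small bitmap written as a tuple of rows.
CHARACTER_TEMPLATES = {
    "I": [("#", "#", "#"), ("###", ".#.", "###")],
    "L": [("#.", "#.", "##")],
}


def recognize_glyph(glyph, templates):
    """Return the character whose template holds a font variant equal to the glyph."""
    for character, variants in templates.items():
        if glyph in variants:
            return character
    return None  # no template matches this glyph


def recognize_picture(glyphs, templates=CHARACTER_TEMPLATES):
    """Recognize every glyph in a picture and join the matched characters into text."""
    matched = (recognize_glyph(glyph, templates) for glyph in glyphs)
    return "".join(character for character in matched if character is not None)


picture = [("#.", "#.", "##"), ("###", ".#.", "###")]
text = recognize_picture(picture)
```

Keeping several font variants under one character is what lets the same character be recognized regardless of the font used in the picture.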
S133, obtaining the format information of the non-text file and judging whether the file is a video file; if it is, recognizing the file through the audio recognition model and the picture recognition model in the information recognition model to obtain a corresponding text file.
If the file is a video file, the voice information in the video file is first extracted and then recognized and converted through the audio recognition model to obtain the text information corresponding to the voice information; the recognition and conversion method is the same as in step S131. Each frame contained in the video file is then extracted and recognized through the picture recognition model to obtain the text information contained in each frame; the recognition method is the same as in step S132. Combining the text information corresponding to the voice information with the text information contained in each frame gives the text file corresponding to the video file; that is, each video file yields one text file after conversion.
In addition, if the non-text file is not a video file, an audio file or a picture, the information recognition model cannot process it, and an alert message is generated to inform the user that the file cannot be processed.
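The branching across sub-steps S131 to S133, including the fallback for unsupported files, can be sketched as a single dispatch function. The sketch assumes the two recognition models are passed in as callables and that a video is represented as a dictionary with an audio track and a list of frames. Both choices, the stub models, and the use of an exception as the alert are illustrative assumptions.

```python
def convert_to_text(kind, payload, recognize_audio, recognize_picture):
    """Route a non-text file to the matching recognition model."""
    if kind == "audio":                      # S131
        return recognize_audio(payload)
    if kind == "picture":                    # S132
        return recognize_picture(payload)
    if kind == "video":                      # S133: audio track plus every frame
        parts = [recognize_audio(payload["audio"])]
        parts.extend(recognize_picture(frame) for frame in payload["frames"])
        return "\n".join(part for part in parts if part)
    # Stand-in for the alert message shown to the user.
    raise ValueError(f"cannot process file of kind {kind!r}")


# Stub models so the sketch runs; real models would replace these.
def stub_audio_model(voice):
    return voice.upper()


def stub_picture_model(image):
    return image.strip()


video = {"audio": "hello", "frames": [" sign ", "  "]}
video_text = convert_to_text("video", video, stub_audio_model, stub_picture_model)
```

Each video yields a single text result, which matches the statement that every video file corresponds to one text file after conversion.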
S140, saving the text files in the newly added data information and/or the converted text files into the preset data linked list.
The text files in the newly added data information and/or the converted text files are saved into the preset data linked list in order to preserve the newly added data information. The data linked list is a database preset in the terminal device for storing information. Specifically, it stores the text files contained in newly added data information along a time axis. The logical order of the data information stored in the data linked list is realized by the order in which the pointers in the list are linked. In this embodiment, the publication timestamp of the newly added data information serves as the logical order of the data linked list; that is, the text files in the newly added data information are stored in the data linked list with the pointers linked in time order. Because the newly added data information is stored with time order as the logical order of the list, the user can obtain from the data linked list a list of text files sorted by time, and the information stored in the data linked list cannot be deleted.
Furthermore, because the other non-text files are converted into text files containing text information before being stored, the storage space required for the corresponding data information is greatly reduced, which is convenient for the user.
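A data linked list whose pointer order follows the publication timestamp can be sketched as follows. The class and method names are hypothetical, and timestamps are plain integers for brevity. The sketch simply offers no delete or update operation; it does not by itself prove that stored data cannot be altered, which the description states as a property of the list without giving the mechanism.

```python
class Node:
    """One stored text file plus the pointer to the next node in time order."""
    __slots__ = ("timestamp", "text", "next")

    def __init__(self, timestamp, text):
        self.timestamp = timestamp
        self.text = text
        self.next = None


class DataLinkedList:
    """Singly linked list kept in publication order; insert and read only."""

    def __init__(self):
        self._head = None

    def insert(self, timestamp, text):
        node = Node(timestamp, text)
        if self._head is None or timestamp < self._head.timestamp:
            node.next = self._head
            self._head = node
            return
        current = self._head
        # Walk past every node published at or before the new timestamp.
        while current.next is not None and current.next.timestamp <= timestamp:
            current = current.next
        node.next = current.next
        current.next = node

    def __iter__(self):
        current = self._head
        while current is not None:
            yield current.timestamp, current.text
            current = current.next


archive = DataLinkedList()
archive.insert(30, "third post")
archive.insert(10, "first post")
archive.insert(20, "second post")
```

Iterating over `archive` then returns the stored text files sorted by publication time, whatever order they were inserted in.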
In one embodiment, as shown in Fig. 5, step S140 includes sub-steps S141, S142 and S143.
S141, obtaining the publication source information and publication timestamp of the newly added data information on the webpage to be monitored.
To make it easier to store the newly added data information and to retrieve the stored data information later, the publication source information and publication timestamp of the newly added data information are obtained. Specifically, the publication source information includes the URL information of the webpage to be monitored and the publisher of the data information.
S142, classifying the text files contained in the newly added data information and/or the converted text files according to the publication source information.
Specifically, to store the newly added data information by category, it is classified according to the publisher in the publication source information. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list, so newly added data information published by the same publisher is assigned to the sub-list corresponding to its publication source information. Classifying the text files contained in the newly added data information and/or the converted text files by publisher in this way allows the newly added data information to be saved by category.
S143, saving the text files contained in the newly added data information and/or the converted text files, according to the publication timestamp, into the sub-list of the preset data linked list that corresponds to the publication source information.
Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list. Because the data information in the data linked list is stored along a time axis, the text files in the newly added data information are stored, according to the publication timestamp of the newly added data information, into the sub-list of the data linked list that corresponds to the publisher's category, which completes the saving of the newly added data information.
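Sub-steps S141 to S143 amount to keeping one time-ordered sub-list per publisher. In the sketch below a Python list kept sorted with `bisect.insort` stands in for each sub-list of the data linked list; the class name, method names and sample values are hypothetical.

```python
import bisect
from collections import defaultdict


class ClassifiedStore:
    """One time-ordered sub-list per publisher; save and read only."""

    def __init__(self):
        self._sublists = defaultdict(list)

    def save(self, publisher, timestamp, text):
        """S142 picks the sub-list by publisher, S143 inserts by timestamp."""
        bisect.insort(self._sublists[publisher], (timestamp, text))

    def history(self, publisher):
        """Return a copy of the publisher's stored files in time order."""
        return list(self._sublists.get(publisher, []))


store = ClassifiedStore()
store.save("celebrity", 20, "second post")
store.save("celebrity", 10, "first post")
store.save("agency", 15, "notice")
```

Calling `store.history("celebrity")` then yields that publisher's text files in publication order, which is the lookup needed when collecting evidence about a single publisher.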
Because the text files stored in the data linked list cannot be deleted or modified, the historical data information published on the webpage to be monitored is preserved, which makes it easier to later collect evidence from the historical data information published by the corresponding publisher.
By monitoring the data information published on a webpage, judging whether each file in it is a text file, converting non-text files into text files, and storing all text files in the data linked list, chain type storage of internet information is realized. This ensures that the stored text files cannot be deleted or modified and makes it convenient for a user to obtain data information that has been deleted from the internet. It thereby helps the user collect evidence from the relevant data information and has considerable practical value.
An embodiment of the present invention also provides an internet information chain type storage device, which is used to execute any embodiment of the aforementioned internet information chain type storage method. Specifically, referring to Fig. 6, Fig. 6 is a schematic block diagram of the internet information chain type storage device provided by an embodiment of the present invention. The device can be configured in a terminal device such as a desktop computer, laptop computer, tablet computer or mobile phone.
As shown in Fig. 6, the internet information chain type storage device 100 includes a webpage monitoring unit 110, a judging unit 120, an information conversion unit 130 and an information storage unit 140.
The web monitoring unit 110 is configured to obtain the website information of the webpage to be monitored and, according to that website information, monitor in real time the data information published on the webpage to be monitored to obtain newly added data information.
The website information of the webpage to be monitored is obtained, and the data information published on that webpage is monitored in real time according to the website information to obtain newly added data information. The webpage information to be monitored is the website information of the webpage to be monitored as entered by the user. The webpage to be monitored can be any page on which data information is published on the internet, such as a Weibo page, a WeChat page, an enterprise website, or a government website, and the publisher can be an individual, an enterprise, an organization, or a government department. For example, to monitor the information that a certain celebrity publishes on Weibo, the webpage information to be monitored is the website information of that celebrity's Weibo page.
The data information published on the webpage to be monitored may include files in multiple formats, such as text, video, audio, and pictures. Because data information is published on the webpage in real time, the webpage must be monitored in real time to obtain the latest data information published on it.
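The source does not say how newly published data is detected during monitoring. One possible reading is sketched below, assuming that every published item carries a stable identifier and that the page is polled repeatedly; the function name `detect_new_items` and the ID-based comparison are illustrative assumptions, not part of the described method.

```python
from typing import Dict, Set


def detect_new_items(seen_ids: Set[str], current_items: Dict[str, str]) -> Dict[str, str]:
    """Return items whose IDs have not been seen before and remember them.

    seen_ids and the item IDs are hypothetical; the patent does not specify
    how a newly published item is told apart from an old one.
    """
    new_items = {item_id: body for item_id, body in current_items.items()
                 if item_id not in seen_ids}
    seen_ids.update(new_items)  # later polls will skip these IDs
    return new_items
```

Calling the function once per poll yields only the items that appeared since the previous poll, which is the "newly added data information" in the sense used above.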
In other embodiments of the invention, as shown in Fig. 7, the web monitoring unit 110 includes the following subunits: a publication source information generating unit 111, a publication timestamp generating unit 112, and a newly added data information acquiring unit 113.
The publication source information generating unit 111 is configured to, if data information is detected being published on the webpage to be monitored, generate publication source information according to the website information of the webpage to be monitored and the publisher of the data information.
If data information is detected being published on the webpage to be monitored, publication source information is generated according to the website information of the webpage and the publisher of the data information. To identify the publisher of newly added data information, the corresponding publication source information must be generated from these two items. The publication source information includes the website information of the webpage to be monitored, that is, the webpage information entered by the user, and the publisher of the data information, that is, the entity that published the newly added data information. The publisher can be an individual, an enterprise, an organization, or a government department.
The publication timestamp generating unit 112 is configured to generate a publication timestamp according to the publication time of the data information.
A publication timestamp is generated according to the publication time of the data information. To record the publication time of newly added data information, a corresponding publication timestamp must be generated from that publication time. The timestamp cannot be modified once generated, which ensures that the publication time of the newly added data information is recorded promptly and cannot be altered.
The newly added data information acquiring unit 113 is configured to obtain all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
All files in the published data information are obtained, together with the publication source information and the publication timestamp, to obtain the newly added data information. That is, all files in the published data information are combined with the generated publication source information and publication timestamp to form the newly added data information, which may include one or more files.
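As a minimal sketch of the record assembled by units 111 to 113, the newly added data information can be modeled as an immutable value that bundles the publication source information, the publication timestamp, and the file list. The class and field names below are hypothetical, and representing the website information as a URL string is an assumption. A frozen dataclass only prevents in-process reassignment; it is an illustration of the "cannot be modified once generated" property, not a tamper-proof mechanism.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Iterable, Tuple


@dataclass(frozen=True)
class PublicationSource:
    url: str        # website information of the webpage to be monitored (assumed to be a URL)
    publisher: str  # individual, enterprise, organization or government department


@dataclass(frozen=True)
class NewDataInfo:
    source: PublicationSource
    timestamp: float         # publication timestamp, seconds since the Unix epoch
    files: Tuple[str, ...]   # names of all files in the published data information


def build_new_data_info(url: str, publisher: str,
                        published_at: datetime, files: Iterable[str]) -> NewDataInfo:
    """Combine source, timestamp and files into one immutable record."""
    source = PublicationSource(url=url, publisher=publisher)
    return NewDataInfo(source=source,
                       timestamp=published_at.timestamp(),
                       files=tuple(files))
```

Assigning to `record.timestamp` after construction raises an error, which mirrors the requirement that the timestamp stays fixed once it has been generated.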
The judging unit 120 is configured to judge whether each file in the newly added data information is a text file.
Whether each file in the newly added data information is a text file is judged. To save the files of various formats in the newly added data information, it must first be judged whether each file is a text file. Specifically, the format information of each file in the newly added data information is obtained to judge whether that file is a text file.
The format information of each file in the newly added data information is obtained. Every file has its own format information, and the format information of different files corresponds to different file types, so the specific type of a file can be determined from its format information. Whether each file is a text file is then judged according to its format information.
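The judgment described above can be sketched as a lookup from format information to file type. The source does not define what the format information is; the sketch assumes it is the file name extension, and the extension sets are illustrative rather than exhaustive.

```python
import os

# Hypothetical extension tables; the patent does not enumerate formats.
TEXT_EXT = {".txt", ".html", ".md"}
AUDIO_EXT = {".mp3", ".wav"}
PICTURE_EXT = {".jpg", ".jpeg", ".png", ".gif"}
VIDEO_EXT = {".mp4", ".avi"}


def classify_file(name: str) -> str:
    """Map a file name to 'text', 'audio', 'picture', 'video' or 'unknown'."""
    ext = os.path.splitext(name)[1].lower()
    if ext in TEXT_EXT:
        return "text"
    if ext in AUDIO_EXT:
        return "audio"
    if ext in PICTURE_EXT:
        return "picture"
    if ext in VIDEO_EXT:
        return "video"
    return "unknown"


def is_text_file(name: str) -> bool:
    """The judging step: True only for text files."""
    return classify_file(name) == "text"
```

A production system would more likely inspect the file content or a declared media type, since extensions can be missing or wrong.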
The information conversion unit 130 is configured to, if a file in the newly added data information is a non-text file, convert the non-text file to a text file by means of a preset information recognition model.
If a file in the newly added data information is a non-text file, it is converted to a text file by the preset information recognition model. Specifically, non-text files include audio files, video files, and pictures. The information recognition model is a model for recognizing and converting non-text files, and it includes an audio recognition model and a picture recognition model.
In other embodiments of the invention, as shown in Fig. 8, the information conversion unit 130 includes the following subunits: a first text file acquiring unit 131, a second text file acquiring unit 132, and a third text file acquiring unit 133.
The first text file acquiring unit 131 is configured to obtain the format information of the non-text file and judge whether the file is an audio file and, if it is, recognize the file with the audio recognition model in the information recognition model to obtain the corresponding text file.
The format information of the file is obtained and used to judge whether the file is an audio file; if it is, the file is recognized by the audio recognition model in the information recognition model to obtain the corresponding text file. The audio recognition model recognizes and converts the voice information in the audio file to produce a text file containing the corresponding text information, and each audio file yields one text file after conversion. The audio recognition model includes an acoustic model, a speech feature dictionary, and a semantic parsing model.
In other embodiments of the invention, as shown in Fig. 9, the first text file acquiring unit 131 includes the following subunits: a phoneme segmentation unit 1311, a phoneme conversion unit 1312, and a speech analysis unit 1313.
The phoneme segmentation unit 1311 is configured to segment the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information.
The voice information in the audio file is segmented according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information. Specifically, voice information is made up of the phonemes of multiple character pronunciations, and the phoneme of a character pronunciation includes the frequency and timbre of that pronunciation. The acoustic model contains the phonemes of all character pronunciations. By matching the voice information against all phonemes in the acoustic model, the phonemes of individual characters in the voice information can be separated, and segmentation finally yields the multiple phonemes contained in the voice information.
The phoneme conversion unit 1312 is configured to match the obtained phonemes against the speech feature dictionary in the audio recognition model so as to convert all phonemes to pinyin information.
The obtained phonemes are matched against the speech feature dictionary in the audio recognition model, which converts all phonemes to pinyin information. The speech feature dictionary contains the phoneme information corresponding to the pinyin of every character. By matching each obtained phoneme against this phoneme information, the phoneme of a single character can be converted to the character pinyin in the speech feature dictionary that matches it, so that all phonemes contained in the voice information are converted to pinyin information.
The speech analysis unit 1313 is configured to perform semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
Semantic parsing is performed on the obtained pinyin information according to the semantic parsing model in the audio recognition model, which converts the pinyin information to the corresponding text file. The semantic parsing model contains the mapping relations between pinyin information and text information; using these mapping relations, the obtained pinyin information can be parsed semantically and converted to a text file containing text information.
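The three stages handled by units 1311 to 1313 (segmentation into phonemes, phoneme-to-pinyin lookup, pinyin-to-text mapping) can be illustrated with a toy pipeline. Everything below is a stand-in: real phonemes are acoustic features rather than strings, and the tables `PHONEME_TO_PINYIN` and `PINYIN_TO_TEXT` are hypothetical miniature substitutes for the acoustic model, the speech feature dictionary, and the semantic parsing model.

```python
from typing import Dict, List, Tuple

# Hypothetical stand-in for the speech feature dictionary: phoneme -> pinyin.
PHONEME_TO_PINYIN: Dict[str, str] = {"#ni": "ni", "#hao": "hao"}
# Hypothetical stand-in for the semantic parsing model: pinyin sequence -> text.
PINYIN_TO_TEXT: Dict[Tuple[str, ...], str] = {("ni", "hao"): "你好"}


def segment(signal: str, known_phonemes: List[str]) -> List[str]:
    """Split the signal into known phonemes by greedy longest-prefix matching."""
    ordered = sorted(known_phonemes, key=len, reverse=True)
    phonemes, pos = [], 0
    while pos < len(signal):
        for phoneme in ordered:
            if signal.startswith(phoneme, pos):
                phonemes.append(phoneme)
                pos += len(phoneme)
                break
        else:  # no known phoneme matches at this position
            raise ValueError(f"no phoneme matches at position {pos}")
    return phonemes


def transcribe(signal: str) -> str:
    """Run the three stages: phonemes, then pinyin, then text."""
    phonemes = segment(signal, list(PHONEME_TO_PINYIN))
    pinyin = tuple(PHONEME_TO_PINYIN[p] for p in phonemes)
    return PINYIN_TO_TEXT[pinyin]
```

With these tables, `transcribe("#ni#hao")` walks through `["#ni", "#hao"]`, then `("ni", "hao")`, then the mapped text. A practical recognizer would replace each table with a trained statistical model.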
The second text file acquiring unit 132 is configured to obtain the format information of the non-text file and judge whether the file is a picture and, if it is, recognize the file with the picture recognition model in the information recognition model to obtain the corresponding text file.
The format information of the non-text file is obtained and used to judge whether the file is a picture; if it is, the text contained in the file is recognized by the picture recognition model in the information recognition model to obtain the corresponding text file. Specifically, a text template is template information for recognizing text in a picture. Each text template corresponds to one character and contains the multiple fonts of that character, so every character in a picture can be matched with one of the fonts in the corresponding text template. By matching the text templates in the picture recognition model against the picture, the text contained in the picture can be recognized to obtain the corresponding text file.
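The template idea above, where each character has one template holding several font variants, can be sketched with tiny bitmaps. This is a toy under strong assumptions: glyphs are already cut out of the picture, each glyph is a 3x3 bitmap written as three strings, and matching is exact equality. The `TEMPLATES` table is hypothetical.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Glyph = Tuple[str, str, str]  # a 3x3 bitmap, one string per row

# Hypothetical text templates: one entry per character, several fonts each.
TEMPLATES: Dict[str, List[Glyph]] = {
    "I": [("111", "010", "111"), ("010", "010", "010")],
    "L": [("100", "100", "111")],
}


def recognize_glyph(glyph: Glyph) -> Optional[str]:
    """Return the character whose template contains a font equal to the glyph."""
    for char, fonts in TEMPLATES.items():
        if glyph in fonts:
            return char
    return None


def recognize_picture(glyphs: Sequence[Glyph]) -> str:
    """Recognize each glyph in order; unmatched glyphs become '?'."""
    return "".join(recognize_glyph(g) or "?" for g in glyphs)
```

Real text recognition must also locate characters in the image and tolerate noise and scaling, which exact bitmap equality does not.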
The third text file acquiring unit 133 is configured to obtain the format information of the non-text file and judge whether the file is a video file and, if it is, recognize the file with the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
The format information of the non-text file is obtained and used to judge whether the file is a video file; if it is, the file is recognized by the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file. For a video file, the voice information in the video is extracted first and is recognized and converted by the audio recognition model to obtain the corresponding text information; the specific recognition and conversion method is the same as that performed in the first text file acquiring unit 131. Each frame contained in the video file is then obtained and recognized by the picture recognition model to obtain the text information contained in each frame; the specific recognition method is the same as that performed in the second text file acquiring unit 132. Combining the text information corresponding to the voice information with the text information contained in each frame yields the text file corresponding to the video file; that is, each video file yields one text file after conversion.
In addition, if a non-text file is not a video file, an audio file, or a picture, the information recognition model cannot process it, and alert information is generated to notify the user that the file cannot be processed.
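The branching among units 131 to 133, including the alert for unsupported types, amounts to a dispatch on file type. The sketch below assumes the two recognizers are supplied as callables and that a video is presented as a dictionary with an `"audio"` track and a list of `"frames"`; these shapes and the function name are illustrative assumptions.

```python
from typing import Any, Callable, Optional, Tuple


def convert_non_text(kind: str, payload: Any,
                     recognize_audio: Callable[[Any], str],
                     recognize_picture: Callable[[Any], str]
                     ) -> Tuple[Optional[str], Optional[str]]:
    """Return (text, alert); exactly one of the two is None."""
    if kind == "audio":
        return recognize_audio(payload), None
    if kind == "picture":
        return recognize_picture(payload), None
    if kind == "video":
        # Speech text first, then the text found in each frame, in order.
        speech = recognize_audio(payload["audio"])
        frame_texts = [recognize_picture(frame) for frame in payload["frames"]]
        return "\n".join([speech] + frame_texts), None
    # Neither audio, picture nor video: the models cannot handle the file.
    return None, f"cannot process file of type {kind!r}"
```

Passing the recognizers in as parameters keeps the dispatch independent of any particular speech or image library.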
The information storage unit 140 is configured to save the text files in the newly added data information and/or the converted text files into a preset data linked list.
The text files in the newly added data information and/or the converted text files are saved into the preset data linked list so that the newly added data information is preserved. The data linked list is a database preset in the terminal device for storing information; specifically, it is a database that stores the text files contained in newly added data information along a time axis. The logical order of the data information stored in the data linked list is realized by the order of the pointer links in the list. In this embodiment the publication timestamp of the newly added data information serves as the logical order of the data linked list; that is, the text files in the newly added data information are stored in the data linked list with the time information determining the pointer link order. Because newly added data information is stored with chronological order as the logical order of the list, the user can obtain from the data linked list a list of text files ordered by time, and the information stored in the data linked list cannot be deleted.
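A singly linked list whose pointer order follows the publication timestamp can be sketched as follows. The class names are hypothetical, and "cannot be deleted" is modeled only by not offering any delete or update operation; the source does not describe how deletion is actually prevented.

```python
from typing import Iterator, Optional, Tuple


class _Node:
    def __init__(self, timestamp: float, text: str) -> None:
        self.timestamp = timestamp
        self.text = text
        self.next: Optional["_Node"] = None


class TimeOrderedList:
    """Linked list kept in ascending timestamp order; insert and read only."""

    def __init__(self) -> None:
        self._head: Optional[_Node] = None

    def insert(self, timestamp: float, text: str) -> None:
        node = _Node(timestamp, text)
        if self._head is None or timestamp < self._head.timestamp:
            node.next = self._head
            self._head = node
            return
        current = self._head
        # Advance past every node that is not later than the new one.
        while current.next is not None and current.next.timestamp <= timestamp:
            current = current.next
        node.next = current.next
        current.next = node

    def __iter__(self) -> Iterator[Tuple[float, str]]:
        current = self._head
        while current is not None:
            yield current.timestamp, current.text
            current = current.next
```

Iterating the list returns the stored text files in chronological order regardless of the order in which they were inserted.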
Furthermore, because non-text files are converted to text files containing text information before being stored, the storage space required for the corresponding data information is greatly reduced, which is convenient for users.
In other embodiments of the invention, as shown in Fig. 10, the information storage unit 140 includes the following subunits: an information acquiring unit 141, a file classification unit 142, and a file storage unit 143.
The information acquiring unit 141 is configured to obtain the publication source information and the publication timestamp of the newly added data information on the webpage to be monitored.
The publication source information and the publication timestamp of the newly added data information on the webpage to be monitored are obtained. To facilitate storing the newly added data information and retrieving the stored data information later, its publication source information and publication timestamp must be obtained. Specifically, the publication source information includes the website information of the webpage to be monitored and the publisher of the data information.
The file classification unit 142 is configured to classify, according to the publication source information, the text files contained in the newly added data information and/or the converted text files.
The text files contained in the newly added data information and/or the converted text files are classified according to the publication source information. Specifically, to store newly added data information by category, it is classified according to the publisher in the publication source information. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list, so newly added data information published by the same publisher is assigned to the sub-list corresponding to its publication source information. Classifying the text files by the publisher in the publication source information in this way allows the newly added data information to be saved by category.
The file storage unit 143 is configured to save, according to the publication timestamp, the text files contained in the newly added data information and/or the converted text files into the sub-list of the preset data linked list that corresponds to the publication source information.
According to the publication timestamp, the text files contained in the newly added data information and/or the converted text files are saved into the corresponding sub-list of the preset data linked list. Each publisher corresponds to one category, and each category corresponds to one sub-list in the data linked list. Because the data information in the data linked list is stored along a time axis, the text files in the newly added data information must be stored, according to the publication timestamp of that information, into the sub-list of the data linked list that corresponds to the publisher's category, so that the newly added data information is saved.
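Classification by publisher followed by timestamp-ordered saving can be sketched as one sub-list per publisher. For brevity the sketch uses a sorted Python list per publisher in place of a pointer-linked sub-list; the class name `ChainStore` and its methods are hypothetical.

```python
import bisect
from collections import defaultdict
from typing import DefaultDict, List, Tuple


class ChainStore:
    """One time-ordered sub-list per publisher; entries can only be added."""

    def __init__(self) -> None:
        self._sublists: DefaultDict[str, List[Tuple[float, str]]] = defaultdict(list)

    def save(self, publisher: str, timestamp: float, text: str) -> None:
        # insort keeps each sub-list sorted by (timestamp, text).
        bisect.insort(self._sublists[publisher], (timestamp, text))

    def history(self, publisher: str) -> Tuple[Tuple[float, str], ...]:
        """Read-only, chronological view of one publisher's saved text."""
        return tuple(self._sublists.get(publisher, ()))
```

Returning a tuple from `history` gives callers a snapshot they cannot use to alter the stored entries, which loosely reflects the evidence-preservation goal described above.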
Because the text files stored in the data linked list cannot be deleted or modified, the historical data information published on the webpage to be monitored is preserved, which makes it easier to collect evidence later about the historical data information published by the corresponding publisher.
The data information published on a webpage is monitored, and each file in it is judged to determine whether it is a text file. Non-text files are converted to text files, and all text files are stored in the data linked list, which achieves chain-type storage of internet information. This ensures that the stored text files cannot be deleted or modified and lets users obtain data information that has since been deleted from the internet, which helps users collect evidence on the relevant data information and has great practical value.
The above internet information chain-type storage device can be implemented in the form of a computer program, which can run on a computer device as shown in Fig. 11.
Please refer to Fig. 11, which is a schematic block diagram of the computer device provided by an embodiment of the present invention.
Referring to Fig. 11, the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. When executed, the computer program 5032 can cause the processor 502 to execute the internet information chain-type storage method.
The processor 502 provides computing and control capabilities and supports the operation of the entire computer device 500.
The internal memory 504 provides an environment for running the computer program 5032 stored in the non-volatile storage medium 503; when the computer program 5032 is executed by the processor 502, it can cause the processor 502 to execute the internet information chain-type storage method.
The network interface 505 is used for network communication, such as transmitting data information. Those skilled in the art will understand that the structure shown in Fig. 11 is only a block diagram of the part of the structure relevant to the solution of the present invention and does not limit the computer device 500 to which the solution is applied; a specific computer device 500 may include more or fewer components than shown, combine certain components, or have a different arrangement of components.
The processor 502 is configured to run the computer program 5032 stored in the memory to implement the following functions: obtaining the website information of the webpage to be monitored and, according to that website information, monitoring in real time the data information published on the webpage to be monitored to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file to a text file by a preset information recognition model; and saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
In one embodiment, when executing the step of obtaining the website information of the webpage to be monitored and monitoring in real time, according to that website information, the data information published on the webpage to obtain newly added data information, the processor 502 performs the following operations: if data information is detected being published on the webpage to be monitored, generating publication source information according to the website information of the webpage to be monitored and the publisher of the data information; generating a publication timestamp according to the publication time of the data information; and obtaining all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
In one embodiment, when executing the step of converting a non-text file in the newly added data information to a text file by the preset information recognition model, the processor 502 performs the following operations: obtaining the format information of the non-text file and judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain the corresponding text file; obtaining the format information of the non-text file and judging whether the file is a picture and, if it is, recognizing the file with the picture recognition model in the information recognition model to obtain the corresponding text file; and obtaining the format information of the non-text file and judging whether the file is a video file and, if it is, recognizing the file with the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
In one embodiment, when executing the step of obtaining the format information of the non-text file, judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain the corresponding text file, the processor 502 performs the following operations: segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information; matching the obtained phonemes against the speech feature dictionary in the audio recognition model to convert all phonemes to pinyin information; and performing semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
In one embodiment, when executing the step of saving the text files in the newly added data information and/or the converted text files into the preset data linked list, the processor 502 performs the following operations: obtaining the publication source information and the publication timestamp of the newly added data information on the webpage to be monitored; classifying, according to the publication source information, the text files contained in the newly added data information and/or the converted text files; and saving, according to the publication timestamp, the text files contained in the newly added data information and/or the converted text files into the sub-list of the preset data linked list that corresponds to the publication source information.
Those skilled in the art will understand that the embodiment of the computer device shown in Fig. 11 does not limit the specific composition of the computer device. In other embodiments, the computer device may include more or fewer components than shown, combine certain components, or have a different arrangement of components. For example, in some embodiments the computer device may include only a memory and a processor; in such embodiments the structure and function of the memory and processor are consistent with the embodiment shown in Fig. 11 and are not repeated here.
It should be understood that, in the embodiments of the present invention, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and may also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field-programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. The general-purpose processor may be a microprocessor or any conventional processor.
Another embodiment of the present invention provides a computer-readable storage medium, which may be a non-volatile computer-readable storage medium. The computer-readable storage medium stores a computer program that, when executed by a processor, implements the following steps: obtaining the website information of the webpage to be monitored and, according to that website information, monitoring in real time the data information published on the webpage to be monitored to obtain newly added data information; judging whether each file in the newly added data information is a text file; if a file in the newly added data information is a non-text file, converting the non-text file to a text file by a preset information recognition model; and saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
In one embodiment, the step of obtaining the website information of the webpage to be monitored and monitoring in real time, according to that website information, the data information published on the webpage to obtain newly added data information includes: if data information is detected being published on the webpage to be monitored, generating publication source information according to the website information of the webpage to be monitored and the publisher of the data information; generating a publication timestamp according to the publication time of the data information; and obtaining all files in the published data information, together with the publication source information and the publication timestamp, to obtain the newly added data information.
In one embodiment, the step of converting a non-text file in the newly added data information to a text file by the preset information recognition model includes: obtaining the format information of the non-text file and judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain the corresponding text file; obtaining the format information of the non-text file and judging whether the file is a picture and, if it is, recognizing the file with the picture recognition model in the information recognition model to obtain the corresponding text file; and obtaining the format information of the non-text file and judging whether the file is a video file and, if it is, recognizing the file with the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
In one embodiment, the step of obtaining the format information of the non-text file, judging whether the file is an audio file and, if it is, recognizing the file with the audio recognition model in the information recognition model to obtain the corresponding text file includes: segmenting the voice information in the audio file according to the acoustic model in the audio recognition model to obtain the multiple phonemes contained in the voice information; matching the obtained phonemes against the speech feature dictionary in the audio recognition model to convert all phonemes to pinyin information; and performing semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model to obtain a text file containing text information.
In one embodiment, the step of saving the text files in the newly added data information and/or the converted text files into the preset data linked list includes: obtaining the publication source information and the publication timestamp of the newly added data information on the webpage to be monitored; classifying, according to the publication source information, the text files contained in the newly added data information and/or the converted text files; and saving, according to the publication timestamp, the text files contained in the newly added data information and/or the converted text files into the sub-list of the preset data linked list that corresponds to the publication source information.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the equipment, devices, and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To illustrate the interchangeability of hardware and software clearly, the composition and steps of each example have been described above in general terms of their functions. Whether these functions are performed in hardware or software depends on the specific application and the design constraints of the technical solution. Skilled practitioners may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.
In the several embodiments provided by the present invention, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in actual implementation. Units with the same function may be combined into one unit, multiple units or components may be combined or integrated into another system, and some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the embodiments of the present invention.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and is sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a computer-readable storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device or the like) to execute all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned computer-readable storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a magnetic disk or an optical disc.
The above are merely specific embodiments of the present invention, but the protection scope of the present invention is not limited to them. Any person skilled in the art can readily conceive of various equivalent modifications or replacements within the technical scope disclosed by the present invention, and these modifications or replacements shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (10)
1. An Internet information chain storage method, characterized by comprising:
obtaining website information of a web page to be monitored, and monitoring in real time, according to the website information of the web page to be monitored, the data information published in the web page to be monitored, so as to obtain newly added data information;
judging whether each file in the newly added data information is a text file;
if a file in the newly added data information is a non-text file, converting the non-text file into a text file by means of a preset information recognition model;
saving the text files in the newly added data information and/or the converted text files into a preset data linked list.
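As a non-limiting illustration, the judging, converting and saving steps of claim 1 might be sketched in Python as follows. The monitoring step is left out, so the newly added files are simply passed in. The names `DataLinkedList`, `store_new_data`, `is_text_file` and the extension set `TEXT_EXTENSIONS` are hypothetical, and the `recognize` callable stands in for the preset information recognition model, whose interface the claims do not specify.

```python
import os
from dataclasses import dataclass
from typing import Callable, Dict, Iterator, Optional

# Assumption: a file counts as a "text file" when its extension is in this set.
TEXT_EXTENSIONS = {".txt", ".html", ".md"}


@dataclass
class Node:
    """One node of the data linked list, holding one stored text."""
    text: str
    next: Optional["Node"] = None


class DataLinkedList:
    """Minimal singly linked list standing in for the preset data linked list."""

    def __init__(self) -> None:
        self.head: Optional[Node] = None
        self.tail: Optional[Node] = None

    def append(self, text: str) -> None:
        node = Node(text)
        if self.tail is None:
            self.head = self.tail = node
        else:
            self.tail.next = node
            self.tail = node

    def __iter__(self) -> Iterator[str]:
        node = self.head
        while node is not None:
            yield node.text
            node = node.next


def is_text_file(name: str) -> bool:
    """Judge by file extension whether a file is a text file (an assumption)."""
    return os.path.splitext(name)[1].lower() in TEXT_EXTENSIONS


def store_new_data(
    files: Dict[str, bytes],
    recognize: Callable[[str, bytes], str],
    chain: DataLinkedList,
) -> None:
    """Save text files directly; convert non-text files first, then save them."""
    for name, payload in files.items():
        if is_text_file(name):
            text = payload.decode("utf-8")
        else:
            # Hypothetical hook for the preset information recognition model.
            text = recognize(name, payload)
        chain.append(text)
```

A caller would pass the files found in one newly published item together with whatever recognition function is available; the list then holds one text per file, in arrival order.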
2. The Internet information chain storage method according to claim 1, characterized in that the monitoring in real time, according to the website information of the web page to be monitored, of the data information published in the web page to be monitored so as to obtain newly added data information comprises:
if it is detected that data information has been published in the web page to be monitored, generating publication source information according to the website information of the web page to be monitored and the publisher of the data information;
generating a publication timestamp according to the publication time of the data information;
obtaining all files in the published data information, together with the publication source information and the publication timestamp, so as to obtain the newly added data information.
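The steps of claim 2 might be sketched as below. This is only an illustration under stated assumptions: the claims do not fix the format of the publication source information or of the timestamp, so the `host/publisher` string and the integer Unix timestamp are choices made here, and `NewDataInfo`, `make_source_info`, `make_timestamp` and `build_new_data_info` are hypothetical names.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict
from urllib.parse import urlparse


@dataclass
class NewDataInfo:
    """Newly added data information: files plus source and timestamp."""
    source: str
    timestamp: int
    files: Dict[str, bytes]


def make_source_info(page_url: str, publisher: str) -> str:
    """Combine the website host and the publisher (format is an assumption)."""
    host = urlparse(page_url).netloc
    return f"{host}/{publisher}"


def make_timestamp(published_at: datetime) -> int:
    """Represent the publication time as whole seconds since the Unix epoch."""
    return int(published_at.timestamp())


def build_new_data_info(
    page_url: str,
    publisher: str,
    published_at: datetime,
    files: Dict[str, bytes],
) -> NewDataInfo:
    """Bundle all files of one published item with its source and timestamp."""
    return NewDataInfo(
        source=make_source_info(page_url, publisher),
        timestamp=make_timestamp(published_at),
        files=dict(files),
    )
```

Passing a timezone-aware `datetime` keeps the timestamp independent of the local time zone of the monitoring machine.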
3. The Internet information chain storage method according to claim 1, characterized in that the converting of the non-text file into a text file by means of a preset information recognition model comprises:
obtaining the format information of the non-text file and judging whether the file is an audio file, and if the file is an audio file, recognizing the file by means of the audio recognition model in the information recognition model to obtain the corresponding text file;
obtaining the format information of the non-text file and judging whether the file is a picture, and if the file is a picture, recognizing the file by means of the picture recognition model in the information recognition model to obtain the corresponding text file;
obtaining the format information of the non-text file and judging whether the file is a video file, and if the file is a video file, recognizing the file by means of the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
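The format-based dispatch of claim 3 could look roughly like the following sketch. The extension sets, the function names and the way the two results are joined for video are all assumptions: the claim states only that both the audio recognition model and the picture recognition model are applied to a video file, not how their outputs are combined.

```python
import os
from typing import Callable

# Assumed extension sets; the claims speak only of "format information".
AUDIO_EXTENSIONS = {".mp3", ".wav"}
PICTURE_EXTENSIONS = {".png", ".jpg", ".jpeg"}
VIDEO_EXTENSIONS = {".mp4", ".avi"}


def classify_format(name: str) -> str:
    """Return 'audio', 'picture', 'video' or 'unknown' from the file extension."""
    ext = os.path.splitext(name)[1].lower()
    if ext in AUDIO_EXTENSIONS:
        return "audio"
    if ext in PICTURE_EXTENSIONS:
        return "picture"
    if ext in VIDEO_EXTENSIONS:
        return "video"
    return "unknown"


def convert_to_text(
    name: str,
    payload: bytes,
    audio_model: Callable[[bytes], str],
    picture_model: Callable[[bytes], str],
) -> str:
    """Route a non-text file to the recognition model(s) matching its format."""
    kind = classify_format(name)
    if kind == "audio":
        return audio_model(payload)
    if kind == "picture":
        return picture_model(payload)
    if kind == "video":
        # Assumption: both models see the same payload and the two results
        # are joined with a newline.
        return audio_model(payload) + "\n" + picture_model(payload)
    raise ValueError(f"unsupported file format: {name!r}")
```

Both models are passed in as plain callables so that the dispatch logic can be exercised with stubs before any real recognition model is attached.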
4. The Internet information chain storage method according to claim 3, characterized in that, if the file is an audio file, the recognizing of the file by means of the audio recognition model in the information recognition model to obtain the corresponding text file comprises:
segmenting the voice information in the audio file according to the acoustic model in the audio recognition model, so as to obtain the multiple phonemes contained in the voice information;
matching the obtained phonemes against the phonetic feature dictionary in the audio recognition model, so as to convert all of the phonemes into pinyin information;
performing semantic parsing on the obtained pinyin information according to the semantic parsing model in the audio recognition model, so as to obtain a text file containing text information.
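To make the three stages of claim 4 concrete, the toy sketch below chains phoneme segmentation, phoneme-to-pinyin matching and pinyin-to-text parsing. It is not a speech recognizer: the "audio" is assumed to be an already labelled phoneme sequence, the dictionary and the semantic table are two-entry stand-ins invented for the example, and greedy longest-match lookup is an assumed matching strategy that the claims do not prescribe.

```python
from typing import Dict, List, Sequence, Tuple

# Toy phonetic feature dictionary: phoneme tuple -> pinyin syllable.
PHONETIC_DICT: Dict[Tuple[str, ...], str] = {
    ("n", "i"): "ni",
    ("h", "ao"): "hao",
}

# Toy stand-in for the semantic parsing model: pinyin sequence -> text.
SEMANTIC_TABLE: Dict[Tuple[str, ...], str] = {
    ("ni", "hao"): "你好",
}


def segment_phonemes(audio: Sequence[str]) -> List[str]:
    """Stand-in for the acoustic model: the input is already phoneme labels."""
    return list(audio)


def phonemes_to_pinyin(
    phonemes: Sequence[str],
    dictionary: Dict[Tuple[str, ...], str],
) -> List[str]:
    """Convert phonemes to pinyin syllables by greedy longest match."""
    max_len = max(len(key) for key in dictionary)
    syllables: List[str] = []
    i = 0
    while i < len(phonemes):
        for size in range(min(max_len, len(phonemes) - i), 0, -1):
            key = tuple(phonemes[i:i + size])
            if key in dictionary:
                syllables.append(dictionary[key])
                i += size
                break
        else:
            # No dictionary entry starts at this phoneme.
            raise ValueError(f"no pinyin for phoneme {phonemes[i]!r}")
    return syllables


def pinyin_to_text(pinyin: Sequence[str], table: Dict[Tuple[str, ...], str]) -> str:
    """Look up the text for a whole pinyin sequence (KeyError if unknown)."""
    return table[tuple(pinyin)]


def recognize_audio(audio: Sequence[str]) -> str:
    """Run the three stages in order: phonemes, then pinyin, then text."""
    phonemes = segment_phonemes(audio)
    pinyin = phonemes_to_pinyin(phonemes, PHONETIC_DICT)
    return pinyin_to_text(pinyin, SEMANTIC_TABLE)
```

The pinyin layer is what makes the semantic stage necessary: one pinyin syllable can correspond to many characters, so the final text has to be chosen from the sequence as a whole.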
5. The Internet information chain storage method according to claim 2, characterized in that the saving of the text files in the newly added data information and/or the converted text files into a preset data linked list comprises:
obtaining the publication source information and the publication timestamp of the newly added data information in the web page to be monitored;
classifying, according to the publication source information, the text files included in the newly added data information and/or the converted text files;
saving, according to the publication timestamp, the text files included in the newly added data information and/or the converted text files into the sub-list of the preset data linked list that corresponds to the publication source information.
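One possible shape for the storage of claim 5 is a dictionary of per-source linked lists, each kept in timestamp order. The sketch below assumes ascending order and assumes that entries with equal timestamps keep their arrival order; the claim says only that files are saved "according to the publication timestamp". `Entry`, `SourceSubList` and `DataChain` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, Optional, Tuple


@dataclass
class Entry:
    """One stored text with its publication timestamp."""
    timestamp: int
    text: str
    next: Optional["Entry"] = None


class SourceSubList:
    """Sub-list for one publication source, kept in ascending timestamp order."""

    def __init__(self) -> None:
        self.head: Optional[Entry] = None

    def insert(self, timestamp: int, text: str) -> None:
        entry = Entry(timestamp, text)
        # New earliest entry (or empty list): it becomes the head.
        if self.head is None or timestamp < self.head.timestamp:
            entry.next = self.head
            self.head = entry
            return
        # Otherwise walk to the last entry that is not later than the new one.
        cur = self.head
        while cur.next is not None and cur.next.timestamp <= timestamp:
            cur = cur.next
        entry.next = cur.next
        cur.next = entry

    def __iter__(self) -> Iterator[Tuple[int, str]]:
        cur = self.head
        while cur is not None:
            yield cur.timestamp, cur.text
            cur = cur.next


class DataChain:
    """Preset data linked list, modelled as one sub-list per publication source."""

    def __init__(self) -> None:
        self.sublists: Dict[str, SourceSubList] = {}

    def save(self, source: str, timestamp: int, text: str) -> None:
        """Classify by source, then insert by timestamp into that sub-list."""
        self.sublists.setdefault(source, SourceSubList()).insert(timestamp, text)
```

Keeping each sub-list ordered at insertion time means a later reader can walk one source's history chronologically without sorting.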
6. An Internet information chain storage device, characterized by comprising:
a web page monitoring unit, configured to obtain website information of a web page to be monitored and to monitor in real time, according to the website information of the web page to be monitored, the data information published in the web page to be monitored, so as to obtain newly added data information;
a judging unit, configured to judge whether each file in the newly added data information is a text file;
an information conversion unit, configured to convert a non-text file into a text file by means of a preset information recognition model if a file in the newly added data information is a non-text file;
an information storage unit, configured to save the text files in the newly added data information and/or the converted text files into a preset data linked list.
7. The Internet information chain storage device according to claim 6, characterized in that the web page monitoring unit comprises:
a publication source information generating unit, configured to generate, if it is detected that data information has been published in the web page to be monitored, publication source information according to the website information of the web page to be monitored and the publisher of the data information;
a publication timestamp generating unit, configured to generate a publication timestamp according to the publication time of the data information;
a newly added data information obtaining unit, configured to obtain all files in the published data information, together with the publication source information and the publication timestamp, so as to obtain the newly added data information.
8. The Internet information chain storage device according to claim 6, characterized in that the information conversion unit comprises:
a first text file obtaining unit, configured to obtain the format information of the non-text file and judge whether the file is an audio file, and if the file is an audio file, to recognize the file by means of the audio recognition model in the information recognition model to obtain the corresponding text file;
a second text file obtaining unit, configured to obtain the format information of the non-text file and judge whether the file is a picture, and if the file is a picture, to recognize the file by means of the picture recognition model in the information recognition model to obtain the corresponding text file;
a third text file obtaining unit, configured to obtain the format information of the non-text file and judge whether the file is a video file, and if the file is a video file, to recognize the file by means of the audio recognition model and the picture recognition model in the information recognition model to obtain the corresponding text file.
9. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the computer program, implements the Internet information chain storage method according to any one of claims 1 to 5.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a processor, causes the processor to perform the Internet information chain storage method according to any one of claims 1 to 5.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811526834.0A CN109657181B (en) | 2018-12-13 | Internet information chain storage method, device, computer equipment and storage medium | |
PCT/CN2019/092551 WO2020119064A1 (en) | 2018-12-13 | 2019-06-24 | Method and device for storing internet information in linked manner, computer apparatus and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811526834.0A CN109657181B (en) | 2018-12-13 | Internet information chain storage method, device, computer equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109657181A (en) | 2019-04-19 |
CN109657181B (en) | 2024-05-14 |
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090157407A1 (en) * | 2007-12-12 | 2009-06-18 | Nokia Corporation | Methods, Apparatuses, and Computer Program Products for Semantic Media Conversion From Source Files to Audio/Video Files |
CN101364955A (en) * | 2008-09-28 | 2009-02-11 | 杭州电子科技大学 | Method for analyzing and extracting evidence of e-mail customer terminal |
CN101882162A (en) * | 2010-06-29 | 2010-11-10 | 北京搜狗科技发展有限公司 | Method and system for transmitting network information |
CN103942639A (en) * | 2014-03-21 | 2014-07-23 | 宁波中小在线信息服务有限公司 | Policy management system and method for policy consultative service system |
US20170062010A1 (en) * | 2015-09-02 | 2017-03-02 | Yahoo! Inc. | Computerized system and method for formatted transcription of multimedia content |
CN106412678A (en) * | 2016-09-14 | 2017-02-15 | 安徽声讯信息技术有限公司 | Method and system for transcribing and storing video news in real time |
CN107680602A (en) * | 2017-08-24 | 2018-02-09 | 平安科技(深圳)有限公司 | Voice fraud recognition methods, device, terminal device and storage medium |
CN108829765A (en) * | 2018-05-29 | 2018-11-16 | 平安科技(深圳)有限公司 | A kind of information query method, device, computer equipment and storage medium |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020119064A1 (en) * | 2018-12-13 | 2020-06-18 | 平安科技(深圳)有限公司 | Method and device for storing internet information in linked manner, computer apparatus and storage medium |
WO2021000496A1 (en) * | 2019-07-04 | 2021-01-07 | 平安科技(深圳)有限公司 | Information chain generation method and apparatus, and computer device and storage medium |
CN111125345A (en) * | 2019-12-24 | 2020-05-08 | 南京三百云信息科技有限公司 | Data application method and device |
CN111125345B (en) * | 2019-12-24 | 2024-04-16 | 南京三百云信息科技有限公司 | Data application method and device |
CN112104747A (en) * | 2020-10-30 | 2020-12-18 | 广州市玄武无线科技股份有限公司 | Request response system based on chain processing |
CN112784077A (en) * | 2021-03-17 | 2021-05-11 | 陕西省大数据集团有限公司 | Method and device for classified extraction of data asset value |
Also Published As
Publication number | Publication date |
---|---|
WO2020119064A1 (en) | 2020-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20140278426A1 (en) | | Data shredding for speech recognition acoustic model training under data retention restrictions |
US10832803B2 (en) | | Automated system and method for improving healthcare communication |
WO2019095586A1 (en) | | Meeting minutes generation method, application server, and computer readable storage medium |
CN104485105B (en) | | A kind of electronic health record generation method and electronic medical record system |
US9514740B2 (en) | | Data shredding for speech recognition language model training under data retention restrictions |
US11580951B2 (en) | | Speaker identity and content de-identification |
CN110334110A (en) | | Natural language classification method, device, computer equipment and storage medium |
JP2019061662A (en) | | Method and apparatus for extracting information |
JP6019604B2 (en) | | Speech recognition apparatus, speech recognition method, and program |
US20160189107A1 (en) | | Apparatus and method for automatically creating and recording minutes of meeting |
US11114113B2 (en) | | Multilingual system for early detection of neurodegenerative and psychiatric disorders |
CN111353065A (en) | | Voice archive storage method, device, equipment and computer readable storage medium |
WO2019227629A1 (en) | | Text information generation method and apparatus, computer device and storage medium |
CN113190675A (en) | | Text abstract generation method and device, computer equipment and storage medium |
JP6179971B2 (en) | | Information providing apparatus and information providing method |
CN113326696B (en) | | Text generation method and device |
CN109243549B (en) | | Intelligent follow-up method and device and server |
US10825558B2 (en) | | Method for improving healthcare |
CN112309372B (en) | | Intent recognition method, device, equipment and storage medium based on intonation |
EP3809411A1 (en) | | Multi-lingual system for early detection of alzheimer's disease |
CN108962228A (en) | | model training method and device |
CN109524009B (en) | | Policy entry method and related device based on voice recognition |
JP2019520614A (en) | | Risk event recognition system based on SNS information, method, electronic device and storage medium |
CN109657181A (en) | | Internet information chain type storage method, device, computer equipment and storage medium |
US11431472B1 (en) | | Automated domain language parsing and data extraction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |