CN117851535A - Information file full-structure storage based on transaction logic and search engine-free design method and system thereof - Google Patents

Information file full-structure storage based on transaction logic and search engine-free design method and system thereof Download PDF

Info

Publication number
CN117851535A
CN117851535A CN202310368352.1A CN202310368352A CN117851535A CN 117851535 A CN117851535 A CN 117851535A CN 202310368352 A CN202310368352 A CN 202310368352A CN 117851535 A CN117851535 A CN 117851535A
Authority
CN
China
Prior art keywords
file
information
search
search engine
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310368352.1A
Other languages
Chinese (zh)
Inventor
殷步九
梁玢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202310368352.1A priority Critical patent/CN117851535A/en
Publication of CN117851535A publication Critical patent/CN117851535A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a transaction logic-based information file full-structure storage and a search engine-free design method thereof, which comprises the following steps: when the network file is stored, the network file is converted into a text according to the file structure decomposition mark and stored in a file library; continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing; the extraction process only needs to submit keywords, and queries the relevant degree according to the classification of the words, the words and the sentences. The invention generates the self-organizing ordered effect on the unordered information, fundamentally solves the low-efficiency situation of iterative search on the unordered information facing to the vast amount of the unordered information, solves the traditional searching problem in the fundamental sense, fundamentally provides intelligent, accurate and philosophy-conforming prompt information for the user, and can provide a large amount of related information which cannot be imagined by the user and multi-level related information for the user.

Description

Information file full-structure storage based on transaction logic and search engine-free design method and system thereof
Technical Field
The invention relates to the technical field of computers, in particular to a transaction logic-based information file full-structure storage and a search engine-free design method and system thereof.
Background
1. What is a search engine
The search engine is a system for collecting information from the internet by using a specific computer program according to a certain strategy, organizing and processing the information, providing search service for users, and displaying the information related to the user search to the users.
2. Search mode of search engine
The search mode of the search engine comprises full text index, catalog index, meta search engine, vertical search engine, collective search engine, portal search engine, free link list and the like.
1. Full text indexing
The full text search engine is a concept of extracting information from websites to establish a webpage database, and is a real-name search engine. Foreign representatives are Google, and domestic are well known hundred degree searches.
The automatic information gathering function of full-text search engines is divided into two types. One is periodic searching, i.e., at intervals (e.g., google is typically 28 days), a search engine actively dispatches a "spider" program to retrieve internet sites within a certain IP address range, and once a new site is found, it automatically extracts information about the site and adds the site address to its own database. The other is to submit a web site search, i.e., the downstream web site owner actively submits a web site to the search engine for query by the user.
When the user searches information by keywords, the search engine searches in the database, if a website which accords with the content required by the user is found, a special algorithm is adopted, the relevance and ranking level of each webpage are calculated according to the matching degree, the occurrence position, the frequency and the link quality of the keywords in the webpage, and then the webpage links are sequentially returned to the user according to the relevance. The engine is characterized by a relatively high search rate.
Another type of search engine company rents databases of other search engines and arranges the search results in a self-defined format, such as the Lycos search engine.
2. Directory index
The catalog index is also called classified retrieval, and is the service for providing WWW resource query on the Internet at the earliest, and mainly comprises the steps of collecting and sorting the resources of the Internet, distributing the websites of the searched webpages to the categories of different levels of related classified theme catalogues according to the searched webpage contents, and forming a classified tree structure index like a library catalog. The catalog index does not need to input any text, and the required network information resources can be found by clicking and entering the catalog layer by layer only according to the theme classification catalog provided by the website. The user can find the required information according to the classified catalogue completely and does not rely on Keywords (Keywords) for query. The engine is characterized by higher accuracy. The most representative of the directory indexes is Yahoo-! Searching the new wave classification catalogue.
Although the catalog index has a search function, the catalog index cannot be strictly called a real search engine, but is only a website link list classified by catalog.
There are many differences in the directory index compared to full text search engines:
first, full-text index search engines pertain to automatic web site retrieval, while directory indexing relies entirely on manual operations. After the user submits the website, the catalog editor can browse your website personally and then decide whether to admit your website according to a set of self-defined evaluation criteria or even subjective impressions of the editor.
Second, the directory index requires much more downstream websites and sometimes is not necessarily successful even if logged in multiple times. Particularly with hyperindexes such as Yahoo, login is more difficult.
In addition, when registering a full text index search engine, the classification problem of websites is generally not considered, but when registering a Directory index, websites must be placed in a most suitable Directory (Directory).
Finally, the related information of each website in the full-text index search engine is automatically extracted from the user webpage, so that the user has more autonomy as a user; the directory index requires that the website information must be manually filled in, and there are various limitations. Furthermore, if the staff considers that the catalogue and the website information of the website submitted by the staff are unsuitable, the staff can adjust the catalogue and the website information at any time, and certainly, the catalogue and the website information are not purchased in advance.
In a default search mode, some catalog search engines firstly return websites matched in own catalogs, such as Chinese search foxes, newness waves, web liability and the like; while others default to web searches, such as Yahoo.
The full text index and the catalog index have the tendency of mutual fusion penetration. Some original pure full text search engines now also provide Directory searches, such as Google provides classification queries with Open Directory borrowing. And like Yahoo-! These boss directory indexes expand the search scope by cooperating with search engines such as Google.
3. Meta-search
After receiving the user query, the meta search engine (METASearch Engine) searches over multiple search engines simultaneously and returns the results to the user. There are search engines above the search engine. The method is mainly focused on improving the searching speed, intelligently processing the searching result, setting the individual searching function and improving the user searching interface friendliness, and has higher recall ratio and precision ratio. More well-known meta search engines are InfoSpace, dogpile, vivisimo, metacrawler, dopile, ixquick, search engine, etc. Representative among the meta search engines in chinese is a star search engine.
In terms of search result ranking, some rank search results directly by source, such as dog pipe; some rearrange and combine the results according to a self-defined rule, such as Vivisimo.
4. Vertical search
Vertical search engines are a type of search engine that has evolved after 2006. Unlike general web search engines, vertical searches focus on specific search areas and search requirements (e.g., airline ticket searches, travel searches, life searches, novice searches, video searches, etc.), with a better user experience in their specific search areas. Compared with the general search with thousands of search servers, the vertical search needs low hardware cost, and has specific user requirements and various query modes.
5. Collective search
A collective search engine: the search engine is similar to a meta search engine except that rather than invoking multiple search engines simultaneously to search, it is selected by the user from several search engines provided, such as the search engine that hotboot introduced by the end of 2002.
6. Portal searching
Portal search engine: AOLSearch, MSNSearch, etc. although providing search services, there is neither a category directory nor a web page database by itself, and the search results are entirely from other search engines.
7. Free linking
Free linked list (Free For All Links abbreviated FFA): typically, the link entries are simply scrolled, and a small portion has a simple category list, but of a size greater than Yahoo-! The iso-catalog index is much smaller.
3. Domestic existing search engine
1. Comprehensive types:
hundred degrees, dog searching, 360 searching, channel, google, yahoo and other websites.
2. Shopping type:
naughty net, dang net, aleba, etc.
3. Intellectual property type:
national intellectual property office-patent retrieval China trademark network SooPAT
4. Service content and features provided by Baidu search engine
Features of the Baidu search engine include: hundred degree snapshot, web page preview/preview of all web pages, related search terms, mispronounced word correction prompts, mp3 search, flash search.
After the 3-month lightning plan (Blitzen Project) in 2002 starts, a series of products such as bar, knowledge, map, national science, encyclopedia, document, video, blog and the like are put out.
An integrated Search engine (All-in-One Search Page), also known as a "multiple engine synchronous Search system". The hundred degrees are that a plurality of independent search engines are linked on one WWW page, the search engines are required to be clicked or appointed during searching, the search input is performed once, and a plurality of engines are searched simultaneously. The integrated search engine does not have a self-built database, does not need to develop supporting technology, and certainly cannot control and optimize the search result.
5. Other domestic and relatively well-known search engines
1. In 12 months and 23 days 2003, the original intelligent search formally and independently works, and Chinese search is established. In month 2 2004, chinese search published desktop search engine webpig 1.0, and in month 3 2006, webpig was named IG (Internet Gateway).
2. In 6 th 2005, the new wave formally introduced an autonomously developed search engine "love". In 2007, new love questions used google search engines.
3. In 7 th and 1 st 2007, the web is easy to comprehensively adopt the independently developed channel searching technology, and the original comprehensive searching and web page searching are combined. The web page search, the picture search and the blog search provide services for web searching. The web page searching uses natural language processing, distributed storage and computing technology which is independently researched and developed; the picture searching initiative is based on the brand, model, even season and other advanced searching functions of the shooting camera; compared with similar products, the blog search has the advantages of comprehensive grabbing and timely updating, and provides innovative functions such as 'article preview', 'blog archives', and the like.
6. Recent advances in technology for search engines
Recent technological developments in search engines include the following:
1. improving understanding of user search questions
To overcome the shortcomings of keyword retrieval and directory queries, natural language intelligent answering has emerged. The user may enter a simple question, such as "how can kill virus ofcomputer? ". After analysis of the structure and content of the question, the search engine either directly gives the answer to the question or directs the user to reselect from several selectable questions. The natural language has the advantages that firstly, network communication is more humanized, and secondly, inquiry becomes more convenient, direct and effective. In the above example, if a keyword is used for query, most people will search by using the word "viruses", and the result will necessarily include many invalid information such as introduction of various viruses, how the viruses are generated, and the like, and use "how can killvirus of computer? By the aid of the method, the search engine can provide information of how to kill viruses to users, and searching efficiency is improved.
2. Processing the search result
(1) Search engine based on link evaluation
An excellent representation of a search engine based on link ratings is Google, whose original "link rating system" is based on the recognition that the importance of one web page depends on the number of links it is linked to by other web pages, particularly the number of links of some web pages that have been identified as "important". The evaluation system is very similar to the thought of science and technology quotation index, but because the Internet is developed in a commercialized environment, the linked number of a website is closely related to the commercial popularization of the website, and therefore, the evaluation system has a certain lack of objectivity.
(2) Search engine based on access popularity
A representative of a search engine based on access to popularity is direct hit, whose basic idea is that the website most people choose to access is the most important website. The importance ranking of the relevant websites is statistically determined based on the websites that were actually picked and visited by thousands of network users in the search results before and the time they spent on those websites, and thus, which websites best meet the search requirements of the users. Thus having typical crowding characteristics. This rating system has the same drawbacks as a search engine based on link ratings.
(3) Removing additional redundant information from the search result
It has been investigated that excessive additional information places a heavy burden on the user, and that in order to remove such excessive additional information, search techniques such as user customization, content filtering, etc. may be employed.
7. Those skilled in the art consider the direction of search engine development
1. Providing personalized search service (search for setting defined conditions)
Search engines have been several decades old from birth, during which search technology has been changing, from initial catalogue searches to keyword searches, and evolving speech searches, picture searches, etc., and are evolving. If the next trend of search engines is referred to, there is a consensus in the industry that personalized search engines are certainly the most interesting directions and will become the future of search engines.
In recent years, the technical changes of the search engines such as google, microsoft must answer and Chinese search reveal some end devices, personalized search is becoming the research direction of the search engines, and various personalized search platforms and functions are being developed in a controversial manner to meet different search requirements of users.
While current search engines can provide users with useful things, the search results obtained by different areas and individuals are not satisfactory, and the provided results are too huge in information and too much in repeated information, which is a place where the search engines are urgently needed to be improved.
2. Providing result information of an optimization scope
(1) Vertical topic search engine
The information on the network is huge like the sea, the network resources are increased at ten times speed, and a search engine is difficult to collect the network information of all topics, even if the information topics are collected comprehensively, the topics are difficult to be made accurate and professional due to the fact that the topic range is too wide, so that the search result is too much. Thus, the vertical topic search engine occupies a place in various search engines with high targeting and specialization, such as stock, weather, news and the like, has high pertinence, and compared with the general search engine with disordered mass information, the vertical search engine is more concentrated, specific and deep, and the satisfaction degree of the user on the query result is high. Those skilled in the art have recognized that vertical topic searches have significant room for development.
(2) Searching for non-www information
Searching of class information such as FTP is provided.
(3) Multimedia search engine
Multimedia retrieval mainly includes retrieval of sound, images, video.
3. Integrated information integration
People need not only professional information, but also whole association information. With the development of search technologies such as artificial intelligence, neural networks, grid computing and the like, we have an ability to integrate internet information, intelligently provide information which users really need, not simply what is needed, because the users do not know what they need when searching for many times.
4. Toward knowledge-based search engines
Future search engines will be developed towards knowledge-based search engines in an effort to provide more accurate and adaptable data to the searcher. The encyclopedia on the net has been developed like a spring bamboo shoot after rain; on the other hand, there are also some companies attempting to improve the search, and the search Agent such as the Copernic Agent is one of them, which is required to meet the requirements of users.
The well-known information (WebGenie) is a company that develops search engine products by using Text Mining technology, and uses artificial intelligence algorithm to achieve simple man-machine interaction modes that are lacking in search engines, such as associated word prompts, dynamic category word prompts, etc., which are more similar search engine products.
1. The defects and shortcomings of the existing domestic and foreign search engines are overcome by the example:
1. in terms of matters, the passive operation is complicated, and the effect is poor;
2. the searching efficiency is low, the accuracy is poor, the repeated inaccurate information is piled up, and the user satisfaction is poor;
3. facing the vast information of the current information age, the search efficiency and accuracy are not optimistic.
2. The technical solutions of these search engine companies have failed to overcome these drawbacks and deficiencies:
1. in the face of increasing information of great and great, the system is in a state of being fully passively payable, at best, only limited technical improvement is carried out, and the system is only a cup, a water and a firewood for improving customer satisfaction and is not helpful.
2. All of the search techniques listed above, in the face of increasingly more vast, unordered information, have fundamentally failed to meet the ever-increasing demands of users for information searching.
Disclosure of Invention
Based on the technical problems in the background technology, the invention provides a method and a system for designing a full-structure storage of an information file and no search engine based on transaction logic.
The invention provides a transaction logic-based information file full-structure storage and a search engine-free design method thereof, which comprises the following steps:
s1, when the network file is stored, the network file is converted into a text according to a file structure decomposition mark and stored in a file library;
s2, continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing;
and S3, only submitting keywords in the extraction process, and inquiring the degree of correlation according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions;
s4, expanding the corresponding text type association attribute and the text type translation grade of the article to be selected by a user;
s5, expanding to various discipline classifications, social classifications, historical classifications, composer classifications, application classifications and security classification statistical analysis for user selection;
s6, providing file confidentiality and anti-virus functions;
s7, providing a title code protection function;
s8, providing a technology checking function.
Preferably, in the step S3, dissipation structures are adopted to gather the discrete and vast information and related keywords thereof in the same group, and the unordered information of the vast information can be ordered according to various attributes, so that quick information search is realized.
Preferably, an important concept in the dissipation structure is "entropy", which is a concept used to characterize the degree of system order, and a system composed of a large number of subsystems has a boltzmann function relationship:
S=KB㏑W
wherein S is entropy of the system, KB is Boltzmann constant, W is microcosmic state number of the system corresponding to the macroscopic state, namely molecules forming the system are distributed and arranged according to different numbers when forming subsystems, the magnitude of entropy value of the system is irrelevant to an evolution process, entropy is a state function of the system, high entropy means that disorder degree of the system is increased, and low entropy corresponds to order degree increase.
Preferably, the dissipative structure formation must satisfy four conditions:
first, the system must be open, which is a fundamental condition for system formation and development, an isolated system will spontaneously tend to disorder, and its entropy will gradually increase;
second, the system must be in a nonlinear region away from equilibrium, and the equilibrium or linear balance can only make the system a dead structure that is never changed;
thirdly, the system must have fluctuations that are the cause of the system to jump into the dissipative structure branches, acting as a "trigger";
fourth, a nonlinear interaction mechanism must exist within the system, which is an inherent motive force for the system to evolve from unordered to ordered.
Preferably, the file structure is divided into a file structure, a paragraph structure, a sentence structure, a vocabulary structure, a text structure or a letter structure, and various current pictures, tables and multimedia file modules are referenced, and various pictures, tables and multimedia file modules contained in the file are respectively marked by adopting unified classification marks.
Preferably, the file structure = { [ text or letters ] } + { [ vocabulary ] } + { [ sentence ] } + { paragraph + { [ other file ] } + { [ picture ] } + { [ table ] } + { [ multimedia file ] }; paragraph structure = { [ text or letter ] } + { [ vocabulary ] } + { [ sentence ] }; sentence structure = { [ text or letter ] } + { [ vocabulary ] }; lexical structure = { [ text or letter ] }.
Preferably, the table does not include a table, a picture and a multimedia file module in the table.
The information file full-structure storage and no search engine design system based on the transaction logic comprises a network file acquisition module, a file structure conversion module, a new user inquiry module, an associated translation module, a classification frequency statistics module, a file security module and a file antivirus module.
Preferably, the network file acquisition module: when the network file is stored, the network file is converted into a text according to the file structure decomposition mark and stored in a file library; a file structure conversion module: continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing; new user query module: the extraction process only needs to submit keywords, and the related degree is inquired according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions; and (5) associating a translation module: expanding the corresponding text type association attribute of the article and the text type translation grade for the user to select; classification frequency statistics module: the method is expanded to various discipline classifications, social classifications, historical classifications, composer classifications, usage classifications and security classification statistical analyses for users to select; file security module and file antivirus module: providing file confidentiality and anti-virus functions and providing property code protection functions.
In the invention, the method and the system for designing the full-structure storage of the information file based on the transaction logic and no search engine have the following advantages:
1) The invention generates self-organizing ordered effect on unordered information, fundamentally solves the problem of low efficiency of iterative search on unordered information in vast and vast, and solves the problem of traditional search in fundamental sense.
2) The intelligent, accurate and philosophy-conforming prompt information is fundamentally provided for users.
3) The method can provide a great amount of association information which is not imaginable by the user and multi-level association information for the user.
Drawings
FIG. 1 is a diagram of a method and system for designing a full structure storage of information files based on transactional logic and no search engine;
fig. 2 is a schematic diagram of a file structure of a transaction logic-based information file full-structure storage and a search engine-free design method and system thereof.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments.
Referring to fig. 1-2, the method for storing the full structure of the information file based on the transaction logic and designing the no-search engine thereof comprises the following steps:
s1, when the network file is stored, the network file is converted into a text according to a file structure decomposition mark and stored in a file library;
s2, continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing;
and S3, only submitting keywords in the extraction process, and inquiring the degree of correlation according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions;
s4, expanding the corresponding text type association attribute and the text type translation grade of the article to be selected by a user;
s5, expanding to various discipline classifications, social classifications, historical classifications, composer classifications, application classifications and security classification statistical analysis for user selection;
s6, providing file confidentiality and anti-virus functions;
s7, providing a title code protection function;
s8, providing a technology checking function.
In the invention, the step S3 adopts a dissipation structure to gather the discrete and vast information and related keywords thereof in the same group, and can realize the ordered and unordered information of the vast according to various attributes, thereby realizing rapid information search.
In the invention, an important concept in the dissipation structure is entropy, which is a concept for describing the system order, and a system formed by a plurality of subsystems has a Boltzmann function relation:
S=KB㏑W
wherein S is entropy of the system, KB is Boltzmann constant, W is microcosmic state number of the system corresponding to the macroscopic state, namely molecules forming the system are distributed and arranged according to different numbers when forming subsystems, the magnitude of entropy value of the system is irrelevant to an evolution process, entropy is a state function of the system, high entropy means that disorder degree of the system is increased, and low entropy corresponds to order degree increase.
In the invention, the dissipation structure is formed to satisfy four conditions:
first, the system must be open, which is a fundamental condition for system formation and development, an isolated system will spontaneously tend to disorder, and its entropy will gradually increase;
second, the system must be in a nonlinear region away from equilibrium, and the equilibrium or linear balance can only make the system a dead structure that is never changed;
thirdly, the system must have fluctuations that are the cause of the system to jump into the dissipative structure branches, acting as a "trigger";
fourth, a nonlinear interaction mechanism must exist within the system, which is an inherent motive force for the system to evolve from unordered to ordered.
In the invention, the file structure is divided into a file structure, a paragraph structure, a sentence structure, a vocabulary structure, a text structure or a letter structure, various current pictures, forms and multimedia file modules are referenced, and various pictures, forms and multimedia file modules contained in the file are respectively marked by adopting unified classification marks.
In the present invention, the file structure = { [ text or letter ] } + { [ vocabulary ] } + { [ sentence ] } + { paragraph + { [ other file ] } + { [ picture ] } + { [ table ] } + { [ multimedia file ] }; paragraph structure = { [ text or letter ] } + { [ vocabulary ] } + { [ sentence ] }; sentence structure = { [ text or letter ] } + { [ vocabulary ] }; lexical structure = { [ text or letter ] }.
In the invention, the table does not comprise a table, a picture and a multimedia file module in the table.
The information file full-structure storage and no search engine design system based on the transaction logic comprises a network file acquisition module, a file structure conversion module, a new user inquiry module, an associated translation module, a classification frequency statistics module, a file security module and a file antivirus module.
In the invention, the network file acquisition module: when the network file is stored, the network file is converted into a text according to the file structure decomposition mark and stored in a file library; a file structure conversion module: continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing; new user query module: the extraction process only needs to submit keywords, and the related degree is inquired according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions; and (5) associating a translation module: expanding the corresponding text type association attribute of the article and the text type translation grade for the user to select; classification frequency statistics module: the method is expanded to various discipline classifications, social classifications, historical classifications, composer classifications, usage classifications and security classification statistical analyses for users to select; file security module and file antivirus module: providing file confidentiality and anti-virus functions and providing property code protection functions.
Illustrating the convenience and benefits of using the present invention by an end user
For example, a certain candidate searches for relevant Gao Jiao about school scale, set science system, teaching condition, calendar graduation distribution condition, recruitment information and the like, and by adopting the method, the target information can be quickly (tens of times faster than the traditional technology) searched according to the primarily input keywords of the searcher, and all the desired related information can be obtained at the same time. The search is as follows: the system can actively provide relevant information prompts for the searcher and can provide a large amount of accurate associated information of interest for the searcher. Such as: related information of the same kind of schools related to the school and the like can be provided, because the related information is self-organized by the time the invention finishes searching.
By combining the invention, a true multidimensional knowledge-associated 'knowledge base' is innovated, and true electronic publications and true electronic books capable of realizing knowledge transverse association at will, namely, electronic book browsers supported by the knowledge base, can be introduced to markets.
By combining the invention, a real-time information publishing system is innovated, and great revolution of publishing industry is brought.
Because of the self-organization characteristic of the information, the invention can realize various correlations of the information and can obtain the timely and accurate constraint effect of the constraint algorithm set by people. A large, truly data warehouse system can be formed.
The invention comprises the following steps: the internet establishes a platform for uploading, searching and downloading virtual information, so that a file can be issued (uploaded) to the virtual internet only through one IP address, and can be downloaded from one IP address on the virtual internet through any IP address, which is similar to a virtual public library, and is a complex huge system because the system is not a system controlled by a person or an organization, and the complexity of the system is huge.
In order to solve the problem, the invention starts from the basic structure of the file, and makes a plurality of individuals or organizations adopt a unified new file structure standard, a unified tool is adopted to operate, and a conversion means is used to be compatible with the file materials which do not adopt the new file structure standard, so that the efficiency and the storage resource problem are suspected, and the novel method is very efficient, because the structure conversion process is carried out while the materials are stored, the current file or a batch of files are only faced by the individuals or organizations, and the computer hardly feels time. The invention does not repeatedly store the characters, words and segment units, so that the repeated storage capacity of the computer can be reduced by times of orders of magnitude, and storage resources can be saved by times of orders of magnitude. How does the query efficiency do it? It is conceivable that the query efficiency would be surprisingly fast, since the files of the new solution are stored in a fully structured form, simply by taking the files directly to you, except for the interface response time with the querier selecting and determining the query conditions. It can be said that no so-called fuzzy query (known to the same person as the computer) in the meaning of "searching" of the computer exists at all, and the so-called fuzzy query is simply clear and white, and is just as light and easy to be put on a bookshelf to fetch books. Then is the difficulty of developing a search engine big? After the novel technology is adopted, the meaning of the current network search does not exist at all, and the search difficulty is not existed, namely, only one query condition interface program is written, and then the query module of the invention is called, so that the development difficulty is not existed at all.
The invention is effectively connected with objects, objects and people, people and people, information and information, information and people and information and objects at any time and any place. The dissipation structure theory research is adopted to gather the discrete and vast information and related keywords (words and meanings) in the same group, and can realize the ordering of the vast and unordered information according to various attributes, so as to realize rapid information search, provide intelligent thinking guidance for searching people, and easily and freely obtain target information to be searched. The method solves the problems that the current search engine technology seriously violates the principle of the system science from the aspect of the system science, fundamentally solves the problems that storage resources are seriously wasted and a snapshot resource library is repeatedly built in order to achieve the search target, and is an indispensible repeated and halved search result for people. The search engine is surmounted in the impression of people to irregularly stack the materials like a lazy secretary, and is disordered and difficult to search the materials. The method of establishing a database called 'snapshot' according to frequency and adding a quick search technology is adopted to realize a search engine, is not a scientific method, results are not ideal, one search target often has thousands of results with different sources, and the requirement of searching by a user is not met.
The foregoing is only a preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art, who is within the scope of the present invention, should make equivalent substitutions or modifications according to the technical scheme of the present invention and the inventive concept thereof, and should be covered by the scope of the present invention.

Claims (9)

1. The method for storing the full structure of the information file based on the transaction logic and designing the information file without a search engine is characterized by comprising the following steps:
s1, when the network file is stored, the network file is converted into a text according to a file structure decomposition mark and stored in a file library;
s2, continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing;
and S3, only submitting keywords in the extraction process, and inquiring the degree of correlation according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions;
s4, expanding the corresponding text type association attribute and the text type translation grade of the article to be selected by a user;
s5, expanding to various discipline classifications, social classifications, historical classifications, composer classifications, application classifications and security classification statistical analysis for user selection;
s6, providing file confidentiality and anti-virus functions;
s7, providing a title code protection function;
s8, providing a technology checking function.
2. The method for storing the whole structure of the information file and designing the no-search engine based on the transaction logic according to claim 1, wherein the step S3 is characterized in that the dissipation structure is collected in the same group from discrete and vast information and related keywords thereof, and the unordered information of the vast can be ordered according to various attributes, so that the quick information search can be realized.
3. The method for storing full structure of information file based on transaction logic and no search engine design according to claim 2, wherein an important concept in the dissipation structure is "entropy", which is a concept for describing the degree of order of a system, and a system formed by a plurality of subsystems has a boltzmann function relationship:
S=KB㏑W
wherein S is entropy of the system, KB is Boltzmann constant, W is microcosmic state number of the system corresponding to the macroscopic state, namely molecules forming the system are distributed and arranged according to different numbers when forming subsystems, the magnitude of entropy value of the system is irrelevant to an evolution process, entropy is a state function of the system, high entropy means that disorder degree of the system is increased, and low entropy corresponds to order degree increase.
4. The method for full structure storage of information files and no search engine design based on transactional logic as claimed in claim 2, wherein the dissipative structure is formed to satisfy four conditions:
first, the system must be open, which is a fundamental condition for system formation and development, an isolated system will spontaneously tend to disorder, and its entropy will gradually increase;
second, the system must be in a nonlinear region away from equilibrium, and the equilibrium or linear balance can only make the system a dead structure that is never changed;
thirdly, the system must have fluctuations that are the cause of the system to jump into the dissipative structure branches, acting as a "trigger";
fourth, a nonlinear interaction mechanism must exist within the system, which is an inherent motive force for the system to evolve from unordered to ordered.
5. The method for storing full structure of information file based on transaction logic and designing no search engine according to claim 1, wherein the file structure is divided into file structure, paragraph structure, sentence structure, vocabulary structure, text structure or letter structure, and the current various pictures, tables and multimedia file modules are referenced, and the various pictures, tables and multimedia file modules contained in the file are marked by unified classification marks.
6. The method for full structure storage of information files based on transactional logic and no search engine design thereof according to claim 5, wherein the file structure = { [ text or letters ] } + { [ vocabulary ] } + { [ sentence ] } + { paragraph + { [ other files ] } + { [ picture ] } + { [ table ] } + { [ multimedia file ] }; paragraph structure = { [ text or letter ] } + { [ vocabulary ] } + { [ sentence ] }; sentence structure = { [ text or letter ] } + { [ vocabulary ] }; lexical structure = { [ text or letter ] }.
7. The transaction logic based information file full structure storage and no search engine design method according to claim 5, wherein the table does not include a table, a picture, a multimedia file module in the table.
8. The information file full-structure storage and search engine-free design system based on the transaction logic is characterized by comprising a network file acquisition module, a file structure conversion module, a new user inquiry module, an associated translation module, a classification frequency statistics module, a file security module and a file antivirus module.
9. The transaction logic based information file full structure storage and no search engine design system according to claim 8, wherein the network file acquisition module: when the network file is stored, the network file is converted into a text according to the file structure decomposition mark and stored in a file library; a file structure conversion module: continuously expanding a classification coding library for newly appeared letters, words, sentences and paragraphs in the process of storing; new user query module: the extraction process only needs to submit keywords, and the related degree is inquired according to the classification of the words, the words and the sentences, for example: the word inquiry is automatically related to related words, related sentences, related segments and related articles related to the keywords, the appointed inquired hierarchy (words, sentences, segments and texts) can be prompted, the frequency and the repeatability can be prompted, and related words, sentences and segments are prompted to be selected as inquiry key conditions; and (5) associating a translation module: expanding the corresponding text type association attribute of the article and the text type translation grade for the user to select; classification frequency statistics module: the method is expanded to various discipline classifications, social classifications, historical classifications, composer classifications, usage classifications and security classification statistical analyses for users to select; file security module and file antivirus module: providing file confidentiality and anti-virus functions and providing property code protection functions.
CN202310368352.1A 2023-04-09 2023-04-09 Information file full-structure storage based on transaction logic and search engine-free design method and system thereof Pending CN117851535A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310368352.1A CN117851535A (en) 2023-04-09 2023-04-09 Information file full-structure storage based on transaction logic and search engine-free design method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310368352.1A CN117851535A (en) 2023-04-09 2023-04-09 Information file full-structure storage based on transaction logic and search engine-free design method and system thereof

Publications (1)

Publication Number Publication Date
CN117851535A true CN117851535A (en) 2024-04-09

Family

ID=90536787

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310368352.1A Pending CN117851535A (en) 2023-04-09 2023-04-09 Information file full-structure storage based on transaction logic and search engine-free design method and system thereof

Country Status (1)

Country Link
CN (1) CN117851535A (en)

Similar Documents

Publication Publication Date Title
Seymour et al. History of search engines
US6199067B1 (en) System and method for generating personalized user profiles and for utilizing the generated user profiles to perform adaptive internet searches
Kumar et al. Keyword query based focused Web crawler
Cafarella et al. Structured data on the web
US9026543B2 (en) System and method for generating a relationship network
US8037051B2 (en) Matching and recommending relevant videos and media to individual search engine results
US20080154886A1 (en) System and method for summarizing search results
Lee–Smeltzer Finding the needle: controlled vocabularies, resource discovery, and Dublin Core
Baker et al. A novel web ranking algorithm based on pages multi-attribute
Paananen Comparative analysis of yandex and google search engines
Manral et al. An innovative approach for online meta search engine optimization
US7490082B2 (en) System and method for searching internet domains
CN117851535A (en) Information file full-structure storage based on transaction logic and search engine-free design method and system thereof
Gurrin et al. Dublin City University experiments in connectivity analysis for TREC-9.
Zhang Application of data storage and information search in english translation corpus
Bhardwaj et al. Structure and Functions of Metasearch Engines: An Evaluative Study.
Anil et al. Multidimensional user data model for web personalization
Sharma et al. A Novel Architecture for Search Engine using Domain Based Web Log Data.
KR102434880B1 (en) System for providing knowledge sharing service based on multimedia platform
Bădărînză et al. A dataset for evaluating query suggestion algorithms in information retrieval
Choudhary A comparative analysis of various web search engines
Abd Elraouf et al. An efficient ranking module for an Arabic search engine
Duklan et al. Classification of search engine optimization techniques: A data mining approach
Kum word quer
Deshmukh et al. URL Mining Using Agglomerative Clustering Algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination