CN104823169A - Index configuration for searchable data in network - Google Patents



Publication number
CN104823169A
Authority
CN
China
Prior art keywords
data
storage allocation
data field
size
subregion
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201380053433.7A
Other languages
Chinese (zh)
Other versions
CN104823169B (en)
Inventor
J·M·高德博格
J·B·汉德勒
A·M·A·麦克哈尼
E·K·E·恩沃卡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
A9 com Inc
Original Assignee
A9 com Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/650,931 (US9507750B2)
Priority claimed from US13/650,961 (US9047326B2)
Application filed by A9 com Inc
Priority to CN201811424497.4A (CN110096502A)
Publication of CN104823169A
Application granted
Publication of CN104823169B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/182 Distributed file systems
    • G06F16/1824 Distributed file systems implemented using Network-attached Storage [NAS] architecture
    • G06F16/1827 Management specifically adapted to NAS
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures
    • G06F16/2272 Management thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An entity using a computing device can upload searchable data to a network service to be indexed and stored. The data can include a plurality of data fields, each data field having one or more associated values. The network service can analyze the data fields and their respectively associated values to determine data field types for the data fields and search options to be enabled for the data fields. Based at least in part on the data field types and the search options, the network service can generate a search index configuration/schema. Based at least in part on the generated search index configuration/schema, the network service can generate a search index for the data. In some embodiments, the network service can also convert the data into a format compatible with the search index.

Description

Index configuration for searchable data in a network
Background
Computing devices are commonly used to communicate over networks such as the Internet. Network services offered by service providers are becoming more commonplace. Computing devices are frequently used to connect to network services, which can provide services such as storing and retrieving searchable data for use by the computing devices, or providing additional processing power to the computing devices. With respect to network storage of searchable data, a user of a computing device typically needs to select a configuration and/or format for the data so that the data can be indexed and stored by the network service. Conventional approaches typically require the user to determine an appropriate configuration for the data. Conventional approaches may also prescribe a format to which the user's data must conform, thereby requiring the user to convert the data into that format. This can be inconvenient, cumbersome, or difficult for a user who wants to use the network service to store and search data, and can reduce the overall user experience.
Brief Description of the Drawings
Various embodiments in accordance with the present disclosure are described with reference to the drawings, in which:
Fig. 1 illustrates an example environment in which aspects of the various embodiments can be utilized;
Fig. 2 illustrates an example system embodiment for index configuration for searchable data in a networked environment;
Fig. 3 illustrates an example web browsing environment in which index configuration for searchable data in a networked environment can be utilized;
Fig. 4 illustrates an example search index that can be generated in accordance with the various embodiments;
Fig. 5 illustrates an example method embodiment for index configuration for searchable data in a networked environment;
Fig. 6 illustrates an example device that can be used to implement aspects of the various embodiments;
Fig. 7 illustrates example components of a client device such as the device shown in Fig. 6; and
Fig. 8 illustrates an environment in which various embodiments can be implemented.
Detailed Description
Systems and methods are described for generating an index configuration that can be used to generate a search index for data received over at least one network. At least some embodiments enable a computing device to upload data over a network (e.g., the Internet) to a storage allocation provided by a network service (i.e., a network service provider). The network service can analyze the uploaded data to determine a type (i.e., a data field type) for each data field in a plurality of data fields. The network service can also analyze the uploaded data to determine whether to enable one or more search options for each of the plurality of data fields included in the uploaded data.
At least some embodiments enable a computing device to upload data over a network (e.g., the Internet) to a storage allocation provided by a network service (i.e., a network service provider, web service, etc.). One or more users/entities (e.g., using one or more computing devices) can search the uploaded data over the network using a search index, which can be provided by the network service.
In some embodiments, the uploaded data can include a plurality of data fields. The network service can analyze the uploaded data to determine a type (i.e., a data field type) for each data field in the plurality of data fields. For example, each data field can have a type that is an integer type, a text type, or a literal type.
In addition, the network service can analyze the uploaded data to determine whether to enable one or more search options for each of the plurality of data fields included in the uploaded data. For example, the network service can determine, for each respective data field, whether to enable an option to include the respective data field in the search index to be generated. The network service can also determine, for each respective data field, whether to enable an option to calculate a facet count for the respective data field. Further, the network service can determine, for each respective data field, whether to enable an option to return/provide the values associated with the respective data field in response to a search query.
In some embodiments, the network service can generate an index configuration (i.e., a search index configuration, schema, or index settings) for the data based at least in part on the determined data field types and the search options to be enabled. The network service can then generate a search index for the data based at least in part on the index configuration.
Various other functions and advantages are described and suggested below as may be provided in accordance with the various embodiments.
Fig. 1 illustrates an example environment 100 in which aspects of the various embodiments can be utilized. The example environment 100 can include at least one computing device 102, a network 104 (e.g., the Internet, an intranet, a local network, a local area network, etc.), and a network service 106 (i.e., a network service provider, web service, etc.). The at least one computing device 102 can be communicatively connected to the network service 106 over the network 104. In some embodiments, the computing device 102 can communicate with the network service 106 without a network 104 such as the Internet. As shown in Fig. 1, there can also be a user or other entity 108 (e.g., an individual, company, organization, group, etc.) of the at least one computing device 102. The user or entity 108 can transmit data 110 from the at least one computing device 102 over the network 104 to the network service 106 (and vice versa).
In some embodiments, the network service 106 can include and/or utilize one or more hosts or servers connected to the network 104. For example, the network service 106 can rent storage space to a customer, such as the user or another entity 108 (e.g., a company, organization, group, individual, etc.) of the device 102. The user/entity 108 of the computing device 102 can therefore use the network 104 to store data from the device 102 on the network service 106. In other words, the user/entity 108 and/or the device 102 can utilize network-based computing storage via the network service 106.
In one example, the computing device 102 transmits data 110 to be stored on the network service 106 over the network 104, as shown in Fig. 1. The data 110 can be any data used for network-based computing, such as data for search, database storage, running applications, running virtual machines, running operating systems, etc. The computing device 102 can transmit the data 110 to be stored in a storage allocation provided by the service 106. For example, the user/entity 108 can purchase or rent storage space on the service 106, and a storage allocation can be allocated and assigned to the user/entity 108. In some embodiments, the user/entity 108 can have a particular account and/or storage allocation on the service 106; the storage space (e.g., storage allocation) assigned to the entity 108 can be associated with the account of the entity 108.
The entity 108 may also want the network service 106 to provide a search index for the data 110. Conventional approaches typically require the entity 108 to first provide a configuration (i.e., an index configuration, schema, or index settings) for the data 110 to be indexed. Alternatively, conventional approaches may prescribe a configuration/format (e.g., Search Data Format (SDF)) to which the entity's data 110 must conform, thereby requiring the entity 108 to convert its data 110 into the required configuration. This can be inconvenient, cumbersome, or difficult for the entity 108.
In some embodiments, the entity 108 can transmit the data 110 to the network service 106, and the network service 106 can automatically (e.g., without an instruction or request from the entity 108) analyze the data 110 and generate an index configuration (e.g., a search index configuration, search index schema, etc.) for the data 110. For example, in some embodiments, the network service 106 can analyze the data 110 by determining data field types 112 for one or more data fields included in the data 110 and determining search options 114 to be enabled for the one or more data fields included in the data 110.
With respect to determining the types 112 of the data fields, there can be several data field types that can be associated with the data 110 (e.g., documents, files, etc.), such as an integer data field, a literal data field, or a text data field. In some embodiments, the data 110 can include a plurality of data fields, each data field containing values (e.g., the data field "title" can have a value of "ABCD-brand shirt"; the data field "price" can have a value of "$20"; etc.). The network service 106 can analyze the plurality of data fields included in the data 110 to determine the data field type of each data field in the plurality of data fields.
For example, for each data field, the network service 106 can determine whether the values of the respective data field include an amount of integers that exceeds a specified integer amount threshold (e.g., the values of the "price" data field are all integers); if so, the respective data field can be determined to be of the integer data field type. The network service 106 can also determine whether a data field is of the literal data field type by, for example, determining at least one of the following: the values associated with the data field have an amount of alphabetic characters that exceeds a specified literal amount lower threshold but is below a specified literal amount upper threshold; the number of distinct values associated with the data field is below a specified literal distinct-number threshold; the percentage of distinct values is below a specified literal distinct-percentage threshold; or the length of the values is below a specified literal length threshold. In some embodiments, the network service 106 can consider, for example, the length of the data field values and the frequency and/or percentage of distinct values among the data field values in order to identify a data field as text; if there are many distinct values and the values are long (e.g., have a number of alphabetic characters exceeding a threshold), then the data field is likely to be text. In some embodiments, if a data field is neither of the integer type nor of the literal type, the data field can be text.
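The type-determination heuristics described above can be illustrated with a minimal Python sketch. The description only states that "specified" thresholds exist; every threshold value, the function names, and the handling of a leading "$" are assumptions made for illustration. The sketch also requires all of its literal conditions to hold, which is stricter than the "at least one of" wording above.

```python
from typing import List

# All thresholds are illustrative assumptions; the description does not give values.
INTEGER_RATIO_THRESHOLD = 0.95    # share of values that must parse as integers
LITERAL_MAX_DISTINCT = 20         # literal fields have few distinct values
LITERAL_MAX_DISTINCT_RATIO = 0.5  # distinct values relative to all values
LITERAL_MAX_LENGTH = 30           # literal values are short


def _is_integer(value: str) -> bool:
    """Return True if the value parses as an integer, ignoring a leading '$'."""
    try:
        int(value.strip().lstrip("$"))
        return True
    except ValueError:
        return False


def infer_field_type(values: List[str]) -> str:
    """Classify a data field as 'integer', 'literal', or 'text' from its values."""
    if not values:
        return "text"
    integer_ratio = sum(_is_integer(v) for v in values) / len(values)
    if integer_ratio > INTEGER_RATIO_THRESHOLD:
        return "integer"
    distinct = set(values)
    if (len(distinct) <= LITERAL_MAX_DISTINCT
            and len(distinct) / len(values) <= LITERAL_MAX_DISTINCT_RATIO
            and max(len(v) for v in values) <= LITERAL_MAX_LENGTH):
        return "literal"
    # Anything that is neither integer nor literal falls back to text.
    return "text"
```

Under these assumed thresholds, a "color" field whose few short values repeat often would be classified as literal, while a field of long, mostly unique descriptions would fall through to text.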
With respect to determining the search options 114, the network service 106 can determine one or more search options 114 to be enabled for the data 110 (i.e., for its data fields). For example, having determined the data field type of a data field included in the data 110, the network service 106 can determine whether to enable an option to include the data field in the search index to be generated, an option to calculate a facet count for the data field, and/or an option to return/provide the values of the data field in response to a search.
For example, if the data field type of a data field is determined to be text (e.g., the data field is "product description" and the values are long paragraphs), the network service 106 can choose not to enable the option to include the data field (and its values) in the search index. In another example, for a data field having the integer data field type (e.g., the data field is "year made" and the values are years), the network service 106 can choose to enable the option to include the data field in the search index to be generated, and the service 106 can enable the option to calculate a facet count for the data field. A facet count can be a count of how many search results fall into a certain category of the data field. For example, if the data field is "year made", the network service 106 can determine that it is useful to provide facet counts indicating how many search results are associated with each category; for example, "1984 (23), 2002 (12), 2010 (18)" shows facet counts for the "year made" data field, in which 23 search results are associated with "1984", 12 search results are associated with "2002", and 18 search results are associated with "2010".
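As a rough illustration of the facet count idea, the following Python sketch tallies how many search results carry each value of a given data field and renders the tallies in the "value (count)" style used in the example above. The function names and the result representation (a list of dictionaries) are assumptions for illustration only.

```python
from collections import Counter
from typing import Dict, Iterable


def facet_counts(results: Iterable[Dict[str, object]], field: str) -> Dict[object, int]:
    """Count how many search results have each value of the given data field."""
    counts: Counter = Counter()
    for doc in results:
        if field in doc:  # results lacking the field do not contribute
            counts[doc[field]] += 1
    return dict(counts)


def format_facets(counts: Dict[object, int]) -> str:
    """Render facet counts as 'value (count)' pairs, sorted by value."""
    return ", ".join(f"{value} ({n})" for value, n in sorted(counts.items()))


results = [{"year": 1984}, {"year": 2002}, {"year": 1984}, {"title": "untitled"}]
print(format_facets(facet_counts(results, "year")))  # prints: 1984 (2), 2002 (1)
```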
In some embodiments, the network service 106 can also determine whether to enable returning the values of a data field. For example, not all searchable data fields (and their values) need to be returned (e.g., retrieved and presented) in response to a search request. The network service 106 can determine whether to return the values of a data field.
Turning now to generation of the configuration for the data 110, the network service can automatically (e.g., without an instruction from the entity 108) generate a configuration (e.g., a search index configuration, schema, etc.) for the data 110. In some embodiments, the configuration can help determine, at least in part, how the data 110 is to be indexed; the index configuration can govern, at least in part, how the data 110 will be indexed. The configuration or schema can specify the data field type of each data field included in the data 110, indicate whether each data field is searchable, indicate whether each data field is rankable (e.g., sortable), and include other similar information useful for building the index. After generating the configuration for the data 110 to be indexed, the network service 106 can generate a search index for the data 110 based at least in part on the generated configuration.
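A minimal sketch of how such a configuration might be assembled is shown below. The dictionary layout, the option names (`search_enabled`, `facet_enabled`, `sort_enabled`, `return_enabled`), and the rules mapping field types to options are all assumptions for illustration; the description leaves the actual decision logic and schema format open.

```python
from typing import Dict


def build_index_config(field_types: Dict[str, str]) -> Dict[str, Dict[str, object]]:
    """Derive a per-field index configuration from inferred data field types."""
    config: Dict[str, Dict[str, object]] = {}
    for name, field_type in field_types.items():
        compact = field_type in ("integer", "literal")
        config[name] = {
            "type": field_type,
            # Assumed rule: long free-text fields are left out of the index,
            # mirroring the "product description" example.
            "search_enabled": compact,
            # Assumed rule: facet counts and sorting only suit compact value sets.
            "facet_enabled": compact,
            "sort_enabled": compact,
            # Assumed default: values are returned unless disabled later.
            "return_enabled": True,
        }
    return config


config = build_index_config({"color": "literal", "price": "integer", "description": "text"})
```

Keeping the configuration as plain data means the same structure could drive index generation, facet calculation, and result rendering without further analysis of the uploaded data.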
Fig. 2 illustrates an example web browsing environment 200 in which index configuration for searchable data in a networked environment can be utilized. The example web browsing environment 200 can include an example web page 202 rendered by an application such as a web browser. In this example, the web page 202 can be provided by a network service associated with the domain ABCD.com.
For example, a user/entity (e.g., a customer of the network service) can be a retailer and can upload data relating to shirts for sale. The data can be indexed and stored by the network service and made available for searching by others, such as potential customers of the user/entity. The network service can analyze the data to determine a type (i.e., a data field type) for each data field included in the data. For example, the data relating to the sale of shirts can include data fields such as "color" 206, "size" 208, "price" 210, "description", and other fields. The network service can analyze the values of each data field to determine the type of each respective data field. The network service can also determine one or more options (e.g., search options) to be enabled for each data field. The network service can then generate a configuration/schema for the data to be indexed. Subsequently, the network service can generate an index for the data based on the configuration/schema.
For example, the network service can identify the "color" data field, determine that the values of the data field (e.g., "red", "blue", "white", "green", etc.) are letters/words, and identify the "color" data field as being of the literal type. (In this example, the data and values (e.g., "red", "blue", "white", "green", etc.) associated with the "color" data field are uploaded by the entity.) In another example, the network service can identify the "size" data field in at least a portion of the uploaded data and determine that the values included in the "size" data field are numeric. In this example, the network service can determine that the "size" data field is of the integer type. In another example, the network service can identify the values of the "description" data field in at least a portion of the uploaded data and can determine that the values include both numeric and alphabetic characters, and/or that the values are long in terms of character count, and/or that the values have many distinct terms/phrases/symbols. In this example, the network service can determine that the "description" data field is text.
With respect to search options, the network service can determine, for each of the data fields, whether to enable an option to include the respective data field in the search index to be generated. For example, in some embodiments, the "description" data field (and its values) can be omitted from the search index. If so, a query run against the search index will not search the "description" data field. However, some embodiments can include the "description" data field and its values in the search index.
In addition, the network service can determine whether to enable an option to calculate a facet count for each data field. As mentioned above, a facet count indicates how many results matching a search query have a particular value (or range of values) for a particular data field. For example, as shown in Fig. 2, the "color" data field with the "red" value has a facet count of 23 (i.e., 23 search results for "red" shirts), while the "blue" value of the "color" data field has a facet count of 28 (i.e., 28 search results for "blue" shirts), and so on. In some embodiments, the values can overlap (i.e., need not be an exact match). For example, a shirt with blue and red stripes can be associated with both the "blue" and "red" values and/or can have other values. In some embodiments, the network service can determine that facet counts should be calculated for some data fields but need not be calculated for all data fields. For example, the network service can determine that there should be facet counts for "color", "size", and "price", but not for "description".
Further, the network service can determine whether to enable returning the values of a data field. For example, the data can include a data field "internal product identification number" whose values are product identifiers internal to the entity and not intended to be shown to the entity's customers; accordingly, the network service can determine not to enable returning the values of such a data field.
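The effect of the return option can be sketched as a simple projection over a matched record. The field names, the `return_enabled` flag, and the configuration layout below are hypothetical; the sketch only illustrates withholding fields, such as an internal product identifier, from search responses.

```python
from typing import Dict

# Hypothetical per-field configuration; only the return option is shown.
CONFIG: Dict[str, Dict[str, bool]] = {
    "color": {"return_enabled": True},
    "price": {"return_enabled": True},
    "internal_product_id": {"return_enabled": False},
}


def project_returnable(doc: Dict[str, object],
                       config: Dict[str, Dict[str, bool]]) -> Dict[str, object]:
    """Keep only the fields whose values may be returned in a search response."""
    return {
        name: value
        for name, value in doc.items()
        # Fields missing from the configuration are withheld by default (assumption).
        if config.get(name, {}).get("return_enabled", False)
    }


hit = {"color": "red", "price": 20, "internal_product_id": "X-123"}
print(project_returnable(hit, CONFIG))  # prints: {'color': 'red', 'price': 20}
```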
It is contemplated that there can be additional options, as well as data relating to other items, that a person of ordinary skill in the art would recognize. For example, the network service can determine whether to enable an option that makes a data field rankable (e.g., sortable). With reference to Fig. 2, in some embodiments, the "price" data field can be ranked/sorted by its values (e.g., from lowest price to highest price, from highest price to lowest price, etc.), the "color" data field can be sorted alphabetically (not shown in Fig. 2), and so on. In another example (not shown), there can be data relating to media files such as music, videos, books, photos, etc. Example data fields for media files can include, but are not limited to, "title", "artist/author", "creation time", "price", "rating", etc.
Having determined the types of the data fields included in the data and the one or more search options for the data fields included in the data, the network service can generate a configuration (i.e., a search index configuration, schema, etc.) for the data, the configuration being generated based at least in part on the determined data field types and search options.
After generating the configuration, the network service can generate a search index for the data based at least in part on the generated configuration. The data provided by the entity can thus be stored using the network service together with the search index generated for the data by the network service.
Fig. 3 illustrates an example system embodiment 300 for index configuration for searchable data in a networked environment. The example system embodiment 300 can include a system controller 302, at least one communications transceiver 304, a data field type analyzer 306, a search option analyzer 308, an index configuration generator 310, an index generator 312, and at least one storage allocation 314.
The system controller 302 can facilitate the system in performing the various operations of index configuration for searchable data in a networked environment. The system controller 302 can communicate with the at least one communications transceiver 304 to facilitate data transmission to, and/or data reception from, one or more sources external to the system 300, as well as to facilitate data communication within the system.
Data received by the system 300 via the communications transceiver 304 (e.g., from an entity) can be analyzed by the data field type analyzer 306 to determine the type associated with each of the data fields included in the data. The data can also be analyzed by the search option analyzer 308 to determine whether to enable one or more search options for each of the data fields included in the data. Based at least in part on the determined data field types and the one or more determined search options, the index configuration generator 310 can generate a search index configuration/schema. Subsequently, based at least in part on the generated search index configuration/schema, the index generator 312 can generate a search index for the data. The data and the search index generated for the data can be stored in the one or more storage allocations 314.
It is contemplated that the various components and/or portions of the example system 300 can be implemented as hardware, software, or a combination of both. For example, the various portions of the system 300 can be implemented via circuitry, a processor, an application, a portion of program code, an algorithm, or any combination thereof. It is further contemplated that Fig. 3 is an example and is intended for illustrative purposes only. For example, the components need not be configured as shown in Fig. 3. In some embodiments, the components need not be closely coupled to one another and can instead be spread across a more distributed system. For example, a component such as the index generator can reside on a separate/different network and/or system while still maintaining a communicative connection to the other components.
Fig. 4 illustrates an example search index 400 that can be generated in accordance with various embodiments of the present disclosure. With reference to Fig. 4, there can be a root node 402 in the search index. In the example of Fig. 4, the data can be uploaded by an entity such as a T-shirt retailer. The data can correspond to information about T-shirts (root node 402) that the entity makes available for sale. There can be parent nodes (e.g., 404, 406, 408) representing the data fields of the data relating to the T-shirts. For example, the T-shirts can have a color data field 404, a size data field 406, and a price data field 408.
Continuing with the example of Fig. 4, the data fields can have child nodes (e.g., 410, 412, 414, 416, 418) representing the values in each respective data field. For example, there can be at least two colors (red 410 and blue 412), one size (medium 414), and two price ranges (<$10 416 and $10-$20 418). There can also be a set of search results/items (e.g., T-shirts 420, 422, 424, 426, 428, 430), which can correspond to one or more of the data fields and values.
In this example, all three data fields (color 404, size 406, and price 408) are to be included in the search index, can have facet counts, and can have their values provided/returned in response to relevant search queries. For example, as shown in Fig. 4, color: red 410 can have a facet count of three, and color: blue 412 can have a facet count of two. Size: medium 414 can have a facet count of two. Price: <$10 416 can have a facet count of one, and price: $10-$20 418 can have a facet count of two. Further, a search query for color: red 410, for example, will return T-shirts 422, 424, and 428; a search for both red 410 and blue 412, for example, will return T-shirt 422; and so on. Although the example search index 400 is shown as a tree structure, it is contemplated that a search index can be generated in a number of alternative ways and/or using other structures.
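The lookups in this example can be reproduced with a small inverted index, sketched below in Python as one of the alternative structures mentioned above rather than the tree of Fig. 4. The description fixes only the facet counts and the results of the two example queries; the assignment of the remaining values to individual T-shirts (for instance, which second shirt is blue) is an assumption chosen to match those counts.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

# Item attributes are partly assumed; only the counts and the two example
# query results are given in the description.
ITEMS: Dict[int, Dict[str, List[str]]] = {
    420: {"size": ["medium"]},
    422: {"color": ["red", "blue"]},
    424: {"color": ["red"], "price": ["<$10"]},
    426: {"size": ["medium"], "price": ["$10-$20"]},
    428: {"color": ["red"]},
    430: {"color": ["blue"], "price": ["$10-$20"]},
}

Term = Tuple[str, str]  # (data field, value)


def build_index(items: Dict[int, Dict[str, List[str]]]) -> Dict[Term, Set[int]]:
    """Map each (field, value) pair to the set of item ids that carry it."""
    index: Dict[Term, Set[int]] = defaultdict(set)
    for item_id, fields in items.items():
        for field, values in fields.items():
            for value in values:
                index[(field, value)].add(item_id)
    return dict(index)


def search(index: Dict[Term, Set[int]], *terms: Term) -> Set[int]:
    """Return the ids of items matching every requested (field, value) term."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


index = build_index(ITEMS)
print(sorted(search(index, ("color", "red"))))                     # [422, 424, 428]
print(sorted(search(index, ("color", "red"), ("color", "blue"))))  # [422]
```

In this representation the facet count for a value is simply the size of its posting set, for example `len(index[("color", "blue")])`.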
Fig. 5 illustrates an example method embodiment 500 for index configuration for searchable data in a networked environment. Again, it should be understood that, within the scope of the various embodiments, there can be additional, fewer, or alternative steps performed in similar or alternative orders, or in parallel, unless otherwise stated. At step 502, the example method embodiment 500 can receive data to be indexed. For example, the data to be indexed can be uploaded by an entity, and the data can include a plurality of data fields (or at least one data field). In some embodiments, the example method can also determine the names of the data fields associated with the data. At step 504, the example method 500 can determine the types of the data fields associated with the data. For example, the method can determine, from a plurality of field types, a field type associated with each data field in the plurality of data fields. The plurality of field types can include, but is not limited to, at least one of an integer type, a literal type, or a text type. The type of a data field can be determined from among the plurality of field types. In some embodiments, the plurality of data fields and their types and/or names can be identified based on tags, signals, or other indicators. At step 506, the method 500 can determine one or more search options to be enabled for the data fields associated with the data. For example, the one or more search options can include at least one of the following: an option to include the respective data field in the search index to be generated; an option to calculate a facet count for the respective data field; or an option to provide one or more values associated with the respective data field. Step 508 can include generating an index configuration for the data based at least in part on the types of the data fields and the one or more search options. Subsequently, at step 510, the method 500 can generate a search index for the data based at least in part on the index configuration for the data. In some embodiments, the search index can be generated based on whether the data is structured data, free text, or a combination of both. In some embodiments, the example method can also make at least one of the data, the index configuration, or the index available to be searched by one or more search queries.
The various out of Memory be included in index configurations can be there are.Such as, whether configuration can be preserved can facet (that is, whether should calculate the face number of data field) about data field, the information that whether data field can classify (that is, whether the Search Results with data field should be classified) etc.
In some embodiments, network service can convert the data receiving in the first format/upload to second form, and described second form and search index are compatible mutually and the data being converted into the second form can be stored in one or more storage allocation.Such as, network service can receive data from entity, and described data can have any one or multiple in some various forms, as .PDF .DOC .DOCX .CSV .JSON .XML etc.Network service data can be converted to automatically can with network services is compatible mutually (such as, can be by ... identify, can be by ... use etc.) form, as search data form (SDF).
In some embodiments, the network service can convert the data by comparing the first format with the second format and modifying at least one data field associated with the first format to correspond to at least one data field associated with the second format. For example, the network service can compare the format of the data received/uploaded from the entity with its own format and modify the received format so that it is compatible with the network service. This can include identifying whether one or more data fields should be added to, removed from, or changed in the format.
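The add/remove/change comparison described above can be sketched as follows. This is only an illustration under stated assumptions: `convert_record`, the `renames` mapping, and the target field dictionary are hypothetical, and the target layout shown in the usage note is not the actual SDF layout.

```python
def convert_record(record, target_fields, renames=None):
    """Map one record from a first format onto a second format's fields.

    `target_fields` maps each field name of the second format to a default
    value; `renames` maps first-format names to second-format names.
    Both are assumptions made for this sketch.
    """
    renames = renames or {}
    converted = {}
    for name, value in record.items():
        # Change: rename the field, then normalize its spelling.
        target_name = renames.get(name, name).lower().replace(" ", "_")
        # Remove: fields with no counterpart in the second format are dropped.
        if target_name in target_fields:
            converted[target_name] = value
    # Add: fields required by the second format but absent from the input.
    for name, default in target_fields.items():
        converted.setdefault(name, default)
    return converted
```

For instance, calling `convert_record({"Title": "Dune", "Pub Year": 1965, "internal_id": 7}, {"title": "", "year": 0, "author": ""}, renames={"Pub Year": "year"})` keeps `title`, renames `Pub Year` to `year`, drops `internal_id`, and adds an empty `author`.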
In some embodiments, the network service can determine that the type of a data field is the integer type based on determining that a value associated with the data field has an amount of integer characters exceeding a specified integer amount threshold. In addition, the network service can determine that the type of the data field is the literal type by determining at least one of the following: the value associated with the data field has an amount of alphabetic characters above a specified literal amount lower threshold but below a specified literal amount upper threshold; the number of distinct values associated with the data field is below a specified literal distinct-count threshold; the percentage of distinct values is below a specified literal distinct-percentage threshold; or the length of the value is below a specified literal length threshold. Further, the network service can determine that the type of the data field is the text type based on determining at least one of the following: the value associated with the data field has an amount of at least one of integer or alphabetic characters exceeding a specified text amount threshold; the number of distinct characters exceeds a specified text distinct-count threshold; the percentage of distinct characters exceeds a specified text distinct-percentage threshold; or the length of the characters exceeds a specified text length threshold.
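The threshold tests above can be sketched in Python as follows. The function name `infer_field_type` and all threshold values are assumptions made for illustration; the disclosure only says the thresholds are "specified". The sketch also combines the literal tests more strictly than "at least one of" (it requires short values in addition to low distinctness), which is a simplification chosen for this example.

```python
def infer_field_type(values, integer_ratio=0.9, literal_max_distinct=20,
                     literal_max_distinct_pct=0.5, literal_max_length=30):
    """Classify a data field as "integer", "literal", or "text".

    All thresholds are hypothetical defaults, not values from the disclosure.
    """
    strings = [str(v).strip() for v in values if str(v).strip()]
    if not strings:
        return "text"  # nothing to inspect; fall back to the most general type

    # Integer test: the share of digit characters exceeds the threshold.
    # Only digits are counted, so a sign character counts against the ratio.
    digit_chars = sum(ch.isdigit() for s in strings for ch in s)
    total_chars = sum(len(s) for s in strings)
    if digit_chars / total_chars >= integer_ratio:
        return "integer"

    # Literal test: short values drawn from a small or repetitive vocabulary.
    distinct = set(strings)
    distinct_pct = len(distinct) / len(strings)
    longest = max(len(s) for s in strings)
    few_distinct = (len(distinct) <= literal_max_distinct
                    or distinct_pct <= literal_max_distinct_pct)
    if longest <= literal_max_length and few_distinct:
        return "literal"

    # Otherwise the values are long and/or highly varied: treat as text.
    return "text"
```

With these defaults, a column of years is classified as an integer field, a column of color names as a literal field, and a column of long free-form descriptions as a text field.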
In some embodiments, the network service can determine to enable the option to include a data field in the search index to be generated, the determination being based at least in part on receiving a signal included in the data field, the signal indicating that the data field is to be included in the search index. The network service can also determine to enable the option to calculate a facet count for the data field, the determination being based at least in part on determining that the amount of at least one value associated with the data field exceeds a specified facet count lower threshold and is below a specified facet count upper threshold. The network service can further determine to enable providing the value associated with the data field in response to a relevant search query, the determination being based at least in part on receiving a signal included in the data field, the signal indicating that the value associated with the data field is to be provided.
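A minimal sketch of these three decisions follows. The name `choose_search_options`, the `signals` set, and the default thresholds are hypothetical. The sketch also folds in two alternatives that appear only in clauses A13 and A15 (enabling search for literal fields, and returning values shorter than a length threshold), and it interprets the facet test as a count of distinct values, which is an assumption.

```python
def choose_search_options(values, field_type, signals=frozenset(),
                          facet_min=2, facet_max=100, result_max_length=200):
    """Decide which search options to enable for one data field.

    `signals` holds option names flagged in the uploaded data (an assumed
    representation); thresholds are illustrative defaults only.
    """
    distinct = {str(v) for v in values}
    longest = max((len(s) for s in distinct), default=0)
    return {
        # Include in the index when signaled, or when the field is literal.
        "search": "search" in signals or field_type == "literal",
        # Facet only when the distinct-value count lies between the bounds.
        "facet": facet_min < len(distinct) < facet_max,
        # Return values when signaled, or when they are short enough.
        "result": "result" in signals or longest < result_max_length,
    }
```

The lower facet bound avoids faceting on fields with one or two values, where a facet adds little, and the upper bound avoids faceting on nearly unique fields such as identifiers.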
In some embodiments, one or more search queries (e.g., terms in the search queries) can be utilized by the network service. For example, the network service can infer from a search query that the searcher is faceting on a particular data field. Thus, for example, the network service can determine that the data field should be of the literal type.
In some embodiments, when a searcher enters query terms and requests a search, one or more search results are presented by relevance according to a particular rank expression (e.g., an ordering of results). The present disclosure can allow for creating more complex rank expressions that take into account other factors, such as query-independent factors (e.g., there can be a popularity data field included in the data). The present disclosure can also allow for suggesting usable rank expressions by examining the data and determining, for example, that a popularity data field is meaningful. For example, there can be a body text data field type, and its length (or, e.g., the inverse of its length) can be taken into account and can provide useful information for a rank expression.
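One way to picture such a rank expression is a weighted sum of the query-dependent relevance and the query-independent factors mentioned above. The sketch below is an assumption-laden illustration: the field names `popularity` and `body`, the weights, and the functions `rank_score` and `rank_results` are all hypothetical.

```python
def rank_score(text_relevance, document, popularity_weight=0.3,
               brevity_weight=0.1):
    """Combine query relevance with query-independent factors.

    The weights and the `popularity` / `body` field names are assumptions.
    """
    popularity = document.get("popularity", 0)
    body_length = len(document.get("body", ""))
    # Inverse of the body length, guarded against empty bodies.
    brevity = 1.0 / body_length if body_length else 0.0
    return (text_relevance
            + popularity_weight * popularity
            + brevity_weight * brevity)


def rank_results(scored_documents, **weights):
    """Order (relevance, document) pairs by descending rank score."""
    ranked = sorted(scored_documents,
                    key=lambda pair: rank_score(pair[0], pair[1], **weights),
                    reverse=True)
    return [doc for _, doc in ranked]
```

Under these weights a slightly less relevant but more popular document can outrank a more relevant, unpopular one, which is the kind of behavior a query-independent factor is meant to introduce.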
In some embodiments, the data field types can also include a geographic location type, a time type, a date type, or a floating-point type.
Various embodiments consistent with the present disclosure can also utilize sample data. For example, a user/uploader can first provide sample data to the network service. The network service can analyze the sample data to determine the types of, and search options for, the data fields. Based on the data field types and search options for the sample data, the network service can generate an index configuration, and subsequently generate a search index based on the generated index configuration.
Fig. 6 illustrates an exemplary electronic user device 600 that can be used in accordance with various embodiments. Although a portable computing device (e.g., an e-book reader or tablet computer) is shown, it should be understood that any electronic device capable of receiving, determining, and/or processing input can be used in accordance with the various embodiments discussed herein, where the devices can include, for example, desktop computers, notebook computers, personal digital assistants, smartphones, video game consoles, television set-top boxes, and portable media players. In some embodiments, the computing device 600 can be an analog device, such as a device that performs signal processing using operational amplifiers. In this example, the computing device 600 has a display screen 602 on the front side, which under normal operation displays information to a user facing the display screen (e.g., on the same side of the computing device as the display screen). The computing device in this example includes at least one camera 604 or other imaging element for capturing still or video image information over at least one field of view of the at least one camera. In some embodiments, the computing device might contain only one imaging element, and in other embodiments the computing device might contain several imaging elements. Each image capture element can be, for example, a camera, a charge-coupled device (CCD), a motion detection sensor, or an infrared sensor, among many other possibilities. If there are multiple image capture elements on the computing device, the image capture elements can be of different types. In some embodiments, at least one imaging element can include at least one wide-angle optical element, such as a fisheye lens, that enables the camera to capture images over a wide range of angles, such as 180 degrees or more. Further, each image capture element can comprise a digital still camera configured to capture subsequent frames in rapid succession, or a video camera able to capture streaming video.
The exemplary computing device 600 also includes at least one microphone 606 or other audio capture device capable of capturing audio data, such as words or commands spoken by a user of the device. In this example, the microphone 606 is placed on the same side of the device as the display screen 602, such that the microphone will typically be better able to capture words spoken by a user of the device. In at least some embodiments, the microphone can be a directional microphone that captures sound information from substantially directly in front of the microphone and picks up only a limited amount of sound from other directions. It should be understood that in various embodiments a microphone might be located on any appropriate surface of any region, face, or edge of the device, and that multiple microphones can be used for audio recording and filtering purposes, etc.
The exemplary computing device 600 also includes at least one orientation sensor 608, such as a position and/or movement-determining element. Such a sensor can include, for example, an accelerometer or gyroscope operable to detect an orientation and/or change in orientation of the computing device, as well as small movements of the device. An orientation sensor can also include an electronic or digital compass, which can indicate a direction (e.g., north or south) in which the device is determined to be pointing (e.g., with respect to a primary axis or other such aspect). An orientation sensor can also include or comprise a global positioning system (GPS) or similar positioning element operable to determine relative coordinates for a position of the computing device, as well as information about relatively large movements of the device. Various embodiments can include one or more such elements in any appropriate combination. As should be understood, the algorithms or mechanisms used for determining relative position, orientation, and/or movement can depend at least in part upon the selection of elements available to the device.
Fig. 7 illustrates a logical arrangement of a set of general components of an exemplary computing device 700, such as the device 600 described with respect to Fig. 6. In this example, the device includes a processor 702 for executing instructions that can be stored in a memory device or element 704. As would be apparent to one of ordinary skill in the art, the device can include many types of memory, data storage, or non-transitory computer-readable storage media, such as a first data storage for program instructions for execution by the processor 702, a separate storage for images or data, a removable memory for sharing information with other devices, etc. The device typically will include some type of display element 706, such as a touch screen or liquid crystal display (LCD), although devices such as portable media players might convey information via other means, such as through audio speakers. As discussed, the device in many embodiments will include at least one image capture element 708, such as a camera or infrared sensor, that is able to image projected images or other objects in the vicinity of the device. Methods for capturing images or video using a camera element with a computing device are well known in the art and will not be discussed herein in detail. It should be understood that image capture can be performed using a single image, multiple images, periodic imaging, continuous image capturing, image streaming, etc. Further, a device can include the ability to start and/or stop image capture, such as when receiving a command from a user, application, or other device. The exemplary device similarly includes at least one audio capture component 712, such as a mono or stereo microphone or microphone array, operable to capture audio information from at least one primary direction. A microphone can be a uni-directional or omni-directional microphone as known for such devices.
In some embodiments, the computing device 700 of Fig. 7 can include one or more communication elements (not shown), such as a Wi-Fi, Bluetooth, RF, wired, or wireless communication system. The device in many embodiments can communicate with a network, such as the Internet, and may be able to communicate with other such devices. In some embodiments, the device can include at least one additional input device able to receive conventional input from a user. This conventional input can include, for example, a push button, touch pad, touch screen, wheel, joystick, keyboard, mouse, keypad, or any other such device or element whereby a user can input a command to the device. In some embodiments, however, such a device might not include any buttons at all, and might be controlled only through a combination of visual and audio commands, such that a user can control the device without having to be in contact with the device.
The device 700 can also include at least one orientation or motion sensor 710. As discussed, such a sensor can include an accelerometer or gyroscope operable to detect an orientation and/or change in orientation, or an electronic or digital compass, which can indicate a direction in which the device is determined to be facing. The mechanism(s) can also (or alternatively) include or comprise a global positioning system (GPS) or similar positioning element operable to determine relative coordinates for a position of the computing device, as well as information about relatively large movements of the device. The device can include other elements as well, such as elements that enable location determination through triangulation or another such approach. These mechanisms can communicate with the processor 702, whereby the device can perform any of a number of actions described or suggested herein.
As an example, a computing device such as that described with respect to Fig. 6 can capture and/or track various information for a user over time. This information can include any appropriate information, such as location, actions (e.g., sending a message or creating a document), user behavior (e.g., how often a user performs a task, the amount of time a user spends on a task, the ways in which a user navigates through an interface, etc.), user preferences (e.g., how a user likes to receive information), open applications, submitted requests, received calls, and the like. As discussed above, the information can be stored in such a way that the information is linked or otherwise associated, whereby a user can access the information using any appropriate dimension or group of dimensions.
As discussed, different approaches can be implemented in various environments in accordance with the described embodiments. For example, Fig. 8 illustrates an example of an environment 800 for implementing aspects in accordance with various embodiments. As will be appreciated, although a networked environment is used for purposes of explanation, different environments can be used, as appropriate, to implement various embodiments. The system includes an electronic client device 802, which can include any appropriate device operable to send and receive requests, messages, or information over an appropriate network 804 and convey information back to a user of the device. Examples of such client devices include personal computers, cell phones, handheld messaging devices, laptop computers, set-top boxes, personal digital assistants, e-book readers, and the like. The network can include any appropriate network, including an intranet, the Internet, a cellular network, a local area network, or any other such network or combination thereof. Components used for such a system can depend at least in part upon the type of network and/or environment selected. Protocols and components for communicating via such a network are well known and will not be discussed herein in detail. Communication over the network can be enabled via wired or wireless connections and combinations thereof. In this example, the network includes the Internet, as the environment includes a Web server 806 for receiving requests and serving content in response thereto, although for other networks an alternative device serving a similar purpose could be used, as would be apparent to one of ordinary skill in the art.
The illustrated environment includes at least one application server 808 and a data store 810. It should be understood that there can be several application servers, layers, or other elements, processes, or components, which can be chained or otherwise configured, and which can interact to perform tasks such as obtaining data from an appropriate data store. As used herein, the term "data store" refers to any device or combination of devices capable of storing, accessing, and retrieving data, which can include any combination and number of data servers, databases, data storage devices, and data storage media. The application server can include any appropriate hardware and software for integrating with the data store as needed to execute aspects of one or more applications for the client device and for handling a majority of the data access and business logic for an application. The application server provides access control services in cooperation with the data store, and is able to generate content such as text, graphics, audio, and/or video to be transferred to the user, which can be served to the user by the Web server in the form of HTML, XML, or another appropriate structured language in this example. The handling of all requests and responses, as well as the delivery of content between the client device 802 and the application server 808, can be handled by the Web server 806. It should be understood that the Web and application servers are not required and are merely example components, as structured code discussed herein can be executed on any appropriate device or host machine as discussed elsewhere herein.
The data store 810 can include several separate data tables, databases, or other data storage mechanisms and media for storing data relating to a particular aspect. For example, the data store illustrated includes mechanisms for storing production data 812 and user information 816, which can be used to serve content for the production side. The data store is also shown to include a mechanism for storing log or session data 814. It should be understood that there can be many other aspects that may need to be stored in the data store, such as page image information and access rights information, which can be stored in any of the above-listed mechanisms as appropriate or in additional mechanisms in the data store 810. The data store 810 is operable, through logic associated therewith, to receive instructions from the application server 808 and to obtain, update, or otherwise process data in response thereto. In one example, a user might submit a search request for a certain type of item. In this case, the data store might access the user information to verify the identity of the user, and can access the catalog detail information to obtain information about items of that type. The information can then be returned to the user, such as in a results listing on a Web page that the user is able to view via a browser on the user device 802. Information for a particular item of interest can be viewed in a dedicated page or window of the browser.
Each server typically will include an operating system that provides executable program instructions for the general administration and operation of that server, and typically will include a computer-readable medium storing instructions that, when executed by a processor of the server, allow the server to perform its intended functions. Suitable implementations for the operating system and general functionality of the servers are known or commercially available, and are readily implemented by persons having ordinary skill in the art, particularly in light of the disclosure herein.
The environment in one embodiment is a distributed computing environment utilizing several computer systems and components that are interconnected via communication links, using one or more computer networks or direct connections. However, it will be appreciated by those of ordinary skill in the art that such a system could operate equally well in a system having fewer or a greater number of components than are illustrated in Fig. 8. Thus, the depiction of the system 800 in Fig. 8 should be taken as being illustrative in nature and not limiting to the scope of the disclosure.
As discussed above, the various embodiments can be implemented in a wide variety of operating environments, which in some cases can include one or more user computers, computing devices, or processing devices that can be used to operate any of a number of applications. User or client devices can include any of a number of general-purpose personal computers, such as desktop or notebook computers running a standard operating system, as well as cellular, wireless, and handheld devices running mobile software and capable of supporting a number of networking and messaging protocols. Such a system can also include a number of workstations running any of a variety of commercially available operating systems and other known applications for purposes such as development and database management. These devices can also include other electronic devices, such as dummy terminals, thin clients, gaming systems, and other devices capable of communicating via a network.
Various aspects can also be implemented as part of at least one service or Web service, such as may be part of a service-oriented architecture. Services such as Web services can communicate using any appropriate type of messaging, such as by using messages in extensible markup language (XML) format and exchanged using an appropriate protocol such as SOAP (derived from the "Simple Object Access Protocol"). Processes provided or executed by such services can be written in any appropriate language, such as the Web Services Description Language (WSDL). Using a language such as WSDL allows for functionality such as the automated generation of client-side code in various SOAP frameworks.
Most embodiments utilize at least one network that would be familiar to those skilled in the art for supporting communications using any of a variety of commercially available protocols, such as TCP/IP, OSI, FTP, UPnP, NFS, CIFS, and AppleTalk. The network can be, for example, a local area network, a wide-area network, a virtual private network, the Internet, an intranet, an extranet, a public switched telephone network, an infrared network, a wireless network, and any combination thereof.
In embodiments utilizing a Web server, the Web server can run any of a variety of server or mid-tier applications, including HTTP servers, FTP servers, CGI servers, data servers, Java servers, and business application servers. The server(s) can also be capable of executing programs or scripts in response to requests from user devices, such as by executing one or more Web applications that can be implemented as one or more scripts or programs written in any programming language (such as C, C#, or C++) or any scripting language (such as Perl, Python, or TCL), as well as combinations thereof. The server(s) can also include database servers, including without limitation those commercially available from
The environment can include a variety of data stores and other memory and storage media as discussed above. These can reside in a variety of locations, such as on a storage medium local to (and/or resident in) one or more of the computers, or remote from any or all of the computers across the network. In a particular set of embodiments, the information can reside in a storage area network ("SAN") familiar to those skilled in the art. Similarly, any necessary files for performing the functions attributed to the computers can be stored locally and/or remotely, as appropriate. Where a system includes computerized devices, each such device can include hardware elements that can be electrically coupled via a bus, the elements including, for example, at least one central processing unit (CPU), at least one input device (e.g., a mouse, keyboard, controller, touch screen, or keypad), and at least one output device (e.g., a display device, printer, or speaker). Such a system can also include one or more storage devices, such as disk drives, optical storage devices, and solid-state storage devices such as random access memory ("RAM") or read-only memory ("ROM"), as well as removable media devices, memory cards, flash cards, etc.
Such devices can also include a computer-readable storage media reader, a communications device (e.g., a modem, a network card (wireless or wired), an infrared communication device, etc.), and working memory, as described above. The computer-readable storage media reader can be connected with, or configured to receive, a computer-readable storage medium, representing remote, local, fixed, and/or removable storage devices as well as storage media for temporarily and/or more permanently containing, storing, transmitting, and retrieving computer-readable information. The system and various devices also typically will include a number of software applications, modules, services, or other elements located within at least one working memory device, including an operating system and application programs, such as a client application or Web browser. It should be appreciated that alternate embodiments can have numerous variations from those described above. For example, customized hardware might also be used, and/or particular elements might be implemented in hardware, software (including portable software, such as applets), or both. Further, connection to other computing devices, such as network input/output devices, can be employed.
Storage media and computer-readable media for containing code, or portions of code, can include any appropriate media known or used in the art, including storage media and communication media, such as, but not limited to, volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage and/or transmission of information such as computer-readable instructions, data structures, program modules, or other data, including RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store the desired information and that can be accessed by a system device. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the various embodiments.
The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. It will, however, be evident that various modifications and changes can be made thereto without departing from the broader spirit and scope of the invention as set forth in the claims.
Various embodiments of the present disclosure can be described in view of the following clauses:
A1. A computer-implemented method for index configuration for searchable data in a networked environment, comprising:
Receiving data to be indexed, the data including multiple data fields;
Determining a name associated with each data field of the multiple data fields;
Determining a field type, from among multiple field types, associated with each data field of the multiple data fields, the multiple field types including at least one of an integer type, a literal type, or a text type;
Determining whether to enable one or more search options for each of the data fields, the one or more search options including at least one of: an option to include the respective data field in a search index to be generated; an option to calculate a facet count for the respective data field; or an option to provide one or more values associated with the respective data field;
Generating a search index configuration for the data based at least in part on the field type of each data field included in the data and the determination of whether to enable the one or more search options; and
Generating a search index for the data based at least in part on the search index configuration for the data.
A2. The computer-implemented method of clause A1, wherein the data is in a first format, the method further comprising:
Converting the data from the first format to a second format, the second format being compatible with the search index; and
Storing the data converted into the second format in one or more storage allocations.
A3. The computer-implemented method of clause A2, wherein converting the data from the first format to the second format comprises:
Comparing the first format with the second format; and
Modifying at least one data field associated with the first format to correspond to at least one data field associated with the second format.
A4. The computer-implemented method of clause A2, wherein the second format is Search Data Format (SDF).
A5. A computer-implemented method, comprising:
Receiving data to be indexed;
Determining a type for a data field associated with the data, the type of the data field being determined from among multiple data field types;
Determining one or more search options to be enabled for the data field associated with the data;
Generating an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
Generating a search index for the data based at least in part on the index configuration for the data.
A6. The computer-implemented method of clause A5, wherein the data is in a first format, the method further comprising:
Converting the data from the first format to a second format, the second format being compatible with the search index; and
Storing the data converted into the second format in one or more storage allocations.
A7. The computer-implemented method of clause A6, wherein converting the data from the first format to the second format comprises:
Comparing the first format with the second format; and
Modifying at least one data tag associated with the first format to correspond to at least one data tag associated with the second format.
A8. The computer-implemented method of clause A5, wherein the multiple data field types include at least one of an integer type, a text type, a literal type, a geographic location type, a time type, a date type, or a floating-point type.
A9. The computer-implemented method of clause A8, wherein determining the type of the data field comprises:
Determining that a value associated with the data field has an amount of integer characters above a specified integer amount threshold; and
Determining that the type of the data field is the integer type.
A10. The computer-implemented method of clause A8, wherein determining the type of the data field comprises:
Determining at least one of the following: a value associated with the data field has an amount of alphanumeric characters above a specified text amount threshold; the number of distinct values associated with the data field is above a specified text distinct-count threshold; the percentage of distinct values is above a specified text distinct-percentage threshold; or the length of the value is above a specified text length threshold; and
Determining that the type of the data field is the text type.
A11. The computer-implemented method of clause A8, wherein determining the type of the data field comprises:
Determining at least one of the following: a value associated with the data field has an amount of alphanumeric characters above a specified literal amount lower threshold but below a specified literal amount upper threshold; the number of distinct values associated with the data field is below a specified literal distinct-count threshold; the percentage of distinct values is below a specified literal distinct-percentage threshold; or the length of the value is below a specified literal length threshold; and
Determining that the type of the data field is the literal type.
A12. The computer-implemented method of clause A5, wherein the one or more search options include at least one of: an option to include the data field in the search index to be generated; an option to calculate a facet count for the data field; or an option to provide a value associated with the data field in response to a relevant search query.
A13. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises determining to allow the option to include the data field in the search index to be generated, the determination being based at least in part on at least one of: receiving a signal included with the data field, the signal indicating that the data field is to be included in the search index; or determining that the type of the data field is the literal type.
A14. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises determining to allow the option to calculate the facet count for the data field, the determination being based at least in part on determining that a number of distinct values associated with the data field is below a specified facet count upper limit.
A15. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises determining to allow the option to provide the value associated with the data field in response to the relevant search query, the determination being based at least in part on at least one of: receiving a signal included with the data field, the signal indicating that the value associated with the data field is to be provided; or determining that a length of the value associated with the data field is below a specified return value length threshold.
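The option decisions in clauses A13 to A15 can be sketched as a single function that returns which search options are allowed for a field. The function name, the option keys, and the default limits are hypothetical, and the facet check is read here as a limit on the number of distinct values, which is an assumption.

```python
def choose_search_options(field_type, values, *, index_signal=False,
                          return_signal=False, facet_max_distinct=100,
                          return_max_length=200):
    """Decide which search options to allow for one data field.

    index_signal / return_signal stand in for the signals that may be
    included with the data field; both limits are hypothetical defaults.
    """
    distinct = len(set(values))
    longest = max((len(str(v)) for v in values), default=0)
    return {
        # A13: explicit signal, or the field was typed as literal.
        "search": index_signal or field_type == "literal",
        # A14: facet counts only when few distinct values exist.
        "facet": distinct < facet_max_distinct,
        # A15: explicit signal, or the values are short enough to return.
        "return": return_signal or longest < return_max_length,
    }
```

For example, a literal field with a handful of short values would have all three options enabled, whereas an integer field with hundreds of distinct values would have faceting disabled unless the limit is raised.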
A16. The computer-implemented method of clause A5, further comprising:
Providing at least one of the data, the index configuration, or the index to be searchable by one or more search queries.
A17. The computer-implemented method of clause A5, further comprising:
Modifying the index configuration based at least in part on one or more user-initiated inputs.
A18. A system, comprising:
At least one communication transceiver;
One or more storage allocations;
At least one processor; and
A memory device including instructions that, when executed by the at least one processor, cause the system to:
Receive, via the at least one communication transceiver, data to be indexed;
Determine a type of a data field associated with the data, the type of the data field being determined from a plurality of data field types;
Determine one or more search options to be allowed with respect to the data field associated with the data;
Generate an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
Generate a search index for the data based at least in part on the index configuration for the data.
A19. The system of clause A18, wherein the data is in a first format, and wherein the instructions further cause the system to:
Convert the data from the first format into a second format, the second format being compatible with the search index; and
Store the data converted into the second format in the one or more storage allocations.
A20. The system of clause A19, wherein the instructions cause the system to convert the data from the first format into the second format based on: comparing the first format with the second format; and modifying at least one data field associated with the first format to correspond to at least one data field associated with the second format.
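The conversion in clauses A19 and A20 can be pictured as a field-by-field mapping from the first format to the second. The sketch below assumes that both formats are flat key-value records and that the comparison step has already produced a field mapping; `convert_record`, `field_map`, and `target_fields` are hypothetical names.

```python
def convert_record(record, field_map, target_fields):
    """Rename fields of a first-format record to their second-format names.

    field_map maps first-format names to second-format names (assumed to be
    the result of comparing the two formats); fields with no counterpart in
    target_fields are dropped.
    """
    converted = {}
    for name, value in record.items():
        target = field_map.get(name, name)  # unmapped names are kept as-is
        if target in target_fields:
            converted[target] = value
    return converted
```

A record such as `{"Title": "x", "price_usd": 3}` with a mapping to `title` and `price` would then be stored under the second-format field names.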
A21. A non-transitory computer-readable storage medium including instructions for identifying elements, the instructions, when executed by a processor of a computing system, causing the computing system to:
Receive data to be indexed;
Determine a type of a data field associated with the data, the type of the data field being determined from a plurality of data field types;
Determine one or more search options to be allowed with respect to the data field associated with the data;
Generate an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
Generate a search index for the data based at least in part on the index configuration for the data.
A22. The non-transitory computer-readable storage medium of clause A21, wherein the plurality of data field types includes at least one of an integer type, a text type, a literal type, a geolocation type, a time type, a data type, or a floating-point type.
A23. The non-transitory computer-readable storage medium of clause A22, wherein the instructions cause the computing system to determine that the type of the data field is the literal type based on determining at least one of the following: a value associated with the data field has a number of alphanumeric characters above a specified literal count lower limit but below a specified literal count upper limit; a number of distinct values associated with the data field is below a specified literal distinct-count threshold; a percentage of distinct values is below a specified literal distinct-percentage threshold; or a length of the value is below a specified literal length threshold.
A24. The non-transitory computer-readable storage medium of clause A21, wherein the one or more search options include at least one of: an option to include the data field in the search index to be generated; an option to calculate a facet count for the data field; or an option to provide a value associated with the data field in response to a relevant search query.
A25. The non-transitory computer-readable storage medium of clause A24, wherein determining the one or more search options to be allowed comprises determining to allow the option to calculate the facet count for the data field, the determination being based at least in part on determining that a number of values associated with the data field is above a specified facet count lower limit and below a specified facet count upper limit.
B1. A computer-implemented method for dynamic search partitioning, comprising:
Monitoring at least one of an amount of data stored, or a rate at which data is served, on a first partition provided by a network service, the first partition being included in a storage allocation provided by the network service;
Detecting that the at least one of the amount or the rate exceeds a specified amount threshold or a specified rate threshold, respectively;
In response to the detecting, performing at least one of increasing a size of the first partition or adding at least a second partition to the storage allocation, the at least one of the increasing or the adding being based at least in part on the amount of data stored or the rate at which data is served;
During the at least one of the increasing or the adding, directing network traffic associated with the storage allocation to a cache provided by the network service; and
When the at least one of the increasing or the adding is complete, directing the network traffic to the storage allocation.
B2. The computer-implemented method of clause B1, further comprising:
Monitoring a search index for the storage allocation;
Detecting that a size of the search index exceeds a specified index size threshold; and
Updating the search index for the storage allocation to reflect the at least one of the increasing or the adding with respect to the storage allocation.
B3. The computer-implemented method of clause B1, wherein the increasing of the size of the first partition is performed if the size of the first partition is below a maximum partition size threshold, and wherein the adding of at least the second partition is performed if the size of the first partition is at the maximum partition size threshold.
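The flow of clauses B1 and B3 can be sketched as follows: when either threshold is exceeded, traffic is pointed at a cache, the first partition is grown or a second partition is added, and traffic is pointed back at the storage allocation. All class names, the doubling growth step, and the units are hypothetical; a real network service would perform these steps asynchronously.

```python
from dataclasses import dataclass, field


@dataclass
class Partition:
    size_gb: int


@dataclass
class Allocation:
    partitions: list
    traffic_target: str = "allocation"       # either "allocation" or "cache"
    log: list = field(default_factory=list)  # records each step for inspection


def rescale_if_needed(alloc, stored_gb, rate_qps, *, amount_threshold_gb,
                      rate_threshold_qps, max_partition_gb):
    """Grow or extend the allocation when a threshold is exceeded (B1, B3)."""
    if stored_gb <= amount_threshold_gb and rate_qps <= rate_threshold_qps:
        return False  # neither threshold exceeded, nothing to do

    alloc.traffic_target = "cache"           # serve from cache while resizing
    alloc.log.append("traffic->cache")
    try:
        first = alloc.partitions[0]
        if first.size_gb < max_partition_gb:
            # Assumed growth policy: double the size, capped at the maximum.
            first.size_gb = min(first.size_gb * 2, max_partition_gb)
            alloc.log.append("grew partition")
        else:
            alloc.partitions.append(Partition(size_gb=first.size_gb))
            alloc.log.append("added partition")
    finally:
        alloc.traffic_target = "allocation"  # restore traffic when done
        alloc.log.append("traffic->allocation")
    return True
```

The `try`/`finally` is a design choice of this sketch so that traffic returns to the storage allocation even if the resize step fails.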
B4. A computer-implemented method, comprising:
Monitoring data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions that includes at least one partition;
Determining whether the data usage for the at least one partition included in the storage allocation exceeds a specified threshold;
Modifying at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
Directing network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modifying of the at least one of the size or the number; and
When the modifying is complete, directing the network traffic to the portion of the storage allocation associated with the modifying.
B5. The computer-implemented method of clause B4, further comprising:
Detecting that a size of a search index for the storage allocation exceeds a specified index size threshold; and
Updating the search index for the storage allocation based on the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B6. The computer-implemented method of clause B5, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B7. The computer-implemented method of clause B4, wherein the data usage includes at least one of an amount of data stored in the storage allocation or a rate at which data is served from the storage allocation.
B8. The computer-implemented method of clause B7, wherein the specified threshold includes at least one of a specified amount threshold or a specified rate threshold, and wherein the data usage exceeds the specified threshold when at least one of the amount of data stored exceeds the specified amount threshold or the rate at which data is served exceeds the specified rate threshold.
B9. The computer-implemented method of clause B8, wherein the specified threshold is calculated based at least in part on information about historical data usage.
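Clause B9 does not say how the threshold is derived from historical data usage. One possible approach, shown here purely as an assumption, sets the threshold a fixed number of standard deviations above the historical mean; the function name and the factor `k` are hypothetical.

```python
from statistics import mean, pstdev


def threshold_from_history(samples, k=2.0):
    """Return mean + k * population standard deviation of past usage samples.

    This rule is an illustrative assumption, not a formula from the clauses.
    """
    if not samples:
        raise ValueError("at least one historical sample is required")
    return mean(samples) + k * pstdev(samples)
```

With a flat history the threshold equals the observed usage level, and it rises as past usage becomes more variable.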
B10. The computer-implemented method of clause B4, further comprising:
Determining that an amount of network traffic directed to the storage allocation is above a specified traffic threshold; and
Modifying the storage allocation based on the amount of network traffic.
B11. The computer-implemented method of clause B10, wherein the network traffic includes search query traffic for searching data stored in the storage allocation.
B12. The computer-implemented method of clause B10, wherein modifying the storage allocation based on the amount of network traffic comprises at least one of: modifying the size of the at least one partition; modifying the number of partitions; or replacing at least one partition included in the number of partitions with at least one partition having a different size.
B13. The computer-implemented method of clause B12, wherein the different size includes at least one of a different CPU power, a different RAM capacity, a different hard disk space capacity, or a different bandwidth capacity.
B14. The computer-implemented method of clause B4, wherein modifying the at least one of the size of the at least one partition or the number of partitions comprises increasing at least one of the size of the at least one partition or the number of partitions, wherein the increasing of the size of the at least one partition is performed if the size of the at least one partition is below a maximum partition size threshold, and wherein the increasing of the number of partitions is performed if the size of the at least one partition is at the maximum partition size threshold.
B15. The computer-implemented method of clause B4, wherein modifying the at least one of the size of the at least one partition or the number of partitions comprises decreasing at least one of the size of the at least one partition or the number of partitions, wherein the decreasing of the number of partitions is performed if the number of partitions is greater than one partition, and wherein the decreasing of the size of the at least one partition is performed if the number of partitions is one partition.
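The scale-up and scale-down rules of clauses B14 and B15 reduce to a small decision function. The sketch below only selects the action; the function name, the `direction` argument, and the returned labels are hypothetical.

```python
def plan_resize(direction, partition_size, partition_count, max_partition_size):
    """Pick the resize action described in clauses B14 (grow) and B15 (shrink)."""
    if direction == "grow":
        # Grow the partition until it reaches the maximum, then add partitions.
        if partition_size < max_partition_size:
            return "increase_size"
        return "add_partition"
    if direction == "shrink":
        # Remove partitions first; shrink the last remaining partition.
        if partition_count > 1:
            return "remove_partition"
        return "decrease_size"
    raise ValueError(f"unknown direction: {direction!r}")
```

Growing therefore prefers a larger partition over more partitions, while shrinking prefers fewer partitions over a smaller one.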
B16. The computer-implemented method of clause B4, further comprising:
Determining CPU usage of the storage allocation, wherein the modifying of the at least one of the size or the number is based on at least one of the data usage in the storage allocation or the determined CPU usage of the storage allocation.
B17. The computer-implemented method of clause B4, further comprising:
Modifying a configuration of the storage allocation based on at least one of a configuration associated with the data usage or a user-initiated input.
B18. The computer-implemented method of clause B4, further comprising:
Determining when to perform the modifying of the at least one of the size or the number based on resources available to the storage allocation.
B19. A system, comprising:
A storage allocation having a number of partitions that includes at least one partition;
At least one processor; and
A memory device including instructions that, when executed by the at least one processor, cause the system to:
Monitor data usage in the storage allocation;
Determine whether the data usage for the at least one partition included in the storage allocation exceeds a specified threshold;
Modify at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
Direct network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modifying of the at least one of the size or the number; and
When the modifying is complete, direct the network traffic to the portion of the storage allocation associated with the modifying.
B20. The system of clause B19, further comprising:
At least one load balancer configured to facilitate directing the network traffic away from the portion of the storage allocation during the modifying of the at least one of the size or the number, and to facilitate directing the network traffic to the portion of the storage allocation when the modifying of the at least one of the size or the number is complete.
B21. The system of clause B20, wherein the at least one load balancer is configured to direct the network traffic across the number of partitions included in the storage allocation.
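The load balancer behavior in clauses B20 and B21 can be sketched as a round-robin router that skips any partition currently being modified. The class and method names are hypothetical, and round-robin is an assumed policy; the clauses do not specify how traffic is spread across partitions.

```python
class LoadBalancer:
    """Round-robin router that avoids partitions under modification."""

    def __init__(self, partition_ids):
        self._ids = list(partition_ids)
        self._draining = set()  # partitions currently being resized or replaced
        self._next = 0          # running counter used for round-robin selection

    def begin_modification(self, partition_id):
        """Direct traffic away from a partition while it is modified."""
        self._draining.add(partition_id)

    def end_modification(self, partition_id):
        """Direct traffic back to a partition once modification completes."""
        self._draining.discard(partition_id)

    def route(self):
        """Return the partition that should receive the next request."""
        available = [p for p in self._ids if p not in self._draining]
        if not available:
            raise RuntimeError("no partition available")
        chosen = available[self._next % len(available)]
        self._next += 1
        return chosen
```

A caller would wrap each resize between `begin_modification` and `end_modification`, so requests continue to reach the remaining partitions in the meantime.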
B22. The system of clause B19, further comprising:
At least one monitor module configured to facilitate monitoring the data usage in the storage allocation, and to facilitate determining whether the data usage for the at least one partition included in the storage allocation exceeds the specified threshold.
B23. A non-transitory computer-readable storage medium including instructions for identifying elements, the instructions, when executed by a processor of a computing system, causing the computing system to:
Monitor data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions that includes at least one partition;
Determine whether the data usage for the at least one partition included in the storage allocation exceeds a specified threshold;
Modify at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
Direct network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modifying of the at least one of the size or the number; and
When the modifying is complete, direct the network traffic to the portion of the storage allocation associated with the modifying.
B24. The non-transitory computer-readable storage medium of clause B23, wherein the instructions further cause the computing system to: detect that a size of a search index for the storage allocation exceeds a specified index size threshold; and update the search index for the storage allocation based on the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B25. The non-transitory computer-readable storage medium of clause B24, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.

Claims (15)

1. A computer-implemented method, comprising:
Monitoring data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions that includes at least one partition;
Determining whether the data usage for the at least one partition included in the storage allocation exceeds a specified threshold;
Modifying at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
Directing network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modifying of the at least one of the size or the number; and
When the modifying is complete, directing the network traffic to the portion of the storage allocation associated with the modifying.
2. The computer-implemented method of claim 1, further comprising:
Detecting that a size of a search index for the storage allocation exceeds a specified index size threshold; and
Updating the search index for the storage allocation based on the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
3. The computer-implemented method of claim 2, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modifying of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
4. The computer-implemented method of claim 1, wherein the data usage includes at least one of an amount of data stored in the storage allocation or a rate at which data is served from the storage allocation.
5. The computer-implemented method of claim 4, wherein the specified threshold includes at least one of a specified amount threshold or a specified rate threshold, and wherein the data usage exceeds the specified threshold when at least one of the amount of data stored exceeds the specified amount threshold or the rate at which data is served exceeds the specified rate threshold.
6. The computer-implemented method of claim 5, wherein the specified threshold is calculated based at least in part on information about historical data usage.
7. The computer-implemented method of claim 1, further comprising:
Determining that an amount of network traffic directed to the storage allocation is above a specified traffic threshold; and
Modifying the storage allocation based on the amount of network traffic.
8. The computer-implemented method of claim 7, wherein the network traffic includes search query traffic for searching data stored in the storage allocation.
9. The computer-implemented method of claim 7, wherein modifying the storage allocation based on the amount of network traffic comprises at least one of: modifying the size of the at least one partition; modifying the number of partitions; or replacing at least one partition included in the number of partitions with at least one partition having a different size.
10. The computer-implemented method of claim 9, wherein the different size includes at least one of a different CPU power, a different RAM capacity, a different hard disk space capacity, or a different bandwidth capacity.
11. The computer-implemented method of claim 1, wherein modifying the at least one of the size of the at least one partition or the number of partitions comprises increasing at least one of the size of the at least one partition or the number of partitions, wherein the increasing of the size of the at least one partition is performed if the size of the at least one partition is below a maximum partition size threshold, and wherein the increasing of the number of partitions is performed if the size of the at least one partition is at the maximum partition size threshold.
12. The computer-implemented method of claim 1, wherein modifying the at least one of the size of the at least one partition or the number of partitions comprises decreasing at least one of the size of the at least one partition or the number of partitions, wherein the decreasing of the number of partitions is performed if the number of partitions is greater than one partition, and wherein the decreasing of the size of the at least one partition is performed if the number of partitions is one partition.
13. The computer-implemented method of claim 1, further comprising:
Determining CPU usage of the storage allocation, wherein the modifying of the at least one of the size or the number is based on at least one of the data usage in the storage allocation or the determined CPU usage of the storage allocation.
14. The computer-implemented method of claim 1, further comprising:
Modifying a configuration of the storage allocation based on at least one of a configuration associated with the data usage or a user-initiated input.
15. The computer-implemented method of claim 1, further comprising:
Determining when to perform the modifying of the at least one of the size or the number based on resources available to the storage allocation.
CN201380053433.7A 2012-10-12 2013-10-12 Index configuration for searchable data in network Active CN104823169B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811424497.4A CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system, and medium for index configuration for searchable data in network

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US13/650,931 US9507750B2 (en) 2012-10-12 2012-10-12 Dynamic search partitioning
US13/650,931 2012-10-12
US13/650,961 US9047326B2 (en) 2012-10-12 2012-10-12 Index configuration for searchable data in network
US13/650,961 2012-10-12
PCT/US2013/064731 WO2014059394A1 (en) 2012-10-12 2013-10-12 Index configuration for searchable data in network

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201811424497.4A Division CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system, and medium for index configuration for searchable data in network

Publications (2)

Publication Number Publication Date
CN104823169A true CN104823169A (en) 2015-08-05
CN104823169B CN104823169B (en) 2018-12-21

Family

ID=50477970

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201811424497.4A Pending CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system, and medium for index configuration for searchable data in network
CN201380053433.7A Active CN104823169B (en) 2012-10-12 2013-10-12 Index configuration for searchable data in network

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201811424497.4A Pending CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system, and medium for index configuration for searchable data in network

Country Status (10)

Country Link
EP (1) EP2907034A4 (en)
JP (2) JP2015532493A (en)
KR (2) KR101782302B1 (en)
CN (2) CN110096502A (en)
AU (3) AU2013328901B2 (en)
BR (1) BR112015008146A2 (en)
CA (1) CA2888116C (en)
IN (1) IN2015DN03160A (en)
SG (2) SG10201606363SA (en)
WO (1) WO2014059394A1 (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105979016A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Local area network data service system
CN105979015A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data service platform based on local area network
CN105979014A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data system based on local area network
CN105978739A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data platform based on local area network
CN105978913A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network service system
CN106060082A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Local area network-based network service platform with data monitoring function
CN106060083A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service system with data monitoring function
CN106060081A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service platform with data monitor function
CN106101024A (en) * 2016-07-16 2016-11-09 柳州健科技有限公司 LAN data system with data monitoring function
CN106131192A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network system with data monitoring function
CN106131190A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network platform with data monitoring function
CN106131193A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN service platform with self-learning function
CN106131191A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN data service system with data monitoring function
CN106131194A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN platform with self-learning function
CN106131196A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network system with self-learning function
CN106131195A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN system with data monitoring function
CN106131188A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 LAN system
CN106131189A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 LAN-based network platform
US9507750B2 (en) 2012-10-12 2016-11-29 A9.Com, Inc. Dynamic search partitioning
CN107977381A (en) * 2016-10-24 2018-05-01 华为技术有限公司 Data configuration method, index managing method, relevant apparatus and computing device
CN108881147A (en) * 2017-12-29 2018-11-23 北京视联动力国际信息技术有限公司 A kind of data processing method and device of view networking
CN110019191A (en) * 2017-09-21 2019-07-16 阿里巴巴集团控股有限公司 Database information processing method and processing device
CN110134661A (en) * 2019-05-22 2019-08-16 东北大学 A kind of academic big data storage querying method towards facet

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9047326B2 (en) 2012-10-12 2015-06-02 A9.Com, Inc. Index configuration for searchable data in network
CN112306604B (en) * 2020-08-21 2022-09-23 海信视像科技股份有限公司 Progress display method and display device for file transmission
US11658917B2 (en) 2021-04-09 2023-05-23 Tekion Corp Selective offloading of bandwidth to enable large-scale data indexing
CN117596176B (en) * 2024-01-17 2024-04-19 苏州元脑智能科技有限公司 Network state measuring method, device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788233B1 (en) * 2007-07-05 2010-08-31 Amazon Technologies, Inc. Data store replication for entity based partition
US20110225165A1 (en) * 2010-03-12 2011-09-15 Salesforce.Com Method and system for partitioning search indexes

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1143349A1 (en) * 2000-04-07 2001-10-10 IconParc GmbH Method and apparatus for generating index data for search engines
US7716168B2 (en) * 2005-06-29 2010-05-11 Microsoft Corporation Modifying table definitions within a database application
US8341345B2 (en) * 2005-08-08 2012-12-25 International Business Machines Corporation System and method for providing content based anticipative storage management
US7668825B2 (en) * 2005-08-26 2010-02-23 Convera Corporation Search system and method
JP4772569B2 (en) * 2006-04-07 2011-09-14 株式会社日立製作所 System and method for performing directory unit migration in a common namespace
US8214345B2 (en) * 2006-10-05 2012-07-03 International Business Machines Corporation Custom constraints for faceted exploration
JP5218060B2 (en) * 2006-10-06 2013-06-26 日本電気株式会社 Information retrieval system, information retrieval method and program
US8266173B1 (en) * 2007-05-21 2012-09-11 Amazon Technologies, Inc. Search results generation and sorting
US20100011368A1 (en) * 2008-07-09 2010-01-14 Hiroshi Arakawa Methods, systems and programs for partitioned storage resources and services in dynamically reorganized storage platforms
JP4762289B2 (en) * 2008-10-01 2011-08-31 株式会社日立製作所 A storage system that controls allocation of storage areas to virtual volumes that store specific pattern data
US9996572B2 (en) * 2008-10-24 2018-06-12 Microsoft Technology Licensing, Llc Partition management in a partitioned, scalable, and available structured storage
EP2396717A1 (en) * 2009-02-11 2011-12-21 Infinidat Ltd Virtualized storage system and method of operating it
US8250026B2 (en) * 2009-03-06 2012-08-21 Peoplechart Corporation Combining medical information captured in structured and unstructured data formats for use or display in a user application, interface, or view
US20110131202A1 (en) * 2009-12-02 2011-06-02 International Business Machines Corporation Exploration of item consumption by customers
JPWO2011118427A1 (en) * 2010-03-24 2013-07-04 日本電気株式会社 Query device, query partitioning method, and query partitioning program
US8190593B1 (en) * 2010-04-14 2012-05-29 A9.Com, Inc. Dynamic request throttling
CN102959522B (en) * 2010-08-10 2016-01-13 株式会社日立制作所 The management method of computer system and management system
WO2012072879A1 (en) * 2010-11-30 2012-06-07 Nokia Corporation Method and apparatus for updating a partitioned index
WO2012085968A1 (en) * 2010-12-22 2012-06-28 Hitachi, Ltd. Storage apparatus and storage management method
US8620897B2 (en) * 2011-03-11 2013-12-31 Microsoft Corporation Indexing and searching features including using reusable index fields

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9507750B2 (en) 2012-10-12 2016-11-29 A9.Com, Inc. Dynamic search partitioning
CN106131188A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 LAN system
CN105979015A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data service platform based on local area network
CN105979014A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data system based on local area network
CN105978739A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data platform based on local area network
CN105978913A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network service system
CN105979016A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Local area network data service system
CN106131189A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 LAN-based network platform
CN106060083A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service system with data monitoring function
CN106060081A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service platform with data monitor function
CN106131190A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network platform with data monitoring function
CN106131193A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 Local area network service platform with self-learning function
CN106131191A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN data service system with data monitoring function
CN106131194A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN platform with self-learning function
CN106131196A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network system with self-learning function
CN106131195A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN system with data monitoring function
CN106101024A (en) * 2016-07-16 2016-11-09 柳州健科技有限公司 LAN data system with data monitoring function
CN106131192A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 LAN-based network system with data monitoring function
CN106060082A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Local area network-based network service platform with data monitoring function
CN107977381A (en) * 2016-10-24 2018-05-01 华为技术有限公司 Data configuration method, index managing method, relevant apparatus and computing device
WO2018077138A1 (en) * 2016-10-24 2018-05-03 华为技术有限公司 Data configuration method, index management method, related apparatus and computing device
CN107977381B (en) * 2016-10-24 2021-08-27 华为技术有限公司 Data configuration method, index management method, related device and computing equipment
CN110019191A (en) * 2017-09-21 2019-07-16 阿里巴巴集团控股有限公司 Database information processing method and processing device
CN108881147A (en) * 2017-12-29 2018-11-23 北京视联动力国际信息技术有限公司 Data processing method and device for view networking
CN108881147B (en) * 2017-12-29 2019-07-05 视联动力信息技术股份有限公司 Data processing method and device for view networking
CN110134661A (en) * 2019-05-22 2019-08-16 东北大学 Facet-oriented academic big data storage and query method

Also Published As

Publication number Publication date
CN104823169B (en) 2018-12-21
SG10201606363SA (en) 2016-09-29
AU2013328901B2 (en) 2016-07-28
JP2015532493A (en) 2015-11-09
SG11201502828PA (en) 2015-05-28
AU2016231488B2 (en) 2017-09-21
EP2907034A4 (en) 2016-05-18
CA2888116C (en) 2018-03-27
KR101737246B1 (en) 2017-05-17
KR20170054579A (en) 2017-05-17
JP2017050012A (en) 2017-03-09
WO2014059394A1 (en) 2014-04-17
CA2888116A1 (en) 2014-04-17
JP6339155B2 (en) 2018-06-06
CN110096502A (en) 2019-08-06
KR20150066575A (en) 2015-06-16
AU2017245374A1 (en) 2018-01-18
KR101782302B1 (en) 2017-09-26
AU2016231488A1 (en) 2016-10-06
IN2015DN03160A (en) 2015-10-02
EP2907034A1 (en) 2015-08-19
BR112015008146A2 (en) 2017-07-04
AU2017245374B2 (en) 2018-08-09
AU2013328901A1 (en) 2015-05-14

Similar Documents

Publication Publication Date Title
CN104823169A (en) Index configuration for searchable data in network
US20210352030A1 (en) Computerized system and method for automatically determining and providing digital content within an electronic communication system
CN104704522B (en) Recommend native applications
US11341153B2 (en) Computerized system and method for determining applications on a device for serving media
US9372901B2 (en) Searching for software applications based on application attributes
US9268716B2 (en) Writing data from hadoop to off grid storage
US9223902B1 (en) Architectures for content identification
US9411839B2 (en) Index configuration for searchable data in network
TW201931067A (en) Computerized system and method for automatically performing an implicit message search
US20140222560A1 (en) System and method for monetization in photo sharing sites
US10263908B1 (en) Performance management for query processing
US20160239533A1 (en) Identity workflow that utilizes multiple storage engines to support various lifecycles
US20160323714A1 (en) Low key point of interest notification
EP3808035A1 (en) Multi-source data analytics system, data manager and related methods
US20160350272A1 (en) Obtaining attribution information for representations
US11250039B1 (en) Extreme multi-label classification
US9852135B1 (en) Context-aware caching
KR102277772B1 (en) Apparatus and method for integrated management of data in mobile device, and the mobile device
US20150026266A1 (en) Share to stream
US20150186672A1 (en) Photo privacy
US20140278924A1 (en) Selectively altering requests based on comparison of potential value of requests
US20160125034A1 (en) Annotate Apps with Entities by Fusing Heterogeneous Signals

Legal Events

Date Code Title Description
PB01 Publication
EXSB Decision made by SIPO to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant