Summary of the invention
The invention provides a data search method and system based on a cloud teaching platform to solve the above problem.
The invention provides a data search method based on a cloud teaching platform. The method comprises the following steps: a thematic data extraction server traverses the data in a teaching data memory in an endless loop and, based on a GUID list that it maintains and according to a first rule, obtains thematic data and stores the thematic data in a thematic data memory; an index data processing server traverses the thematic data in the thematic data memory in an endless loop and generates a thematic data index table from the thematic data; the index data processing server, based on the thematic data index table, generates a metadata index table according to a second rule and a word-and-phrase index table according to a third rule, and sends the metadata index table and the word-and-phrase index table to a query server; wherein, if the metadata index table or the word-and-phrase index table in the index data processing server has been updated, the index data processing server sends the updated metadata index table or word-and-phrase index table to the query server, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server periodically, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server in real-time synchronization; when a user performs a data search, the query server queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and the query server obtains a final query result according to an assessment algorithm and the initial query result.
The present invention also provides a data search system based on a cloud teaching platform, comprising a teaching data memory, a thematic data extraction server, a thematic data memory, an index data processing server and a query server. The thematic data extraction server is connected to the teaching data memory and the thematic data memory, and the index data processing server is connected to the thematic data memory and the query server. The thematic data extraction server is configured to traverse the data in the teaching data memory in an endless loop and, based on a GUID list that it maintains and according to a first rule, to obtain thematic data and store the thematic data in the thematic data memory. The index data processing server is configured to traverse the thematic data in the thematic data memory in an endless loop and to generate a thematic data index table from the thematic data. The index data processing server is further configured to generate, based on the thematic data index table, a metadata index table according to a second rule and a word-and-phrase index table according to a third rule, and to send the metadata index table and the word-and-phrase index table to the query server. If the metadata index table or the word-and-phrase index table in the index data processing server has been updated, the index data processing server sends the updated metadata index table or word-and-phrase index table to the query server, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server periodically, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server in real-time synchronization. When a user performs a data search, the query server queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and the query server obtains a final query result according to an assessment algorithm and the initial query result.
Compared with the prior art, in the data search method and system based on a cloud teaching platform provided by the invention, the thematic data extraction server obtains thematic data according to the first rule and stores it in the thematic data memory, so that refining the teaching data into thematic data improves the efficiency of data search. In addition, the index data processing server sends the metadata index table and the word-and-phrase index table to the query server; when a user performs a data search, the query server queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and then obtains a final query result according to the assessment algorithm and the initial query result. Querying the metadata index table and the word-and-phrase index table in turn improves the hit rate of the initial query result, and deriving the final query result from the assessment algorithm and the initial query result greatly improves the accuracy of the query.
Embodiment
The present invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments. It should be noted that, provided there is no conflict, the embodiments of the present application and the features in the embodiments may be combined with one another.
Figure 1 is a flowchart of the data search method based on a cloud teaching platform according to a preferred embodiment of the present invention. As shown in Figure 1, the data search method based on a cloud teaching platform provided by the preferred embodiment comprises steps 101 to 104.
In step 101, the thematic data extraction server traverses the data in the teaching data memory in an endless loop and, based on the GUID list that it maintains and according to the first rule, obtains thematic data and stores the thematic data in the thematic data memory.
In the present embodiment, the teaching data memory stores the teaching data of the cloud teaching platform. Specifically, a teacher on the cloud teaching platform can store teaching courseware in the teaching courseware area of the teaching data memory, and the video recorded in real time while the teacher gives a lesson can likewise be stored in the instructional video area of the teaching data memory. Students attend the lesson online and rate the teacher after the lesson; the ratings are stored in the student rating area of the teaching data memory. The teacher assigns homework, which is stored in the student homework area of the teaching data memory. After the students complete the homework and the teacher marks it, for example through the cloud teaching platform, the homework scores are stored in the homework score area of the teaching data memory. In addition, the teacher can summarize the main content of the lesson and enter corresponding keywords as metadata. The cloud teaching platform generates a globally unique identifier (GUID) for each lesson, which identifies all resources of that lesson (for example, courseware, video, ratings, homework, scores and metadata). For example, if teacher A prepares a lesson on "the life cycle of information systems", then after teacher A creates the teaching courseware and uploads it to the teaching data memory, the cloud teaching platform generates a globally unique identifier for the courseware, GUID: a1484645-786e-4f7e-bc09-0ecf36add696. The teaching data subsequently produced for this lesson, such as the instructional video, student ratings, student homework, homework scores and metadata, all use a1484645-786e-4f7e-bc09-0ecf36add696 as their GUID, which makes it easy to collate the thematic data.
In the present embodiment, the process by which the thematic data extraction server traverses the data in the teaching data memory in an endless loop and obtains thematic data based on the GUID list it maintains and according to the first rule is as follows: the thematic data extraction server traverses the data in the teaching data memory in an endless loop, obtains the GUID of each data item, and checks whether the GUID is present in the GUID list; if it is not, the data corresponding to the GUID is retrieved and archived as thematic data. Specifically, the thematic data extraction server maintains a GUID list. If the GUID of a data item that the thematic data extraction server reads from the teaching data memory is present in the GUID list, the data has already been extracted. If the GUID is not present in the GUID list, the data has not yet been extracted; in that case the thematic data extraction server extracts all teaching data corresponding to that GUID from the teaching data memory, archives it into a static file to form one thematic data item, and stores that thematic data item in the thematic data memory. After the extraction of the teaching data corresponding to the GUID is complete, the thematic data extraction server adds the GUID to the GUID list that it maintains. Here, each thematic data item comprises, for example, the GUID, teaching courseware, instructional video, student ratings, student homework, homework scores and metadata.
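The extraction pass described above can be illustrated with a minimal sketch. The class name ThematicExtractor, the record layout (dictionaries with "guid" and "kind" keys) and the in-memory dictionary standing in for the thematic data memory are all assumptions made for illustration; the source does not specify any of them. The sketch shows a single pass, which the server would repeat endlessly.

```python
from dataclasses import dataclass, field


@dataclass
class ThematicExtractor:
    """Hypothetical stand-in for the thematic data extraction server."""
    seen_guids: set = field(default_factory=set)        # the maintained GUID list
    thematic_store: dict = field(default_factory=dict)  # stand-in for the thematic data memory

    def scan_once(self, teaching_store):
        """Run one pass over the teaching data; the server would repeat this pass endlessly."""
        newly_archived = []
        for record in teaching_store:
            guid = record["guid"]
            if guid in self.seen_guids:
                continue  # this GUID has already been extracted
            # Collect every teaching record that shares the GUID into one archive.
            archive = [r for r in teaching_store if r["guid"] == guid]
            self.thematic_store[guid] = archive
            self.seen_guids.add(guid)  # update the GUID list after extraction
            newly_archived.append(guid)
        return newly_archived


# Illustrative records only; the GUIDs are the ones used as examples in the text.
teaching_store = [
    {"guid": "a1484645-786e-4f7e-bc09-0ecf36add696", "kind": "courseware"},
    {"guid": "a1484645-786e-4f7e-bc09-0ecf36add696", "kind": "video"},
    {"guid": "b3d4074c-ed0c-46b1-9078-7f1d49bf7c12", "kind": "courseware"},
]
extractor = ThematicExtractor()
first_pass = extractor.scan_once(teaching_store)
second_pass = extractor.scan_once(teaching_store)  # nothing new on the second pass
```

Keeping the GUID list as a set makes the membership check constant-time, which matters when the same data is traversed repeatedly.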
In step 102, the index data processing server traverses the thematic data in the thematic data memory in an endless loop and generates the thematic data index table from the thematic data.
In the present embodiment, the thematic data index table comprises a sequence number, thematic GUID, physical location, metadata and word-and-phrase data. Specifically, the index data processing server traverses the thematic data in the thematic data memory in an endless loop and checks whether the GUID of each thematic data item is present in the thematic data index table. If it is, the thematic data item has already been indexed in the thematic data index table; if it is not, the thematic data item has not yet been indexed, and its relevant information needs to be added to the thematic data index table. An example of the thematic data index table is shown in Table 1.
Table 1
In Table 1, the sequence number is the number of each index entry in the thematic data index table; the thematic GUID is the GUID of the teaching data; the physical location is the disk directory of the thematic data in the thematic data memory; the metadata is the keywords entered by the teacher; and the word-and-phrase data is obtained by segmenting all text content of the teaching data into words and extracting the words and phrases whose number of occurrences reaches a preset value. Extraction of word-and-phrase data is a technique commonly used in the art and is therefore not described further here.
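As a rough illustration of how a row of the thematic data index table might be assembled, the sketch below adds a row only when the GUID is not yet in the table and derives the word-and-phrase data with a simple frequency threshold. The function names, the row keys, the example path and the whitespace tokenizer are assumptions; a real system would use a proper word segmenter, which the source leaves unspecified.

```python
from collections import Counter


def extract_frequent_terms(text, min_count):
    """Return terms occurring at least min_count times, in first-seen order."""
    tokens = text.lower().split()  # assumption: whitespace split stands in for word segmentation
    counts = Counter(tokens)
    return [term for term in counts if counts[term] >= min_count]


def add_to_thematic_index(index_table, guid, location, metadata, text, min_count=2):
    """Append a row for guid unless the table already contains it; return True if added."""
    if any(row["guid"] == guid for row in index_table):
        return False  # already indexed
    index_table.append({
        "seq": len(index_table) + 1,  # sequence number of the index entry
        "guid": guid,                 # thematic GUID
        "location": location,         # physical location (hypothetical path)
        "metadata": list(metadata),   # keywords entered by the teacher
        "terms": extract_frequent_terms(text, min_count),  # word-and-phrase data
    })
    return True


index_table = []
added = add_to_thematic_index(
    index_table,
    "a1484645-786e-4f7e-bc09-0ecf36add696",
    "/thematic/a1484645",  # hypothetical directory
    ["life cycle"],
    "testing matters because testing finds defects",
)
added_again = add_to_thematic_index(
    index_table,
    "a1484645-786e-4f7e-bc09-0ecf36add696",
    "/thematic/a1484645",
    ["life cycle"],
    "testing matters because testing finds defects",
)
```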
In step 103, the index data processing server, based on the thematic data index table, generates the metadata index table according to the second rule and the word-and-phrase index table according to the third rule, and sends the metadata index table and the word-and-phrase index table to the query server.
In the present embodiment, the process by which the index data processing server generates the metadata index table from the thematic data index table according to the second rule is as follows: the index data processing server traverses all metadata in the thematic data index table in turn and groups identical metadata into one metadata index entry, where each metadata index entry comprises the metadata and its corresponding index positions. Likewise, the process by which the index data processing server generates the word-and-phrase index table from the thematic data index table according to the third rule is as follows: the index data processing server traverses all word-and-phrase data in the thematic data index table in turn and groups identical word-and-phrase data into one word-and-phrase index entry, where each word-and-phrase index entry comprises the word-and-phrase data and its corresponding index positions.
For example, the metadata index table shown in Table 2 and the word-and-phrase index table shown in Table 3 can be generated from the thematic data index table shown in Table 1.
Sequence number | Metadata | Index position
1 | Life cycle | 1, 2
2 | Project initiation | 1
3 | Development | 1, 2
4 | O&M | 1, 2
5 | Retirement | 1
6 | Requirements | 2
7 | Test | 2
8 | Product life | 3
9 | Formation | 3
10 | Growth | 3
11 | Maturity | 3
12 | Decline | 3
Table 2
In Table 2, the sequence number is the number of each index entry in the metadata index table, the metadata is the metadata recorded in the thematic data index table, and the index position is the sequence number of the entry in the thematic data index table that contains that metadata.
Sequence number | Word-and-phrase data | Index position
1 | Concept formation | 1
2 | Requirements analysis | 1
3 | Problem definition | 2
4 | Black-box testing | 2
5 | Market elimination | 3
6 | Market life | 3
7 | Life cycle | 3
Table 3
In Table 3, the sequence number is the number of each index entry in the word-and-phrase index table, the word-and-phrase data is the word-and-phrase data recorded in the thematic data index table, and the index position is the sequence number of the entry in the thematic data index table that contains that word-and-phrase data.
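The grouping performed by the second and third rules amounts to building an inverted index over the thematic data index table. The sketch below is one possible reading, not the patented implementation. Because the content of Table 1 is not reproduced in this text, the three sample rows are assumptions inferred backwards from Tables 2 and 3, and the function name build_inverted_index and the row keys are hypothetical.

```python
def build_inverted_index(thematic_index, field_name):
    """Group identical values of field_name into one entry listing the rows that contain them."""
    positions = {}
    for row in thematic_index:
        for key in row[field_name]:
            positions.setdefault(key, []).append(row["seq"])
    # Dictionaries keep insertion order, so entries are numbered in first-seen order.
    return [
        {"seq": i, "key": key, "positions": seqs}
        for i, (key, seqs) in enumerate(positions.items(), start=1)
    ]


# Assumed content of the thematic data index table, inferred from Tables 2 and 3.
thematic_index = [
    {"seq": 1,
     "metadata": ["life cycle", "project initiation", "development", "O&M", "retirement"],
     "terms": ["concept formation", "requirements analysis"]},
    {"seq": 2,
     "metadata": ["life cycle", "development", "O&M", "requirements", "test"],
     "terms": ["problem definition", "black-box testing"]},
    {"seq": 3,
     "metadata": ["product life", "formation", "growth", "maturity", "decline"],
     "terms": ["market elimination", "market life", "life cycle"]},
]

metadata_index = build_inverted_index(thematic_index, "metadata")  # second rule
term_index = build_inverted_index(thematic_index, "terms")         # third rule
```

With these assumed rows, the first metadata entry is "life cycle" at index positions 1 and 2, and the seventh word-and-phrase entry is "life cycle" at index position 3.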
In the present embodiment, after the index data processing server has generated the metadata index table and the word-and-phrase index table, it can send them to the query server. Specifically, if the metadata index table or the word-and-phrase index table in the index data processing server has been updated, the index data processing server sends the updated metadata index table or word-and-phrase index table to the query server, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server periodically, or the index data processing server sends the metadata index table or the word-and-phrase index table to the query server in real-time synchronization. In other words, the metadata index table and the word-and-phrase index table stored in the index data processing server and in the query server are kept consistent. However, the present invention does not limit the manner in which this consistency between the index data processing server and the query server is maintained.
In step 104, when a user performs a data search, the query server queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and the query server obtains a final query result according to an assessment algorithm and the initial query result.
In the present embodiment, when a user searches the data of the cloud teaching platform, the user enters query information (for example, a keyword), and the query server queries the metadata index table and the word-and-phrase index table in turn according to that query information. For example, if the query information entered by the user is "life cycle", querying the metadata index table yields the initial query result shown in Table 4, and querying the word-and-phrase index table yields the initial query result shown in Table 5.
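One of the transfer modes named above, sending a table when it has been updated, can be sketched as follows. The classes QueryServerStub and IndexPublisher and their methods are hypothetical, and the in-process method call stands in for whatever network transport a real deployment would use. The periodic mode could reuse the same push_changed method on a timer.

```python
import copy


class QueryServerStub:
    """Hypothetical receiver that keeps its own copy of each index table."""

    def __init__(self):
        self.tables = {}

    def receive(self, name, table):
        self.tables[name] = copy.deepcopy(table)


class IndexPublisher:
    """Hypothetical sender on the index data processing server side."""

    def __init__(self, query_server):
        self.query_server = query_server
        self.tables = {}  # current tables on the index data processing server
        self.sent = {}    # last version of each table that was sent

    def update(self, name, table):
        """Store a new table version and push it if it differs from what was sent."""
        self.tables[name] = table
        return self.push_changed()

    def push_changed(self):
        """Send every table whose content changed since the last send."""
        pushed = []
        for name, table in self.tables.items():
            if self.sent.get(name) != table:
                self.query_server.receive(name, table)
                self.sent[name] = copy.deepcopy(table)
                pushed.append(name)
        return pushed


server = QueryServerStub()
publisher = IndexPublisher(server)
first_push = publisher.update("metadata", [{"key": "life cycle", "positions": [1, 2]}])
idle_push = publisher.push_changed()  # nothing changed, so nothing is sent
```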
Sequence number | Metadata | Index position
1 | Life cycle | 1, 2
Table 4
Sequence number | Word-and-phrase data | Index position
7 | Life cycle | 3
Table 5
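The two-stage lookup that produces Tables 4 and 5 can be sketched as below. The source does not say how query information is matched against index entries, so the case-insensitive exact match used here is an assumption, as are the function names and the entry layout. The sample entries are a small subset of Tables 2 and 3.

```python
def lookup(index_table, query):
    """Return the entries whose key equals the query, ignoring case (assumed matching rule)."""
    wanted = query.strip().casefold()
    return [entry for entry in index_table if entry["key"].casefold() == wanted]


def initial_query(metadata_index, term_index, query):
    """Query the metadata index table first, then the word-and-phrase index table."""
    return {
        "metadata": lookup(metadata_index, query),
        "terms": lookup(term_index, query),
    }


# Subset of Tables 2 and 3, for illustration only.
metadata_index = [
    {"seq": 1, "key": "Life cycle", "positions": [1, 2]},
    {"seq": 7, "key": "Test", "positions": [2]},
]
term_index = [
    {"seq": 3, "key": "Problem definition", "positions": [2]},
    {"seq": 7, "key": "Life cycle", "positions": [3]},
]
result = initial_query(metadata_index, term_index, "life cycle")
```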
The query server then obtains the final query result according to the assessment algorithm and the initial query result. Here, the assessment algorithm is as follows: an average rating and an average homework score are calculated from the student ratings and homework scores of each thematic data item; the assessed value of each thematic data item equals the product of its average rating and a first ratio plus the product of its average homework score and a second ratio, where the first ratio and the second ratio sum to 1.
Taking the initial query results in Table 4 and Table 5 as an example, the query server uses the index positions in the initial query result to look up the physical location of each corresponding thematic data item in the thematic data index table, uses the physical location to find the ratings and homework scores of that thematic data item, calculates its average rating and average homework score, and then calculates its assessed value. For example, with a first ratio of 0.4 and a second ratio of 0.6, the assessed value of each thematic data item = 0.4 * its average rating + 0.6 * its average homework score. However, the present invention does not limit the values of the first ratio and the second ratio.
Here, according to the assessment algorithm, the assessment information of the thematic data corresponding to Table 4 and Table 5 is, for example, as shown in the following table.
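The weighted sum used by the assessment algorithm can be written out as a short function. The function name assessed_value and the sample ratings and homework scores are assumptions chosen for illustration; only the 0.4 and 0.6 ratios and the requirement that the two ratios sum to 1 come from the text.

```python
from statistics import mean


def assessed_value(ratings, homework_scores, first_ratio=0.4, second_ratio=0.6):
    """Weighted sum of the average rating and the average homework score."""
    if abs(first_ratio + second_ratio - 1.0) > 1e-9:
        raise ValueError("the first ratio and the second ratio must sum to 1")
    return first_ratio * mean(ratings) + second_ratio * mean(homework_scores)


# Hypothetical inputs: average rating 95, average homework score 85.
value = assessed_value([90, 100], [80, 90])
```

With these inputs the calculation is 0.4 * 95 + 0.6 * 85, that is, 38 + 51 = 89.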
Index position | Thematic GUID | Assessed value | Data source
1 | a1484645-786e-4f7e-bc09-0ecf36add696 | 95.32 | Metadata
2 | b3d4074c-ed0c-46b1-9078-7f1d49bf7c12 | 86.23 | Metadata
3 | db03e971-1fab-444d-a761-11f5b25330ea | 93.56 | Word-and-phrase data
In the present embodiment, the final query result lists the metadata query results first and the word-and-phrase query results second, and within each type the thematic data items are ordered from the largest to the smallest assessed value obtained by the assessment algorithm. Thus, from the assessed values obtained by the above assessment algorithm, the final query result shown in Table 6 is obtained.
Index position | Thematic GUID
1 | a1484645-786e-4f7e-bc09-0ecf36add696
2 | b3d4074c-ed0c-46b1-9078-7f1d49bf7c12
3 | db03e971-1fab-444d-a761-11f5b25330ea
Table 6
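The ordering that leads to Table 6 can be reproduced with a two-part sort key: data source first (metadata before word-and-phrase data), then assessed value in descending order. The function name final_result and the dictionary layout are assumptions; the GUIDs and assessed values are the ones given in the text.

```python
def final_result(assessments):
    """Order metadata hits before word-and-phrase hits, each by descending assessed value."""
    source_rank = {"metadata": 0, "terms": 1}
    ranked = sorted(assessments, key=lambda a: (source_rank[a["source"]], -a["value"]))
    return [(a["position"], a["guid"]) for a in ranked]


# Deliberately listed out of order to show that the sort restores the expected order.
assessments = [
    {"position": 3, "guid": "db03e971-1fab-444d-a761-11f5b25330ea", "value": 93.56, "source": "terms"},
    {"position": 2, "guid": "b3d4074c-ed0c-46b1-9078-7f1d49bf7c12", "value": 86.23, "source": "metadata"},
    {"position": 1, "guid": "a1484645-786e-4f7e-bc09-0ecf36add696", "value": 95.32, "source": "metadata"},
]
ordered = final_result(assessments)
```

Note that the word-and-phrase hit stays last even though its assessed value (93.56) exceeds that of the second metadata hit (86.23), because the data source takes precedence over the assessed value.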
Here, the query server can retrieve the corresponding thematic data from the thematic data memory according to the final query result (for example, Table 6) and output the retrieved thematic data to the user. In addition, the query server can save the final query result (for example, to a query log memory) so that the next related query can obtain the corresponding final query result directly, which improves query efficiency. However, the present invention is not limited in this respect. In practice, the final query result may also be saved on the query server.
Figure 2 is a schematic diagram of the data search system based on a cloud teaching platform according to a preferred embodiment of the present invention. As shown in Figure 2, the data search system based on a cloud teaching platform provided by the preferred embodiment comprises a teaching data memory 10, a thematic data extraction server 12, a thematic data memory 14, an index data processing server 16 and a query server 18. The thematic data extraction server 12 is connected to the teaching data memory 10 and the thematic data memory 14, and the index data processing server 16 is connected to the thematic data memory 14 and the query server 18.
In the present embodiment, the thematic data extraction server 12 is configured to traverse the data in the teaching data memory 10 in an endless loop and, based on the GUID list that it maintains and according to the first rule, to obtain thematic data and store the thematic data in the thematic data memory 14. The index data processing server 16 is configured to traverse the thematic data in the thematic data memory 14 in an endless loop and to generate the thematic data index table from the thematic data. The index data processing server 16 is further configured to generate, based on the thematic data index table, the metadata index table according to the second rule and the word-and-phrase index table according to the third rule, and to send the metadata index table and the word-and-phrase index table to the query server 18. If the metadata index table or the word-and-phrase index table in the index data processing server 16 has been updated, the index data processing server 16 sends the updated metadata index table or word-and-phrase index table to the query server 18, or the index data processing server 16 sends the metadata index table or the word-and-phrase index table to the query server 18 periodically, or the index data processing server 16 sends the metadata index table or the word-and-phrase index table to the query server 18 in real-time synchronization. When a user performs a data search, the query server 18 queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and the query server 18 obtains a final query result according to the assessment algorithm and the initial query result. The specific operation of the system is the same as described for the method above and is therefore not repeated here.
In summary, in the data search method and system based on a cloud teaching platform provided by the preferred embodiment of the present invention, the thematic data extraction server obtains thematic data according to the first rule and stores it in the thematic data memory, so that refining the teaching data into thematic data improves the efficiency of data search. In addition, the index data processing server sends the metadata index table and the word-and-phrase index table to the query server; when a user performs a data search, the query server queries the metadata index table and the word-and-phrase index table in turn according to the query information entered by the user to obtain an initial query result, and then obtains a final query result according to the assessment algorithm and the initial query result. Querying the metadata index table and the word-and-phrase index table in turn improves the hit rate of the initial query result, and deriving the final query result from the assessment algorithm and the initial query result greatly improves the accuracy of the query.
The above are only preferred embodiments of the present invention and are not intended to limit it; a person skilled in the art may make various modifications and variations to the invention. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.
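Saving final query results for reuse can be sketched as a small cache in front of the search routine. The class CachingQueryServer, the normalization of the query string and the in-memory dictionary standing in for the query log memory are all assumptions; the source does not describe how a "related" query is recognized, so this sketch only reuses results for queries that are identical after trimming and case folding.

```python
class CachingQueryServer:
    """Hypothetical wrapper that reuses saved final query results."""

    def __init__(self, search_fn):
        self.search_fn = search_fn  # computes a final query result for a normalized query
        self.query_log = {}         # stand-in for the query log memory
        self.computed = 0           # number of times search_fn was actually called

    def search(self, query):
        key = query.strip().casefold()
        if key not in self.query_log:
            self.computed += 1
            self.query_log[key] = self.search_fn(key)
        return self.query_log[key]


# A stand-in search routine returning index positions, for illustration only.
server = CachingQueryServer(lambda q: [1, 2, 3] if q == "life cycle" else [])
first = server.search("Life cycle")
second = server.search("  life cycle ")  # served from the saved result
```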