CN110889034A - Data analysis method and data analysis system - Google Patents

Data analysis method and data analysis system Download PDF

Info

Publication number
CN110889034A
CN110889034A CN201910105173.2A CN201910105173A CN110889034A CN 110889034 A CN110889034 A CN 110889034A CN 201910105173 A CN201910105173 A CN 201910105173A CN 110889034 A CN110889034 A CN 110889034A
Authority
CN
China
Prior art keywords
learning
data
server
sections
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910105173.2A
Other languages
Chinese (zh)
Inventor
李实恭
詹诗涵
柯兆轩
蓝国诚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Delta Electronics Inc
Original Assignee
Delta Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Delta Electronics Inc filed Critical Delta Electronics Inc
Priority to JP2019091400A priority Critical patent/JP2020042771A/en
Priority to SG10201905523TA priority patent/SG10201905523TA/en
Priority to EP19189383.3A priority patent/EP3621022A1/en
Priority to US16/529,818 priority patent/US11409804B2/en
Publication of CN110889034A publication Critical patent/CN110889034A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/435Filtering based on additional data, e.g. user or group profiles
    • G06F16/437Administration of user profiles, e.g. generation, initialisation, adaptation, distribution

Abstract

The present disclosure relates to a data analysis method and a data analysis system thereof. The data analysis method includes receiving first learning data, and adding multiple segment marks to the first learning data to divide the first learning data into multiple first learning sections. These first learning sections are connected with each other in time sequence. And screening out the first keyword string corresponding to each first learning section from each first learning section. An analysis instruction is received, and the analysis instruction and the respective first keyword strings of each first learning section are analyzed to obtain the similarity between the analysis instruction and each corresponding first learning section. And finally, screening out the first learning section with the highest similarity.

Description

Data analysis method and data analysis system
Technical Field
The present disclosure relates to a data analysis method and a data analysis system, which are used for screening out corresponding learning materials according to an analysis instruction.
Background
The on-line learning platform is a network service that stores a plurality of learning data in a server, so that a user can connect to the server through the internet to browse the learning data at any time. In the existing various online learning platforms, the types of learning materials provided include films, audios, presentations, documents or forums, etc.
Because the amount of learning materials stored in the online learning platform is huge, a user needs to input a search instruction (search instruction) according to the own needs to retrieve the relevant learning materials from the online learning platform, and thus, there is still room for improvement.
Disclosure of Invention
One aspect of the present disclosure is a data analysis method. The data analysis method comprises the following steps: the first learning data is received. A plurality of first segment marks are added to the first learning material to divide the first learning material into a plurality of first learning segments. According to each of the plurality of first learning sections, a first keyword string corresponding to each first learning section is generated. And receiving an analysis instruction related to the operation of a user. The analysis instruction and the first keyword string of each first learning section are analyzed to obtain a plurality of first similarities between the analysis instruction and each corresponding first learning section. And screening out one of the first learning sections with the highest similarity from the plurality of first learning sections.
Another aspect of the present disclosure is a data analysis system. The data analysis system comprises a first server, a second server and a storage unit. The first server is used to receive the first learning data. The storage unit is used for receiving and storing the first learning data from the first server. The second server is used to add a plurality of first segment marks in the first learning data to divide the first learning data into a plurality of first learning segments. The second server is used for generating a first keyword string corresponding to each first learning section according to each of the plurality of first learning sections. The second server is used for receiving the analysis instruction and analyzing the analysis instruction and the respective first keyword string of each first learning section to obtain a plurality of first similarities between the analysis instruction and each corresponding first learning section, and the second server screens out one of the first learning sections with the highest similarity from the plurality of first learning sections.
Therefore, the data analysis system can add the first segment marks to the first learning materials, and generate the first keyword string on each first learning segment after distinguishing the first learning segments, so that each first learning segment of the first learning materials can be accurately searched when a subsequent user logs in the data analysis system, or the data analysis system can actively recommend a proper learning segment to the user, thereby improving the operation experience of the user.
Drawings
Fig. 1 is a schematic diagram of a data analysis system according to some embodiments of the present disclosure.
Fig. 2A is a schematic view of a text file of first learning data according to some embodiments of the disclosure.
Fig. 2B is a schematic image frame of the first learning data according to some embodiments of the disclosure.
Fig. 3 is a schematic diagram of a data analysis method according to some embodiments of the disclosure.
Fig. 4 is a schematic diagram of a data analysis method according to some embodiments of the disclosure.
[ description of reference ]
100 data analysis system
110 first server
120 second server
130 memory cell
131 course database
131a first learning material
131b second learning material
131c third learning material
132 analytical database
133 behavior database
134 recommendation database
200 terminal device
A1 text file
A11 learning segment
A12 learning segment
A13 learning segment
A14 learning segment
A21 learning segment
A22 learning segment
B1 video file
B01 video picture
B02 video picture
B03 video picture
B04 video picture
B11 learning section
B12 learning section
T1 first correlation marker
T2 second correlation marker
S301 to S311
S401 to S408 steps
Detailed Description
Reference will now be made in detail to the present embodiments of the present application, examples of which are illustrated in the accompanying drawings. It should be understood, however, that these implementation details should not be used to limit the application. That is, in some embodiments of the disclosure, such practical details are not necessary. In addition, for simplicity, some conventional structures and elements are shown in the drawings in a simple schematic manner.
When an element is referred to as being "connected" or "coupled," it can be referred to as being "electrically connected" or "electrically coupled. "connected" or "coupled" may also be used to indicate that two or more elements are in mutual engagement or interaction. Moreover, although terms such as "first," "second," …, etc., may be used herein to describe various elements, these terms are used merely to distinguish one element or operation from another element or operation described in similar technical terms. Unless the context clearly dictates otherwise, the terms do not specifically refer or imply an order or sequence nor are they intended to limit the invention.
In the existing online learning platform, when a user inputs a search command (search command), the server only compares and screens the search command with the file name, caption text or mark (such as leave word) of the learning material. However, if the content of the learning material is huge (e.g. a movie with a length of two hours), the user still needs to manually adjust the learning material (e.g. adjust the playing time to 45 th minute) to find the section most related to his/her own needs. That is, the analysis mechanism of the existing online learning platform can only search names or subtitles, and cannot perform a fine search according to the requirement. In addition, the user cannot find out the interesting learning materials except through active search.
Fig. 1 is a schematic diagram of a data analysis system according to some embodiments of the present disclosure. Referring to fig. 1, the present disclosure relates to a data analysis system. The data analysis system 100 includes a first server 110, a second server 120, and a storage unit 130. In the present embodiment, the first server 110 is electrically connected to the second server 120, and in other embodiments, a connection can be established between the first server 110 and the second server 120 through a network for data transmission. The storage unit 130 is a data storage device, for example: flash memory devices, memory cards, hard disks, and the like. In some embodiments, the storage unit 130 is stored in a separate server. In some other embodiments, the storage unit 130 may be disposed in the first server 110 or the second server 120. In other embodiments, the first server 110 and the second server 120 can be integrated into a single server.
In the present embodiment, the data analysis system 100 is used to provide an online learning service, for example, a user can connect to the first server 110 through a terminal device 200 (e.g., a personal computer, a notebook computer, or a smart phone) to browse an online learning interface. When the user wants to browse the learning content, the first server 110 can access the corresponding file from the storage unit 130 through the processor therein. The second server 120 is used to perform classification, management and statistics functions through its processor. However, the application of the present disclosure is not limited thereto, and the data analysis system 100 may also be applied to an audio video streaming platform or an internet discussion forum.
The first server 110 is used to receive a plurality of learning data. In some embodiments, the first server 110 receives the learning data from the terminal device 200 via the internet. The learning material may be a film, sound, presentation, or discussion string. For convenience of description, the present embodiment is described by dividing the plurality of learning materials into the first learning material 131a, the second learning material 131b, and the third learning material 131 c. However, the disclosure is not limited thereto, and the number of learning materials can be arbitrarily adjusted.
In some embodiments, after the first server 110 receives the first learning data 131a from the terminal device 200, the first server 110 uploads the first learning data 131a to the storage unit 130, and the first server 110 sends a notification message to the second server 120. The second server 120 is connected to the storage unit 130 to add a plurality of first segment marks in the first learning data 131a, so that the first learning data 131a can be divided into a plurality of first learning sections according to the first segment marks. In some embodiments, the first learning sections are connected to each other according to a chronological sequence (e.g., a predetermined time axis in the first learning material 131 a).
For example, if the first learning data 131a is a movie file with a film length of 30 minutes, the second server 120 can add the first segment marks at the locations of the movie time of 10 minutes and 20 minutes, respectively, to divide the movie file into three first learning segments. Similarly, if the first learning data is a 10-page presentation document, the second server 120 can add the first segment marks at pages 2, 5 and 7 respectively to divide the presentation document into four learning segments. In some other embodiments, the first learning sections are not necessarily consecutive to each other, but only in chronological order. For example, the first learning section is the 1-20 minute paragraph of the movie, and the second learning section is the 30-45 minute paragraph of the movie.
In some embodiments, the segment mark is an identification tag (tag) or an identifier (identifier) added to the first learning material 131a, so that the first server 110 or the second server 120 can quickly find a specific part of the content of the first learning material 131a, but the form of the segment mark is not limited thereto. The generation of the segmentation markers will be described in detail later.
After the second server 120 adds the first segment flag to the first learning data 131a, the second server 120 analyzes a respective first keyword string (keyword) from each of the first learning segments. In some embodiments, the first keyword string comprises at least one keyword. For example: for a movie file divided into three first learning sections, the keyword string of each first learning section can be "projector, image, principle", "high frequency signal, sharpening, enhancing" and "lifting, definition". In some embodiments, the first keyword string may be a text content in each of the first learning sections, the occurrence frequency of which is higher than a set value. The analysis of the first keyword string will be described in detail later.
The second server 120 is used for storing the first learning data and the corresponding first segment flag and the first keyword string in the storage unit 130. In some embodiments, the first server 110 stores the first learning material 131a in the course database 131 in the storage unit 130. After the second server 120 generates the first segment flag and generates the first keyword string, the second server 120 stores the first segment flag and the first keyword string in the analysis database 132 of the storage unit 130. In some other embodiments, the second server 120 further stores a first identification code corresponding to the first learning material in the analysis database 132, so that the first segment mark and the first keyword string can correspond to the first learning material in the course database 131 according to the first identification code.
The data analysis system 100 of the present disclosure can recommend suitable learning materials to the user according to an analysis instruction related to the user operation. In some embodiments, the first server 110 is configured to transmit the analysis command to the second server 120. Then, the second server 120 is connected to the storage unit 130 according to the analysis command, and analyzes the analysis command and the respective first keyword string of each first learning section to obtain a first similarity between the analysis command and the respective first keyword string corresponding to each first learning section. The second server 120 will select the first learning section with the highest similarity from the first learning sections, and integrate the selected first learning sections into the analysis information, which is transmitted to the terminal device 200 for displaying (e.g., displaying the search screen or the recommendation screen) on the terminal device 200.
Next, the generation method and timing of the analysis command will be described. In some embodiments, the analysis command may be generated according to a user operation (e.g., a search operation), and the analysis command includes a first search command transmitted from the terminal device 200. For example, the first search instruction may be a search string "projector, principle", and the keyword strings of the three first learning sections in the first learning data 131a are "projector, image, principle", "high frequency signal, sharpening, enhancing" and "lifting, definition", respectively. After analysis (e.g., comparing the similarity of the strings), the first keyword string of the first learning section is most similar to the search string, so that the second server 120 can transmit the comparison result back to the terminal device 200 through the first server 110 (e.g., a recommended result is displayed on the user interface), so that the user can know that the first learning section of the first learning data 131a is most similar to the search instruction. In some other embodiments, the second server 120 can compare the search command with the keyword string of each learning segment in all the learning data 131 a-131 c to accurately determine the learning data most related to the search command and the corresponding learning segment.
In some other embodiments, the user inputs the first search command through the terminal device 200, and the first search command may be a spoken (semantic) text, that is, the first search command may be a semantic text, for example: "what is the principle of enhancement of the projector". The second server 120 performs an analysis process on the first search command to generate a search string. In some embodiments, the second server 120 analyzes the search instruction using a semantic analysis technique, such as: the first search command can be analyzed to find words such as "projector", "principle", "what", etc. Then, the search string is compared with the first keyword string of each first learning segment. The operation principle of semantic analysis can be understood by those skilled in the art, and therefore, will not be described herein.
In some other embodiments, the analysis instruction may be a recommendation instruction actively generated by the data analysis system 100. That is, the first server 110 can generate the analysis command according to the user's operation. For example, when the first server 110 determines that the terminal device 200 is connected to the data analysis system 100 (e.g., the user logs in to the online learning system), the first server 110 generates an analysis command to actively analyze the document that may be of interest to the user through the second server 120. Alternatively, the first server 110 can generate the analysis command when the operation of the user is determined to meet the predetermined condition (e.g., the user browses the learning data for half an hour, the user asks questions, leaves a message or marks about the learning data) according to the operation of the user.
In some other embodiments, the first server 110 further generates an analysis command according to the behavior data stored in the storage unit 130 after confirming that the user operation meets the predetermined condition. For example, the data analysis system 100 may generate the analysis command according to behavior data (e.g., usage records of the user) stored in the behavior database 133 of the storage unit 130, the details of which will be described later.
In addition, in some embodiments, if the first server 110 generates the analysis command after confirming that the operation of the user meets the predetermined condition, and the second server 120 selects the first learning section closest to the analysis command, the second server 120 may first store the selected first learning section in the recommendation database 134 in the storage unit 130. The first server 110 may transmit the first learning section stored in the recommendation database 134 to the terminal device 200 at a predetermined recommendation time (e.g., when the user logs in or out of the online learning system, or after the user browses a movie).
Accordingly, after receiving the first learning material 131a, the data analysis system 100 subdivides the first learning material 131a into a plurality of first learning sections, and each of the first learning sections has a corresponding first keyword string. Therefore, after the user logs in the data analysis system 100, the data analysis system 100 can accurately provide the appropriate learning data to the user according to the analysis command. As mentioned above, the present disclosure includes at least two application modes: first, when the user searches by using the data analysis system 100, the data analysis system 100 can precisely find out which learning section of the first learning material 131a is most similar to the first search command, in addition to finding out the first learning material 131a most similar to the first search command. Second, when the user logs in the data analysis system 100, the first server 110 can generate an analysis command (e.g., search the user's usage record) according to the user's operation when the predetermined condition is met, and find a corresponding first learning segment according to the analysis command to recommend the first learning segment to the user. Accordingly, the accuracy of the data analysis system 100 in analysis and search can be greatly improved, and the user experience can be improved.
Next, referring to fig. 1 and fig. 2A, please refer to a generation manner of the segment marks, and fig. 2A is a schematic text file diagram of the first learning data 131a according to some embodiments of the disclosure. . In some embodiments, the first learning material 131a includes a text file a1 (e.g., subtitles). After receiving the first learning data 131a, the second server 120 analyzes the text file a1, for example: generating a plurality of characteristic sentences by a semantic analysis method. These characteristic sentences have precedence relationships. Then, the similarity between adjacent characteristic sentences is calculated to generate a first segment flag.
For example, after the text document a1 is analyzed, the generated characteristic sentence includes "the projector adjusts the light emitting unit according to the image signal", "the light emitted by the light emitting unit is reflected as an image picture", and "in another type of projector". The first sentence and the second sentence have the same words of image and luminescence, and have higher similarity, and the second sentence and the third sentence have lower similarity. Therefore, when the second server 120 determines that the similarity between adjacent characteristic sentences is lower than a predetermined value (e.g., there is no identical word or one of the adjacent characteristic sentences is a turning sentence, such as … in other embodiments), the second server 120 generates the first segment flag. So as to divide the character file A1 into a plurality of first learning sections A11-A14.
In the foregoing embodiment, the text file a1 may generate the feature sentences through a semantic analysis technique, and calculate the similarity between the feature sentences, but the disclosure is not limited thereto. In some embodiments, the processor in the second server 120 may also perform binarization (binary) on the text file a1, and compare the processed text file with the processed text file to determine similarity, so as to establish the feature sentences or determine similarity between the feature sentences.
The text file in the foregoing embodiment refers to the text content of the subtitles or the brief report of the movie, and if the text file is the discussion content of the internet forum, the text file can still be segmented by the same principle. Similarly, if the first learning data 131a includes a voice file, the second server 120 can generate a text file a1 through voice recognition and then perform semantic analysis to generate a plurality of characteristic sentences.
In some other embodiments, referring to fig. 2B, the first learning data 131B includes an image file B1. The video file B1 further includes a plurality of video frames B01-B04. The video frames B01-B04 can be a plurality of frames of video files linked according to time sequence. The second server 120 is used for determining the similarity between the adjacent image frames B01-B04 to generate a first segment flag. For example, the video frames B01-B02 are used to display the structure of the projector, and the video frames B03-B04 are used to display the path diagram of the light projection. Since the similarity between the video frames B02 and B03 is low, the second server 120 can add the first segment flag between the video frames B02 and B03 to form a plurality of first learning segments B11 and B12.
Referring again to FIG. 2A, a method for generating a first keyword string by the data analysis system 100 is described. The second server 120 performs an analysis process (e.g., a semantic analysis) on the text file A1 in the first learning data 131a to generate a plurality of feature words. Then, after the second server 120 generates the first segment flag in the manner described above, such that the first learning data 131a is divided into a plurality of first learning sections A11-A14 or B11-B12, the second server 120 will determine the number of the feature words in each of the first learning sections A11-A14 or A21-A22, and when the number is greater than the threshold, set the first learning section as the first keyword string. For example, a first learning section a11 of the text document a1 includes the following contents: the projector adjusts the light-emitting unit according to the image signal, and the light projected by the light-emitting unit is reflected as an image, wherein the image appears 2 times, the light-emitting unit appears 2 times, and the projector and the light respectively appear 1 time. The second server 120 may set the feature word "image, light-emitting unit" appearing 2 times as the first keyword string.
In some embodiments, if the second server 120 finds a plurality of matching learning data (e.g., the first learning data 131a and the second learning data 131b have the "projector" keyword) according to the analysis command (e.g., the first search command), the second server 120 can further generate the search list. In some other embodiments, the first server 110 is further configured to provide a management interface. The management interface is used for a manager or a maintenance person of the data analysis system 100 to view the internal parameters and response data of the data analysis system 100, so that the manager or the maintenance person can adjust the parameters (e.g., threshold values, semantic recognition parameters, etc.) in the data analysis system 100 to optimize the performance of the data analysis system 100.
In addition, the second server 120 can refer to the behavior record of the user to sort the learning data in the search list. Referring to fig. 1, in some embodiments, the storage unit 130 stores a first learning material 131a, a second learning material 131b, and a third learning material 131 c. The second learning material 131b can be divided into a plurality of second learning sections according to the second segmentation labels in the manner described above, and each second learning section includes a respective second keyword string; similarly, the third learning data 131c is divided into a plurality of third learning sections according to a plurality of third segment labels, and each third learning section includes a third keyword string. After the second server 120 screens out the matched first learning data 131a according to the first search instruction, the second server 120 transmits the first learning data 131a to the terminal device 200 through the first server 110 for the user to browse. Meanwhile, the first server 110 or the second server 120 can generate behavior data accordingly and store the behavior data in a behavior database in the storage unit 130.
The behavior data is used to record various operations of the user after connecting to the data analysis system 100, such as: browsing specific learning materials, transmitting messages, marking important documents, etc. The data analysis system 100 can sort the search listings based on the behavioral data. In some embodiments, the first server 110 is configured to generate an analysis command according to the behavior data, for example, the first server 110 may filter out the search string (e.g., the most frequently appearing title name) according to the learning data most frequently browsed by the user to calculate the first similarity. Accordingly, even if the user does not actively input the first search command, the data analysis system 100 can still periodically generate the analysis command and recommend the appropriate learning data and the learning sections thereof.
In some embodiments, the first server 110 is further configured to receive a second search command from the terminal device 200. The second server 120 screens a plurality of learning data, such as a second learning data 131b and a second learning data 131c, from the storage unit 130 according to the second search command. In some embodiments, the second server 120 compares the second search command with the similarity of the keyword string of each learning section in each of the learning data 131 a-131 c, and selects the learning section with the similarity higher than a predetermined value. For example, if the second search command is "image enhancement", and 3 times of "image enhancement" occur in one of the second learning sections of the second learning data 131b and 5 times of "image enhancement" occur in one of the third learning sections of the third learning data 131c, the second server 120 lists the two learning sections as entries in the search list.
Accordingly, after the second server 120 screens the second learning material 131b and the third learning material 131c according to the second search instruction, since the user has previously viewed the first learning material 131a, the second server 120 will further compare the first keyword string in the first learning material 131a with the second keyword string in the second learning material 131b and the third keyword string in the third learning material 131c to obtain a second similarity. Then, a search list is generated and the second learning data 131b and the third learning data 131c are sorted according to the level of the second similarity.
For example, the user has previously browsed the keyword string of the first learning section of the first learning data 131a, which includes 5 keywords. The second keyword string in the second learning data 131b analyzed by the second server 120 includes "projector" and other 3 keywords; the third keyword string in the third learning data 131c screened by the second server 120 includes "projector" and other 4 keywords. The similarity between the first keyword string and the second keyword string is 60% (e.g., three keywords are the same), and the similarity between the first keyword string and the third keyword string is 20% (e.g., only one keyword is the same), which means that the content of the second learning data 131b is relatively similar to the first learning data 131a previously browsed by the user, therefore, the second server 120 will arrange the second learning section of the screened second learning data 131b before the third learning section of the third learning data 131 c.
In some embodiments, the data analysis system 100 can also establish correlations between multiple learning sections. Referring to fig. 1, fig. 2A and fig. 2B, for convenience of description, the text file a1 of fig. 2A and the image file B1 of fig. 2B are regarded as contents of different learning materials. Wherein the learning section A13 has a first related flag T1, and the image B02 in the learning section B11 has a second related flag T2. After the second server 120 screens the learning section A13 according to the first search command, if the learning section B11 is determined to have the second correlation flag T2, the second server 120 will generate the recommendation list according to the learning section B11. For example, the learning section a13 is a movie illustrating the "principle of operation of the projector", and the learning section B11 is a brief file illustrating the "structure of the projector", so that when the user is browsing the learning section a13, the data analysis system 100 can recommend the user to browse the learning section B11 together.
Please refer to fig. 3, which is a schematic diagram illustrating a data analysis method according to some embodiments of the present disclosure. The data analysis method segments the first learning data 131a and generates a first keyword string through the following steps S301-S311. In step S301, the terminal device 200 transmits the first learning data 131a to the first server 110. In step S302, the first server 110 uploads the first learning data 131a to the storage unit 130. In step S303, the storage unit 130 stores the first learning material 131a in the course database 131. In step S304, the storage unit 130 notifies the first server 110 that the storage operation is completed.
In step S305, the first server 110 transmits the analysis information to the second server 120. In steps S306 and S307, the second server 120 sends a request message to the storage unit 130 to obtain the first learning data 131a at the storage unit 130. The second server 120 adds the first segment flag in the manner described above, and generates a respective first keyword string for each first learning segment. In step S308, the second server 120 uploads the first segment flag and the corresponding first keyword string to the storage unit 130, so that the storage unit 130 stores the first segment flag and the corresponding first keyword string in the analysis database 132. Then, the storage unit 130 transmits the completion information to the second server 120, and transmits the completion information to the first server 110 through the second server 120, so as to display the information of "learning material uploading completion" to the user on the interface of the online learning system.
Please refer to fig. 4, which is a schematic diagram illustrating a data analysis method according to some embodiments of the present disclosure. The data analysis method searches for learning data and learning sections according to an analysis command (e.g., a first search command) through the following steps S401-S408. In step S401, the terminal device 200 sends an analysis command to the first server 110. In steps S402 and S403, the first server 110 transmits a first search command of the analysis commands to the second server 120, and the second server 120 searches the learning data in the storage unit 130 according to the first search command. In step S404, the second server 120 obtains the filtered learning data from the storage unit 130. If there are multiple selected learning data, such as the second learning data 131b and the third learning data 131c, in step S405 and step S406, the second server 120 obtains the behavior data from the behavior database 133 in the storage unit 130 to compare the similarity between the behavior data (e.g., the first keyword string) and the second learning data 131b and the third learning data 131c to generate a search list. Finally, in step S407, the second server 120 transmits the search list to the first server 110, and in step S408, the first server 110 displays the search list on an interface of the online learning system for the user to browse or download.
Although the present disclosure has been described with reference to the above embodiments, it should be understood that various changes and modifications can be made by one skilled in the art without departing from the spirit and scope of the disclosure, and therefore, the scope of the disclosure should be determined by that of the appended claims.

Claims (16)

1. A method of data analysis, comprising:
receiving a first learning material;
adding a plurality of first segment marks into the first learning material to divide the first learning material into a plurality of first learning segments;
generating a first keyword string corresponding to each of the first learning sections according to each of the first learning sections;
receiving an analysis instruction related to user operation;
analyzing the analysis instruction and the first keyword string of each first learning segment to obtain a plurality of first similarities between the analysis instruction and each corresponding first learning segment; and
and screening out one of the first learning sections with the highest similarity from the plurality of first learning sections.
2. The data analysis method as claimed in claim 1, wherein the first learning material comprises a text file, the analysis method further comprising:
analyzing the text file to generate a plurality of characteristic sentences, wherein the characteristic sentences have a precedence relationship; and
and judging the similarity of the adjacent characteristic sentences to generate a plurality of first segment marks.
3. The data analysis method of claim 1, wherein the first learning material comprises a plurality of image frames, the analysis method further comprising:
and judging the similarity of the adjacent image frames to generate a plurality of first segment marks.
4. The data analysis method as claimed in claim 1, wherein the first learning material comprises a text file, the data analysis method further comprising:
analyzing the character file to generate a plurality of characteristic characters;
after the first learning materials are divided into a plurality of first learning sections, the number of the characteristic words in each first learning section is judged; and
the plurality of feature words with a number greater than a threshold are set as the first keyword string.
5. The data analysis method as claimed in claim 1, wherein the analysis command comprises a first search command, the data analysis method further comprising:
analyzing the first search command to generate a first search string; and
calculating the first similarity between the first search command and the first learning segments according to the first search string.
6. The data analysis method of claim 5, further comprising:
one of the first learning sections with the highest similarity and a corresponding one of the first learning data are transmitted to a terminal device, and behavior data are generated according to the transmitted one of the first learning sections and the corresponding one of the first learning data.
7. The data analysis method of claim 6, further comprising:
storing the first learning data into a storage unit, wherein the storage unit further stores a second learning data and a third learning data, the second learning data is divided into a plurality of second learning sections according to a plurality of second segment labels, and each second learning section comprises a second keyword string; the third learning data is divided into a plurality of third learning sections according to a plurality of third segment labels, and each third learning section comprises a third keyword string.
8. The data analysis method of claim 7, further comprising:
receiving a second search command;
according to the second search instruction, screening out the second learning data and the third learning data from the storage unit;
calculating a plurality of second similarities between the first keyword string and the second keyword string and the third keyword string, respectively, according to the behavior data; and
according to the second similarities, the second learning data and the third learning data are sorted to generate a search list.
9. A data analysis system, comprising:
a first server for receiving a first learning data;
a storage unit for receiving and storing the first learning data from the first server; and
a second server for adding a plurality of first segment marks to the first learning data to divide a plurality of first learning segments on the first learning data; the second server is used for respectively generating a first keyword string corresponding to each first learning section according to each of the plurality of first learning sections; the second server is configured to receive an analysis command, analyze the analysis command and the first keyword string of each first learning segment to obtain a plurality of first similarities between the analysis command and each first learning segment, and the second server screens out one of the first learning segments with a highest similarity from the plurality of first learning segments.
10. The data analysis system of claim 9, wherein the first learning data includes a text file, and the second server is configured to analyze the text file to generate a plurality of feature sentences, wherein the feature sentences have a precedence relationship; the second server is further configured to determine similarities between the adjacent feature sentences to generate the first segment marks.
11. The data analysis system of claim 9, wherein the first learning data comprises a plurality of image frames, and the second server is configured to determine similarity between adjacent image frames to generate the first segmentation markers.
12. The data analysis system of claim 9, wherein the first learning data comprises a text file, the second server is used for analyzing the text file to generate a plurality of feature words; after the first learning sections are divided from the first learning data, the second server is configured to determine the number of the plurality of feature words in each first learning section, and set the number of the plurality of feature words larger than a threshold as the first keyword string.
13. The data analysis system of claim 9, wherein the analysis command comprises a first search command; the second server is used for analyzing and processing the first search instruction to generate a first search string, and calculating the first search instruction and the plurality of first similarities corresponding to each first learning section by using the first search string.
14. The data analysis system of claim 13, wherein the second server is configured to transmit one of the first learning sections with the highest similarity and a corresponding one of the first learning data to a terminal device, and generate a behavior data according to the one of the first learning sections and the corresponding one of the first learning data.
15. The data analysis system of claim 14, wherein the storage unit further stores a second learning data and a third learning data, the second learning data is divided into a plurality of second learning sections according to a plurality of second segment labels, and each of the second learning sections comprises a second keyword string; the third learning data is divided into a plurality of third learning sections according to a plurality of third segment labels, and each third learning section comprises a third keyword string.
16. The data analysis system of claim 15, wherein the second server is configured to sift the second learning data and the third learning data from the storage unit according to a second search command; the second server is further configured to calculate a plurality of second similarities between the first keyword string and the second keyword string and between the first keyword string and the third keyword string, respectively, according to the behavior data, and sort the second learning data and the third learning data according to the plurality of second similarities to generate a search list.
CN201910105173.2A 2018-09-07 2019-02-01 Data analysis method and data analysis system Pending CN110889034A (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2019091400A JP2020042771A (en) 2018-09-07 2019-05-14 Data analysis method and data analysis system
SG10201905523TA SG10201905523TA (en) 2018-09-07 2019-06-17 Data analysis method and data analysis system thereof
EP19189383.3A EP3621022A1 (en) 2018-09-07 2019-07-31 Data analysis method and data analysis system thereof
US16/529,818 US11409804B2 (en) 2018-09-07 2019-08-02 Data analysis method and data analysis system thereof for searching learning sections

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201862728082P 2018-09-07 2018-09-07
US62/728,082 2018-09-07

Publications (1)

Publication Number Publication Date
CN110889034A true CN110889034A (en) 2020-03-17

Family

ID=69745778

Family Applications (5)

Application Number Title Priority Date Filing Date
CN201910104946.5A Active CN110891202B (en) 2018-09-07 2019-02-01 Segmentation method, segmentation system and non-transitory computer readable medium
CN201910105172.8A Pending CN110895654A (en) 2018-09-07 2019-02-01 Segmentation method, segmentation system and non-transitory computer readable medium
CN201910105173.2A Pending CN110889034A (en) 2018-09-07 2019-02-01 Data analysis method and data analysis system
CN201910104937.6A Active CN110888896B (en) 2018-09-07 2019-02-01 Data searching method and data searching system thereof
CN201910266133.6A Pending CN110888994A (en) 2018-09-07 2019-04-03 Multimedia data recommendation system and multimedia data recommendation method

Family Applications Before (2)

Application Number Title Priority Date Filing Date
CN201910104946.5A Active CN110891202B (en) 2018-09-07 2019-02-01 Segmentation method, segmentation system and non-transitory computer readable medium
CN201910105172.8A Pending CN110895654A (en) 2018-09-07 2019-02-01 Segmentation method, segmentation system and non-transitory computer readable medium

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN201910104937.6A Active CN110888896B (en) 2018-09-07 2019-02-01 Data searching method and data searching system thereof
CN201910266133.6A Pending CN110888994A (en) 2018-09-07 2019-04-03 Multimedia data recommendation system and multimedia data recommendation method

Country Status (4)

Country Link
JP (3) JP6829740B2 (en)
CN (5) CN110891202B (en)
SG (5) SG10201905236WA (en)
TW (5) TWI709905B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI756703B (en) * 2020-06-03 2022-03-01 南開科技大學 Digital learning system and method thereof
CN117351794A (en) * 2023-10-13 2024-01-05 浙江上国教育科技有限公司 Online course management system based on cloud platform

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1774717A (en) * 2003-04-14 2006-05-17 皇家飞利浦电子股份有限公司 Method and apparatus for summarizing a music video using content analysis
CN104572716A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 System and method for playing video files
CN104978961A (en) * 2015-05-25 2015-10-14 腾讯科技(深圳)有限公司 Audio processing method, device and terminal
CN107256262A (en) * 2017-06-13 2017-10-17 西安电子科技大学 A kind of image search method based on object detection
US20180032910A1 (en) * 2015-02-18 2018-02-01 Hitachi, Ltd. Data Analysis Device, Data Analysis Method, and Data Analysis Program

Family Cites Families (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07311539A (en) * 1994-05-17 1995-11-28 Hitachi Ltd Teaching material edition supporting system
KR100250540B1 (en) * 1996-08-13 2000-04-01 김광수 Studying method of foreign language dictation with apparatus of playing caption video cd
JP2002041823A (en) * 2000-07-27 2002-02-08 Nippon Telegr & Teleph Corp <Ntt> Information distributing device, information receiving device and information distributing system
JP3685733B2 (en) * 2001-04-11 2005-08-24 株式会社ジェイ・フィット Multimedia data search apparatus, multimedia data search method, and multimedia data search program
JP2002341735A (en) * 2001-05-16 2002-11-29 Alice Factory:Kk Broadband digital learning system
CN1432932A (en) * 2002-01-16 2003-07-30 陈雯瑄 English examination and score estimation method and system
TW200411462A (en) * 2002-12-20 2004-07-01 Hsiao-Lien Wang A method for matching information exchange on network
JP4471737B2 (en) * 2003-10-06 2010-06-02 日本電信電話株式会社 Grouping condition determining device and method, keyword expansion device and method using the same, content search system, content information providing system and method, and program
JP4426894B2 (en) * 2004-04-15 2010-03-03 株式会社日立製作所 Document search method, document search program, and document search apparatus for executing the same
JP2005321662A (en) * 2004-05-10 2005-11-17 Fuji Xerox Co Ltd Learning support system and method
JP2006003670A (en) * 2004-06-18 2006-01-05 Hitachi Ltd Educational content providing system
KR20070116945A (en) * 2005-03-31 2007-12-11 코닌클리케 필립스 일렉트로닉스 엔.브이. Augmenting lectures based on prior exams
US9058406B2 (en) * 2005-09-14 2015-06-16 Millennial Media, Inc. Management of multiple advertising inventories using a monetization platform
JP5167546B2 (en) * 2006-08-21 2013-03-21 国立大学法人京都大学 Sentence search method, sentence search device, computer program, recording medium, and document storage device
TW200825900A (en) * 2006-12-13 2008-06-16 Inst Information Industry System and method for generating wiki by sectional time of handout and recording medium thereof
JP5010292B2 (en) * 2007-01-18 2012-08-29 株式会社東芝 Video attribute information output device, video summarization device, program, and video attribute information output method
JP5158766B2 (en) * 2007-10-23 2013-03-06 シャープ株式会社 Content selection device, television, content selection program, and storage medium
TW200923860A (en) * 2007-11-19 2009-06-01 Univ Nat Taiwan Science Tech Interactive learning system
CN101382937B (en) * 2008-07-01 2011-03-30 深圳先进技术研究院 Multimedia resource processing method based on speech recognition and on-line teaching system thereof
US8140544B2 (en) * 2008-09-03 2012-03-20 International Business Machines Corporation Interactive digital video library
CN101453649B (en) * 2008-12-30 2011-01-05 浙江大学 Key frame extracting method for compression domain video stream
JP5366632B2 (en) * 2009-04-21 2013-12-11 エヌ・ティ・ティ・コミュニケーションズ株式会社 Search support keyword presentation device, method and program
JP5493515B2 (en) * 2009-07-03 2014-05-14 富士通株式会社 Portable terminal device, information search method, and information search program
US20110177482A1 (en) * 2010-01-15 2011-07-21 Nitzan Katz Facilitating targeted interaction in a networked learning environment
JP2012038239A (en) * 2010-08-11 2012-02-23 Sony Corp Information processing equipment, information processing method and program
US8839110B2 (en) * 2011-02-16 2014-09-16 Apple Inc. Rate conform operation for a media-editing application
CN102222227B (en) * 2011-04-25 2013-07-31 中国华录集团有限公司 Video identification based system for extracting film images
CN102348049B (en) * 2011-09-16 2013-09-18 央视国际网络有限公司 Method and device for detecting position of cut point of video segment
CN102509007A (en) * 2011-11-01 2012-06-20 北京瑞信在线系统技术有限公司 Method, system and device for multimedia teaching evaluation and multimedia teaching system
JP5216922B1 (en) * 2012-01-06 2013-06-19 Flens株式会社 Learning support server, learning support system, and learning support program
US9846696B2 (en) * 2012-02-29 2017-12-19 Telefonaktiebolaget Lm Ericsson (Publ) Apparatus and methods for indexing multimedia content
US20130263166A1 (en) * 2012-03-27 2013-10-03 Bluefin Labs, Inc. Social Networking System Targeted Message Synchronization
US9058385B2 (en) * 2012-06-26 2015-06-16 Aol Inc. Systems and methods for identifying electronic content using video graphs
TWI513286B (en) * 2012-08-28 2015-12-11 Ind Tech Res Inst Method and system for continuous video replay
CN102937972B (en) * 2012-10-15 2016-06-22 上海外教社信息技术有限公司 A kind of audiovisual subtitle making system and method
WO2014100893A1 (en) * 2012-12-28 2014-07-03 Jérémie Salvatore De Villiers System and method for the automated customization of audio and video media
JP6205767B2 (en) * 2013-03-13 2017-10-04 カシオ計算機株式会社 Learning support device, learning support method, learning support program, learning support system, and server device
TWI549498B (en) * 2013-06-24 2016-09-11 wu-xiong Chen Variable audio and video playback method
KR101537370B1 (en) * 2013-11-06 2015-07-16 주식회사 시스트란인터내셔널 System for grasping speech meaning of recording audio data based on keyword spotting, and indexing method and method thereof using the system
US20150206441A1 (en) * 2014-01-18 2015-07-23 Invent.ly LLC Personalized online learning management system and method
CN104123332B (en) * 2014-01-24 2018-11-09 腾讯科技(深圳)有限公司 The display methods and device of search result
US9892194B2 (en) * 2014-04-04 2018-02-13 Fujitsu Limited Topic identification in lecture videos
US20150293995A1 (en) * 2014-04-14 2015-10-15 David Mo Chen Systems and Methods for Performing Multi-Modal Video Search
US20160239155A1 (en) * 2015-02-18 2016-08-18 Google Inc. Adaptive media
CN105047203B (en) * 2015-05-25 2019-09-10 广州酷狗计算机科技有限公司 A kind of audio-frequency processing method, device and terminal
TWI571756B (en) * 2015-12-11 2017-02-21 財團法人工業技術研究院 Methods and systems for analyzing reading log and documents corresponding thereof
CN105978800A (en) * 2016-07-04 2016-09-28 广东小天才科技有限公司 Method and system for pushing subjects to mobile terminal and server
CN106202453B (en) * 2016-07-13 2020-08-04 网易(杭州)网络有限公司 Multimedia resource recommendation method and device
CN106231399A (en) * 2016-08-01 2016-12-14 乐视控股(北京)有限公司 Methods of video segmentation, equipment and system
CN106331893B (en) * 2016-08-31 2019-09-03 科大讯飞股份有限公司 Real-time caption presentation method and system
CN108122437A (en) * 2016-11-28 2018-06-05 北大方正集团有限公司 Adaptive learning method and device
CN107623860A (en) * 2017-08-09 2018-01-23 北京奇艺世纪科技有限公司 Multi-medium data dividing method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1774717A (en) * 2003-04-14 2006-05-17 皇家飞利浦电子股份有限公司 Method and apparatus for summarizing a music video using content analysis
CN104572716A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 System and method for playing video files
US20180032910A1 (en) * 2015-02-18 2018-02-01 Hitachi, Ltd. Data Analysis Device, Data Analysis Method, and Data Analysis Program
CN104978961A (en) * 2015-05-25 2015-10-14 腾讯科技(深圳)有限公司 Audio processing method, device and terminal
CN107256262A (en) * 2017-06-13 2017-10-17 西安电子科技大学 A kind of image search method based on object detection

Also Published As

Publication number Publication date
CN110891202A (en) 2020-03-17
TW202011231A (en) 2020-03-16
TW202011749A (en) 2020-03-16
TWI696386B (en) 2020-06-11
SG10201906347QA (en) 2020-04-29
CN110888896B (en) 2023-09-05
SG10201905532QA (en) 2020-04-29
CN110888994A (en) 2020-03-17
JP2020042777A (en) 2020-03-19
TWI700597B (en) 2020-08-01
TWI709905B (en) 2020-11-11
TWI725375B (en) 2021-04-21
SG10201905236WA (en) 2020-04-29
TW202011221A (en) 2020-03-16
TW202011222A (en) 2020-03-16
TW202011232A (en) 2020-03-16
CN110895654A (en) 2020-03-20
SG10201905523TA (en) 2020-04-29
TWI699663B (en) 2020-07-21
CN110888896A (en) 2020-03-17
CN110891202B (en) 2022-03-25
JP2020042770A (en) 2020-03-19
JP6829740B2 (en) 2021-02-10
JP2020042771A (en) 2020-03-19
SG10201907250TA (en) 2020-04-29

Similar Documents

Publication Publication Date Title
CN108319723B (en) Picture sharing method and device, terminal and storage medium
KR101335400B1 (en) Identifying comments to show in connection with a document
US10277696B2 (en) Method and system for processing data used by creative users to create media content
CN111274442B (en) Method for determining video tag, server and storage medium
CN111125435B (en) Video tag determination method and device and computer equipment
WO2021098648A1 (en) Text recommendation method, apparatus and device, and medium
KR102488914B1 (en) Method, Device and Program for extract keywords from contents and recommend contents using extracted kewords
US20060190684A1 (en) Reverse value attribute extraction
CN111314732A (en) Method for determining video label, server and storage medium
US20170155939A1 (en) Method and System for Processing Data Used By Creative Users to Create Media Content
CN109063200B (en) Resource searching method and device, electronic equipment and computer readable medium
CN110889034A (en) Data analysis method and data analysis system
US11361759B2 (en) Methods and systems for automatic generation and convergence of keywords and/or keyphrases from a media
US11386163B2 (en) Data search method and data search system thereof for generating and comparing strings
KR101263403B1 (en) Apparatus and method for keyword searching according to priority of inputted word and computer readable medium having stored thereon computer executable instruction for performing the method
US11409804B2 (en) Data analysis method and data analysis system thereof for searching learning sections
CN115203445A (en) Multimedia resource searching method, device, equipment and medium
CN115618873A (en) Data processing method and device, computer equipment and storage medium
CN110942070A (en) Content display method and device, electronic equipment and computer readable storage medium
KR101487297B1 (en) Web page contents confirmation system and method using categoryclassification
CN111274483B (en) Associated recommendation method and associated recommendation interaction method
CN110196940B (en) Method, apparatus, display engine and medium for displaying hotspot network content to user
KR20230144685A (en) Method for Creating Media Contents
CN115062177A (en) Picture searching method and device
CN115114459A (en) Label correction method, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination