CN116796045A - Multi-dimensional book grading method, system and readable medium - Google Patents

Multi-dimensional book grading method, system and readable medium Download PDF

Info

Publication number
CN116796045A
CN116796045A CN202311061921.4A CN202311061921A CN116796045A CN 116796045 A CN116796045 A CN 116796045A CN 202311061921 A CN202311061921 A CN 202311061921A CN 116796045 A CN116796045 A CN 116796045A
Authority
CN
China
Prior art keywords
book
feature vector
classified
semantic
reader
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311061921.4A
Other languages
Chinese (zh)
Other versions
CN116796045B (en
Inventor
毕锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Rentian Bookstore Group Co ltd
Original Assignee
Beijing Rentian Bookstore Group Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Rentian Bookstore Group Co ltd filed Critical Beijing Rentian Bookstore Group Co ltd
Priority to CN202311061921.4A priority Critical patent/CN116796045B/en
Publication of CN116796045A publication Critical patent/CN116796045A/en
Application granted granted Critical
Publication of CN116796045B publication Critical patent/CN116796045B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Abstract

A multi-dimensional book grading method, system and readable medium, it obtains the history reading information of reader; acquiring book topics and book abstracts of a first book to be classified; and determining a reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified. The reading difficulty level obtained in this way has specificity and self-adaptability to the reading capability of a specific reader, so that a proper book is conveniently recommended for the specific reader.

Description

Multi-dimensional book grading method, system and readable medium
Technical Field
The application relates to the technical field of intelligent book classification, and in particular relates to a multi-dimensional book classification method, a system and a readable medium.
Background
The reading capability of each reader is different, and the receiving capability of each reader on books to be read is also different, so that the reading requirements of different readers are also different, namely, the different readers are required to be classified in a targeted way for reading the books.
In the traditional book grading system, books are rated hard through artificially set grading rules, for example, grading is carried out through age stages, and the grading is mass-oriented grading, so that universality is achieved, and reading experience and reading requirements of each reader cannot be well met.
Thus, an optimized book ranking scheme is desired.
Disclosure of Invention
The present application has been made to solve the above-mentioned technical problems. The embodiment of the application provides a multi-dimensional book grading method, a multi-dimensional book grading system and a readable medium, wherein the multi-dimensional book grading system acquires historical reading information of readers; acquiring book topics and book abstracts of a first book to be classified; and determining a reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified. The reading difficulty level obtained in this way has specificity and self-adaptability to the reading capability of a specific reader, so that a proper book is conveniently recommended for the specific reader.
In a first aspect, a multi-dimensional book grading method is provided, comprising:
acquiring historical reading information of a reader;
acquiring book topics and book abstracts of a first book to be classified; and
and determining the reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified.
In a second aspect, there is provided a multi-dimensional book ranking system comprising:
the reading information acquisition module is used for acquiring historical reading information of the reader;
the theme and abstract acquisition module is used for acquiring book theme and book abstract of the first book to be classified; and
the difficulty level label determining module is used for determining the reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader, the book theme and the book abstract of the first book to be classified.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a multi-dimensional book ranking method according to an embodiment of the application.
Fig. 2 is a schematic diagram of a multi-dimensional book grading method according to an embodiment of the application.
FIG. 3 is a flowchart of the sub-steps of step 130 in a multi-dimensional book ranking method according to an embodiment of the application.
FIG. 4 is a block diagram of a multi-dimensional book ranking system according to an embodiment of the application.
Fig. 5 is a schematic view of a scenario of a multi-dimensional book grading method according to an embodiment of the application.
Detailed Description
The following description of the technical solutions according to the embodiments of the present application will be given with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
Unless defined otherwise, all technical and scientific terms used in the embodiments of the application have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used in the present application is for the purpose of describing particular embodiments only and is not intended to limit the scope of the present application.
In describing embodiments of the present application, unless otherwise indicated and limited thereto, the term "connected" should be construed broadly, for example, it may be an electrical connection, or may be a communication between two elements, or may be a direct connection, or may be an indirect connection via an intermediate medium, and it will be understood by those skilled in the art that the specific meaning of the term may be interpreted according to circumstances.
It should be noted that, the term "first\second\third" related to the embodiment of the present application is merely to distinguish similar objects, and does not represent a specific order for the objects, it is to be understood that "first\second\third" may interchange a specific order or sequence where allowed. It is to be understood that the "first\second\third" distinguishing objects may be interchanged where appropriate such that embodiments of the application described herein may be practiced in sequences other than those illustrated or described herein.
It should be appreciated that the conventional book rating system is a method of rating books by artificially set rating rules. Such a rating system is typically conducted on an age basis, with the aim of providing readers of different ages with books that suit their reading capabilities and content interests.
Conventional book rating systems are generally formulated by publishers, educational institutions or related expert organizations and are widely used in the field of book markets and education, and they divide books into different levels or age groups according to the contents of the books, the difficulty of language, the complexity of the episodes, etc.
In conventional book rating systems, symbols like letters, numbers or colors are often used to represent the different ratings, for example, common rating symbols include "Ages 4-8" (4-8 years old apply), "Young add" (teenager apply), and the like. These symbols are typically printed on the front, back, or catalog pages of the book so that readers and purchasers can quickly learn about the applicable objects of the book.
The advantage of conventional book-grading systems is their versatility and easy understanding, providing the reader with a reference to help them select the appropriate reading material among the numerous books. At the same time, the grading system is also helpful for parents, teachers and library administrators to better select proper books for children and teenagers.
However, conventional book rating systems also have some limitations. Since the ranking is based on a generalized age group or stage, it may not meet individual differences and special needs of each reader. Some readers may have a higher reading power at a certain age group, while some readers may have a lower reading power at the same age group. In addition, the content and theme of books may also vary depending on personal interests and the level of development, and conventional rating systems cannot fully take these factors into account.
Therefore, in order to better meet the reading experience and requirements of different readers, it may be considered to employ an optimized book ranking scheme based on personalized reading capabilities to more accurately evaluate the level of the reader's capabilities and provide them with more suitable reading materials according to individual differences.
In the application, a multi-dimensional book classification combined with convolutional neural networks (Convolutional Neural Network, CNN) is provided, which is a method for improving the accuracy and individuation of book classification. Conventional book rating systems are typically classified based on age group or reading ability, but this approach ignores other important features of the book and individual differences of the reader.
Further, conventional book ranking systems focus mainly on the age and reading ability of readers, while ignoring other features of books, such as theme, scenario complexity, emotional tendency, etc. Combining with CNN can evaluate the difficulty of the book and the characteristics of the reader more comprehensively by analyzing the text content and other metadata of the book and considering a plurality of features at the same time.
CNN is a deep learning model with strong feature extraction capability. By training the CNN model, key features in book texts, such as vocabulary, syntax structure, theme and the like, can be automatically learned and extracted. These features can be used to evaluate the difficulty level of the book and match the personalized model of the reader to more accurately determine if the book is suitable for the reader.
The personalized rating result can be realized by combining the multi-dimensional book grading of the CNN, and the individual difference and preference of the readers can be taken into consideration by establishing a personalized model of the readers and training the CNN model, so that book recommendation suitable for the reading capability, interest and preference of each reader is provided for each reader. The personalized grading method can improve the reading experience of readers and better meet the reading requirements of the readers.
The CNN model has a certain interpretability, and the basis of the model for grading books can be understood through a visualization technology. This provides transparency to the book ranking results, allowing readers and related experts to understand how the model is evaluated and recommended, and to verify and adjust the results.
That is, the multi-dimensional book classification combined with the convolutional neural network can improve the classification accuracy, individuation degree and transparency, consider a plurality of characteristics of books and individual differences of readers, provide more accurate and individuated book recommendation for readers, improve reading experience and meet reading requirements.
FIG. 1 is a flow chart of a multi-dimensional book ranking method according to an embodiment of the application. Fig. 2 is a schematic diagram of a multi-dimensional book grading method according to an embodiment of the application. As shown in fig. 1 and 2, the multi-dimensional book grading method includes: 110, acquiring historical reading information of readers; 120, obtaining book topics and book abstracts of a first book to be classified; and 130, determining a reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified.
Wherein, in the step 110, it is ensured that enough historical reading information is obtained so as to accurately evaluate the reading capability and preference of the reader, which may include information on the books read, the reading time length, the reading speed, and the like. By analyzing the historical reading information of the reader, the reading level, the hobbies and the reading preference of the reader can be known. The suitability and difficulty of books can be evaluated in a personalized way, and more accurate book recommendation can be provided for readers.
In the step 120, accurate and comprehensive book theme and summary information is ensured to be acquired, which can be acquired through channels of metadata of books, information provided by publishers, comments of books and the like. The theme and abstract of the book provide important clues about the content and complexity of the book, and by analyzing this information, the characteristics of the book, including difficulty, plot complexity, emotional tendency, etc., can be better understood, thereby grading more accurately.
In the step 130, when the historical reading information of the reader and the information such as the subject matter, abstract and the like of the book to be classified are combined, a proper model or algorithm is established for evaluation and classification. This may involve techniques of feature extraction, machine learning, deep learning, etc. By comprehensively considering individual differences of readers and characteristics of books to be classified, the proper difficulty level label of the books can be determined. This helps to provide personalized book recommendations for readers, improving book classification accuracy and adaptability.
In the above steps, the multi-dimensional book grading method involves obtaining historical reading information of readers, obtaining theme and summary information of books to be graded, and determining reading difficulty level labels of the books based on the information. Points of attention include ensuring accuracy and comprehensiveness of the information, building appropriate models or algorithms for evaluation and classification. The beneficial effects of these methods include providing personalized book recommendations, improving the accuracy and adaptability of the ranking.
Specifically, in the step 110 and the step 120, history reading information of the reader is obtained; and acquiring book topics and book abstracts of the first books to be classified. Aiming at the technical problems, the technical concept of the application is to measure the reading capability of a reader based on the historical reading information of the reader, and further to classify the reading difficulty level of the data to be read based on the reading capability, so that the obtained reading difficulty level has the specificity and the self-adaptability aiming at the reading capability of a specific reader, thereby being convenient for recommending a proper book for the specific reader.
Specifically, in the technical scheme of the application, historical reading information of readers is obtained, and book subjects and book summaries of the first books to be classified are obtained. Wherein, through analyzing the historical reading information of the reader, the reading level, reading speed, understanding ability and the like of the reader can be known, and the information is very important for evaluating the reading difficulty and suitability of the reader. Through the historical reading information of the reader, the reading preference, favorite book types, topics and the like of the reader can be known, the suitability of books can be evaluated in a personalized way, and book recommendation meeting the interests of the reader can be provided.
Further, the book's theme and abstract provide important information about the book's content and theme. By analyzing the information, characteristics of the books such as plot complexity, theme type, emotion tendency and the like can be known, so that the difficulty and suitability of the books can be evaluated more accurately. The topic and abstract information of the book can also be used to extract text features of the book, such as vocabulary, syntactic structures, etc., which are helpful in assessing the difficulty and suitability of the book.
By combining historical reading information of readers and information such as themes, abstracts and the like of books to be classified, the proper difficulty level label of the books can be determined, personalized book recommendation is provided for the readers, and the reading capability and interest of the books and the readers are ensured to be matched. By comprehensively considering individual differences of readers and characteristics of books to be classified, the reading difficulty level of the books can be more accurately determined, the accuracy of book classification is improved, the readers can select books suitable for the readers, and reading experience and understanding capability are improved.
The method comprises the steps of obtaining historical reading information of readers and book topics and book abstracts of a first book to be classified, and playing a key role in determining reading difficulty level labels of the books to be classified for the readers. The information is helpful for knowing the reading capability, preference and interest of readers, and provides important clues for book contents and features, so that personalized book recommendation is realized and grading accuracy is improved.
Specifically, in the step 130, based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified, a reading difficulty level label of the first book to be classified for the reader is determined. FIG. 3 is a flowchart illustrating the substep of step 130 in the multi-dimensional book grading method according to an embodiment of the present application, as shown in FIG. 3, for determining a reading difficulty level tag of the first book to be graded for the reader based on the historical reading information of the reader and the book subject and the book abstract of the first book to be graded, including: 131, carrying out semantic coding on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified so as to obtain a semantic understanding feature vector of the historical reading information and a semantic coding feature vector of the first book to be classified; 132, performing feature interaction on the semantic understanding feature vector of the historical reading information and the semantic coding feature vector of the first book to be classified to obtain a self-adaptive coding feature vector of the first book reading difficulty; and 133, determining a reading difficulty level label of the first book to be classified for the reader based on the self-adaptive coding feature vector of the reading difficulty of the first book.
Firstly, converting historical reading information of readers, topics of a first book to be ranked, abstracts and other text information into semantic representations, including Word embedding (such as Word2Vec and GloVe) and pre-trained language models (such as BERT and GPT). The text information is converted into a numerical feature vector through semantic coding so as to facilitate subsequent feature interaction and model training. These feature vectors capture the semantic and contextual information of the text.
Then, the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified are subjected to feature interaction, and various methods such as splicing, weighted addition, attention mechanism and the like can be adopted. Therefore, the information of different features can be fused, and the expression capability and the discrimination capability of the features are improved. The self-adaptive coding feature vector of the reading difficulty of the first book is obtained through feature interaction, historical reading information of readers and semantic features of the first book to be classified are comprehensively considered, and the proper difficulty degree of the book is reflected.
Then, a classification model or regression model can be established to predict the reading difficulty level label of the first book to be classified for the reader by utilizing the self-adaptive coding feature vector of the reading difficulty of the first book, and the label can be discrete difficulty level (such as primary level, medium level and high level) or continuous difficulty scoring. By determining the reading difficulty level label of the first book to be classified, personalized book recommendation can be provided for readers, the reading capability and interest matching of the books and the readers are ensured, and the reading experience and understanding capability are improved.
Through the steps, based on the historical reading information of the reader and the theme and abstract of the first book to be classified, the reading difficulty self-adaptive coding feature vector of the first book can be obtained through semantic coding and feature interaction, and the reading difficulty grade label of the book is determined based on the feature vector. These steps help to achieve personalized book recommendation and improve the accuracy of the classification, providing book selection meeting the reader's needs.
For said step 131, it comprises: after word segmentation is carried out on the historical reading information of the reader, semantic understanding feature vectors of the historical reading information are obtained through a semantic encoder comprising a word embedding layer; and performing word segmentation on the book theme and the book abstract of the first book to be classified, and then obtaining the semantic coding feature vector of the first book to be classified through the semantic coder containing the word embedding layer.
Wherein the semantic encoder is a converter-based Bert model.
And then, carrying out semantic coding on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified so as to obtain a semantic understanding feature vector of the historical reading information and a semantic coding feature vector of the first book to be classified. The method comprises the steps of obtaining historical reading information of a reader, and book topics and book abstracts of a first book to be classified, and then carrying out semantic understanding on the historical reading information of the reader, the book topics and the book abstracts of the first book to be classified so as to obtain historical reading information of the reader, semantic feature representations of the book topics and the book abstracts of the first book to be classified, namely semantic understanding feature vectors of the historical reading information and semantic coding feature vectors of the first book to be classified.
In a specific example of the present application, the process of semantically encoding the historical reading information of the reader and the book theme and the book abstract of the first book to be classified to obtain the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified includes: firstly, performing word segmentation on historical reading information of a reader, and then obtaining a semantic understanding feature vector of the historical reading information through a semantic encoder comprising a word embedding layer; and performing word segmentation on the book theme and the book abstract of the first book to be classified, and then obtaining the semantic coding feature vector of the first book to be classified through the semantic coder containing the word embedding layer. In this particular example, the semantic encoder is a transformer-based Bert model.
For the step 132, it includes: and carrying out feature interaction between the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified based on an attention mechanism by using an inter-feature attention interaction layer so as to obtain a first book reading difficulty self-adaptive coding feature vector.
It should be understood that, in the technical solution of the present application, the semantic understanding feature vector of the historical reading information may reflect the reading capability of the reader, and the semantic encoding feature vector of the first book to be classified may reflect the reading difficulty of the first book to be classified. Based on the above, in the technical scheme of the application, the reading difficulty level label of the first book to be classified for the reader is further determined based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified.
Specifically, firstly, performing feature interaction on the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified to obtain a first book reading difficulty self-adaptive encoding feature vector, namely, performing feature level data interaction on the reading capacity of a reader and the reading difficulty of the first book to be classified in a high-dimensional feature space to obtain a collaborative representation between the two, namely, the first book reading difficulty self-adaptive encoding feature vector.
In a specific technical scheme of the application, an attention interaction layer is used for carrying out attention mechanism-based feature interaction between the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified so as to obtain the first book reading difficulty self-adaptive coding feature vector, and the process can be expressed as follows by a formula:
for the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be ranked, and />It is transformed into two feature spaces s and t to calculate the degree between them:
wherein ,is a learned weight matrix corresponding to the 1 x 1 convolution in fig. 1, i is an index of the output location, j represents an index of all possible locations.
In particular, in the technical solution of the present application, the attention mechanism caused by the attention layer between features is mainly different from the traditional attention mechanism in that: the goal of traditional attention mechanisms is to learn an attention weight matrix, applied to the individual neural nodes of the current layer, giving them greater weight for those important nodes and less weight for those secondary nodes. Because each neural node contains certain characteristic information, the neural network can select information which is more critical to the current task target from a plurality of characteristic information through the operation. The attention mechanism provided by the application is different, and more attention is paid to the dependency relationship among the characteristic information, namely, the characteristic level dependency and interaction relationship between the semantic understanding characteristic vector of the history reading information and the semantic encoding characteristic vector of the first book to be classified.
Further, the inter-feature attention interaction layer is an attention mechanism for modeling the degree of association between different features during feature interaction. And using an inter-feature attention interaction layer to realize feature interaction between the semantic understanding feature vector of the historical reading information and the semantic coding feature vector of the first book to be classified so as to obtain the reading difficulty self-adaptive coding feature vector of the first book.
The inter-feature attention interaction layer is used for adaptively fusing the feature representations of the historical reading information and the feature of the first book to be classified according to the association degree between the two features. By introducing an attention mechanism, features can be weighted according to importance between different features, thereby better capturing relevance and importance between features. Therefore, the expression capability of the features can be improved, and the obtained self-adaptive coding feature vector of the first book reading difficulty can more accurately reflect the proper difficulty degree of the book.
For said step 133, it comprises: carrying out homogeneous Gilbert space metric dense point distribution sampling fusion on the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified to obtain a fusion feature vector; fusing the fused feature vector and the first book reading difficulty self-adaptive coding feature vector to obtain a classification feature vector; and the classification feature vector passes through a classifier to obtain a classification result, wherein the classification result is used for representing the reading difficulty grade label of the first book to be classified for the reader.
The historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified are fused into a fusion feature vector through homogeneous Gilbert space metric type dense point distribution sampling fusion. Thus, the information of different characteristics can be comprehensively utilized, and the expression capacity and the discrimination capacity of the characteristics are improved.
And fusing the fusion feature vector with the self-adaptive coding feature vector of the first book reading difficulty to obtain a classification feature vector. The fusion mode can adaptively fuse information of different features according to specific tasks and data characteristics, and better captures correlation and importance degree among the features.
And obtaining the classification feature vector by fusing the feature vector and the first book reading difficulty self-adaptive coding feature vector. The feature vector comprehensively considers the historical reading information, the semantic features of the first book to be classified and the reading difficulty self-adaptive coding features, and can evaluate the difficulty of the book and the characteristics of suitable readers more comprehensively.
And training and predicting by using the classification feature vector as input through a classifier to obtain a classification result. The classification feature vector fused with a plurality of features can provide more accurate information, and is beneficial to improving the performance and accuracy of the classifier.
Comprehensive classification feature vectors can be obtained through homogeneous Gilbert space metric type dense point distribution sampling fusion and feature fusion, and the reading difficulty level label for readers is obtained through training and prediction by a classifier. The method can more accurately evaluate the difficulty of the book and the characteristics of the proper readers, provide personalized book recommendation, promote reading experience and meet reading requirements.
In particular, in the technical scheme of the application, because the attention mechanism-based feature interaction between the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector is performed by using the attention interaction layer, the dependency relationship between the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector is extracted, and for the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector, the expression of the historical reading information of the reader and the text semantic features of the book subject and the book abstract of the to-be-classified first book by the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector can be insufficient, so that the applicant of the application considers further fusing the text semantic feature expressions of the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector to strengthen the feature expressions of the first book reading difficulty adaptive coding feature vector.
And, the applicant of the present application examinesConsidering that the historical reading information semantic understanding feature vector and the to-be-classified first book semantic coding feature vector are respectively historical reading information of the reader and closely coded dense feature collection type text semantic expression of book subjects and book summaries of the to-be-classified first book, the historical reading information semantic understanding feature vector is recorded asAnd the first book semantic coding feature vector to be ranked, e.g. denoted +.>The homogeneous Gilbert space metric dense point distribution sampling fusion is carried out, and the method is specifically expressed as follows: carrying out homogeneous Gilbert space metric dense point distribution sampling fusion on the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified by using the following optimization formula to obtain a fused feature vector; wherein, the optimization formula is:
wherein ,represent Min distance and +.>Is super-parameter (herba Cinchi Oleracei)>Is the semantic understanding feature vector of the history reading information, < >>Is the semantic coding feature vector of the first book to be classified,> and />The semantic understanding feature vector of the history reading information is +.>And the first book semantic coding feature vector to be classified +.>Is characterized by the global feature mean value of the history reading information semantic understanding feature vector +.>And the first book semantic coding feature vector to be classified +.>For row vector +.>Is the transpose vector of the semantically encoded feature vector of the first book to be ranked,/or%> and />Representing addition by position and multiplication by position, +.>Is the fusion feature vector.
Here, the feature vector is understood semantically by reading the information on the historyAnd the first book semantic coding feature vector to be classified +.>For semantic understanding of feature vectors for said history read information>And the semantic code of the first book to be classifiedFeature vector->The fusion feature distribution of the cross-distance constraint point-by-point feature association is used as a bias term to realize feature dense point sampling pattern distribution fusion in the association constraint limit of the feature distribution, so that the homogeneous sampling association fusion among vectors is enhanced. Then, the fusion feature vector is added again>And fusing the self-adaptive coding feature vector of the first book reading difficulty with the self-adaptive coding feature vector of the first book reading difficulty, so that the feature expression of the self-adaptive coding feature vector of the first book reading difficulty can be improved.
Further, fusing the fused feature vector and the first book reading difficulty adaptive coding feature vector to obtain a classification feature vector, including: fusing the fusion feature vector and the first book reading difficulty self-adaptive coding feature vector by the following fusion formula to obtain a classification feature vector; wherein, the fusion formula is:
wherein ,representing the fusion feature vector and the first book reading difficulty adaptive coding feature vector,/I>Representing a cascade function->Representing the classification feature vector.
And determining a reading difficulty level label of the first book to be classified for the reader based on the self-adaptive coding feature vector of the reading difficulty of the first book. In a specific example of the application, the first book reading difficulty adaptive coding feature vector is passed through a classifier to obtain a classification result, wherein the classification result is used for representing a reading difficulty level label.
In summary, the multi-dimensional book classification method 100 according to the embodiment of the application is illustrated, measures the reading capability of a reader based on the historical reading information of the reader, and classifies the reading difficulty level of the data to be read based on the reading capability, so that the obtained reading difficulty level has specificity and adaptability to the reading capability of a specific reader, thereby facilitating the recommendation of a proper book for the specific reader.
In one embodiment of the present application, FIG. 4 is a block diagram of a multi-dimensional book ranking system according to an embodiment of the present application. As shown in fig. 4, the multi-dimensional book rating system 200 according to an embodiment of the present application includes: a reading information obtaining module 210, configured to obtain historical reading information of a reader; a theme and abstract obtaining module 220, configured to obtain book theme and book abstract of the first book to be classified; and a difficulty level tag determining module 230, configured to determine a reading difficulty level tag of the first book to be classified for the reader based on the historical reading information of the reader and the book subject and the book abstract of the first book to be classified.
Here, it will be understood by those skilled in the art that the specific functions and operations of the respective units and modules in the above-described multi-dimensional book grading system have been described in detail in the above description of the multi-dimensional book grading method with reference to fig. 1 to 3, and thus, repetitive descriptions thereof will be omitted.
As described above, the multi-dimensional book rating system 200 according to the embodiment of the present application may be implemented in various terminal devices, such as a server for multi-dimensional book rating, etc. In one example, the multi-dimensional book rating system 200 according to embodiments of the present application may be integrated into the terminal device as a software module and/or hardware module. For example, the multi-dimensional book rating system 200 may be a software module in the operating system of the terminal device, or may be an application developed for the terminal device; of course, the multi-dimensional book rating system 200 may also be one of a number of hardware modules of the terminal device.
Alternatively, in another example, the multi-dimensional book rating system 200 and the terminal device may be separate devices, and the multi-dimensional book rating system 200 may be connected to the terminal device through a wired and/or wireless network and transmit interactive information in a agreed data format.
In one embodiment of the application, a readable medium is provided, characterized in that it has stored thereon computer program instructions, which when executed by a processor, cause the processor to perform a multi-dimensional book ranking method as described.
It should be appreciated that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the forms of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects may be utilized. Furthermore, the computer program product may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
Methods, systems, and computer program products of embodiments of the present application are described in the flow diagrams and/or block diagrams. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Fig. 5 is a schematic view of a scenario of a multi-dimensional book grading method according to an embodiment of the application. As shown in fig. 5, in the application scenario, first, history reading information of a reader (e.g., C1 as illustrated in fig. 5) and book subjects and book summaries of a first book to be classified (e.g., C2 as illustrated in fig. 5) are acquired; the obtained historical reading information, book theme of the first book and book abstract are then input into a server (e.g., S as illustrated in fig. 5) deployed with a multi-dimensional book ranking algorithm, wherein the server is capable of processing the historical reading information, book theme of the first book and book abstract based on the multi-dimensional book ranking algorithm to determine a reading difficulty level tag of the first book to be ranked for the reader.
It is also noted that in the apparatus, devices and methods of the present application, the components or steps may be disassembled and/or assembled. Such decomposition and/or recombination should be considered as equivalent aspects of the present application.
The previous description of the disclosed aspects is provided to enable any person skilled in the art to make or use the present application. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects without departing from the scope of the application. Thus, the present application is not intended to be limited to the aspects shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
The foregoing description has been presented for purposes of illustration and description. Furthermore, this description is not intended to limit embodiments of the application to the form disclosed herein. Although a number of example aspects and embodiments have been discussed above, a person of ordinary skill in the art will recognize certain variations, modifications, alterations, additions, and subcombinations thereof.

Claims (10)

1. A multi-dimensional book grading method, comprising:
acquiring historical reading information of a reader;
acquiring book topics and book abstracts of a first book to be classified; and
and determining the reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified.
2. The multi-dimensional book grading method according to claim 1, wherein determining a reading difficulty level tag of the first book to be graded for the reader based on the historical reading information of the reader and the book subject and the book abstract of the first book to be graded, comprises:
carrying out semantic coding on the historical reading information of the reader and the book theme and the book abstract of the first book to be classified so as to obtain a semantic understanding feature vector of the historical reading information and a semantic coding feature vector of the first book to be classified;
performing feature interaction on the semantic understanding feature vector of the historical reading information and the semantic coding feature vector of the first book to be classified to obtain a self-adaptive coding feature vector of the first book reading difficulty; and
and determining a reading difficulty grade label of the first book to be graded for the reader based on the self-adaptive coding feature vector of the reading difficulty of the first book.
3. The multi-dimensional book classification method according to claim 2, wherein semantically encoding the historical reading information of the reader and the book theme and the book abstract of the first book to be classified to obtain a semantic understanding feature vector of the historical reading information and a semantic encoding feature vector of the first book to be classified, comprises:
after word segmentation is carried out on the historical reading information of the reader, semantic understanding feature vectors of the historical reading information are obtained through a semantic encoder comprising a word embedding layer; and
and performing word segmentation on the book theme and the book abstract of the first book to be classified, and then obtaining the semantic coding feature vector of the first book to be classified through the semantic coder containing the word embedding layer.
4. The multi-dimensional book grading method according to claim 3, wherein the semantic encoder is a converter-based Bert model.
5. The method of claim 4, wherein performing feature interaction on the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified to obtain a first book reading difficulty adaptive coding feature vector, comprises:
and carrying out feature interaction between the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified based on an attention mechanism by using an inter-feature attention interaction layer so as to obtain a first book reading difficulty self-adaptive coding feature vector.
6. The multi-dimensional book grading method according to claim 5, wherein determining a reading difficulty level label of the first book to be graded for the reader based on the first book reading difficulty adaptive coding feature vector comprises:
carrying out homogeneous Gilbert space metric dense point distribution sampling fusion on the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified to obtain a fusion feature vector;
fusing the fused feature vector and the first book reading difficulty self-adaptive coding feature vector to obtain a classification feature vector; and
and the classification feature vector passes through a classifier to obtain a classification result, wherein the classification result is used for representing the reading difficulty grade label of the first book to be classified for the reader.
7. The method of claim 6, wherein performing homogeneous gilbert spatial metric dense point distribution sampling fusion on the historical reading information semantic understanding feature vector and the first book semantic coding feature vector to be classified to obtain a fused feature vector, comprises: carrying out homogeneous Gilbert space metric dense point distribution sampling fusion on the semantic understanding feature vector of the historical reading information and the semantic encoding feature vector of the first book to be classified by using the following optimization formula to obtain a fused feature vector;
wherein, the optimization formula is:
wherein ,represent Min distance and +.>Is super-parameter (herba Cinchi Oleracei)>Is the semantic understanding feature vector of the history reading information, < >>Is the semantic coding feature vector of the first book to be classified,> and />The semantic understanding feature vector of the history reading information is +.>And the first book semantic coding feature vector to be classified +.>Is characterized by the global feature mean value of the history reading information semantic understanding feature vector +.>And the first book semantic coding feature vector to be classified +.>As a row vector it is possible to provide,is the transpose vector of the semantically encoded feature vector of the first book to be ranked,/or%> and />Representing addition by position and multiplication by position, +.>Is the fusion feature vector.
8. The multi-dimensional book ranking method of claim 7, wherein fusing the fused feature vector and the first book reading difficulty adaptive coding feature vector to obtain a classification feature vector comprises: fusing the fusion feature vector and the first book reading difficulty self-adaptive coding feature vector by the following fusion formula to obtain a classification feature vector;
wherein, the fusion formula is:
wherein ,representing the fused feature vector and the first book reading difficulty adaptive coding feature vector,representing a cascade function->Representing the classification feature vector.
9. A multi-dimensional book grading system, comprising:
the reading information acquisition module is used for acquiring historical reading information of the reader;
the theme and abstract acquisition module is used for acquiring book theme and book abstract of the first book to be classified; and
the difficulty level label determining module is used for determining the reading difficulty level label of the first book to be classified for the reader based on the historical reading information of the reader, the book theme and the book abstract of the first book to be classified.
10. A readable medium having stored thereon computer program instructions which, when executed by a processor, cause the processor to perform the multi-dimensional book grading method according to any of claims 1 to 8.
CN202311061921.4A 2023-08-23 2023-08-23 Multi-dimensional book grading method, system and readable medium Active CN116796045B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311061921.4A CN116796045B (en) 2023-08-23 2023-08-23 Multi-dimensional book grading method, system and readable medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311061921.4A CN116796045B (en) 2023-08-23 2023-08-23 Multi-dimensional book grading method, system and readable medium

Publications (2)

Publication Number Publication Date
CN116796045A true CN116796045A (en) 2023-09-22
CN116796045B CN116796045B (en) 2023-11-10

Family

ID=88050053

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311061921.4A Active CN116796045B (en) 2023-08-23 2023-08-23 Multi-dimensional book grading method, system and readable medium

Country Status (1)

Country Link
CN (1) CN116796045B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117033733A (en) * 2023-10-09 2023-11-10 北京民谐文化传播有限公司 Intelligent automatic classification and label generation system and method for library resources
CN117457134A (en) * 2023-11-06 2024-01-26 广东信拓人工智能研究院有限公司 Medical data management method and system based on intelligent AI

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112347239A (en) * 2020-11-12 2021-02-09 方正株式(武汉)科技开发有限公司 Book matching method, system, server and storage medium based on big data
CN113590970A (en) * 2021-08-11 2021-11-02 东北林业大学 Personalized digital book recommendation system and method based on reader preference, computer and storage medium
CN113610680A (en) * 2021-08-17 2021-11-05 山西传世科技有限公司 AI-based interactive reading material personalized recommendation method and system
CN113935332A (en) * 2021-11-02 2022-01-14 海信集团控股股份有限公司 Book grading method and book grading equipment
CN114818921A (en) * 2022-04-26 2022-07-29 北京思源智通科技有限责任公司 Multi-dimensional book grading method, system, readable medium and equipment
CN115630140A (en) * 2022-11-02 2023-01-20 云南师范大学 English reading material difficulty judgment method based on text feature fusion
US20230196000A1 (en) * 2021-12-21 2023-06-22 Woongjin Thinkbig Co., Ltd. System and method for providing personalized book
CN116401466A (en) * 2023-06-08 2023-07-07 北京奇趣万物科技有限公司 Book classification recommendation method and system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112347239A (en) * 2020-11-12 2021-02-09 方正株式(武汉)科技开发有限公司 Book matching method, system, server and storage medium based on big data
CN113590970A (en) * 2021-08-11 2021-11-02 东北林业大学 Personalized digital book recommendation system and method based on reader preference, computer and storage medium
CN113610680A (en) * 2021-08-17 2021-11-05 山西传世科技有限公司 AI-based interactive reading material personalized recommendation method and system
CN113935332A (en) * 2021-11-02 2022-01-14 海信集团控股股份有限公司 Book grading method and book grading equipment
US20230196000A1 (en) * 2021-12-21 2023-06-22 Woongjin Thinkbig Co., Ltd. System and method for providing personalized book
CN114818921A (en) * 2022-04-26 2022-07-29 北京思源智通科技有限责任公司 Multi-dimensional book grading method, system, readable medium and equipment
CN115630140A (en) * 2022-11-02 2023-01-20 云南师范大学 English reading material difficulty judgment method based on text feature fusion
CN116401466A (en) * 2023-06-08 2023-07-07 北京奇趣万物科技有限公司 Book classification recommendation method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘园园和李雅琴: "基于图书语义特征的推荐模型", 《武汉工程大学学报》, vol. 45, no. 03, pages 321 - 322 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117033733A (en) * 2023-10-09 2023-11-10 北京民谐文化传播有限公司 Intelligent automatic classification and label generation system and method for library resources
CN117033733B (en) * 2023-10-09 2023-12-22 北京民谐文化传播有限公司 Intelligent automatic classification and label generation system and method for library resources
CN117457134A (en) * 2023-11-06 2024-01-26 广东信拓人工智能研究院有限公司 Medical data management method and system based on intelligent AI

Also Published As

Publication number Publication date
CN116796045B (en) 2023-11-10

Similar Documents

Publication Publication Date Title
CN116796045B (en) Multi-dimensional book grading method, system and readable medium
Xu et al. Modal-adversarial semantic learning network for extendable cross-modal retrieval
CN110874439B (en) Recommendation method based on comment information
CN110751188B (en) User label prediction method, system and storage medium based on multi-label learning
CN111563166A (en) Pre-training model method for mathematical problem classification
Qian et al. Generating accurate caption units for figure captioning
CN111325571B (en) Automatic generation method, device and system for commodity comment labels for multitask learning
Kathuria et al. Real time sentiment analysis on twitter data using deep learning (Keras)
CN115390806A (en) Software design mode recommendation method based on bimodal joint modeling
Chaudhary et al. Signnet ii: A transformer-based two-way sign language translation model
CN113486174A (en) Model training, reading understanding method and device, electronic equipment and storage medium
CN117494760A (en) Semantic tag-rich data augmentation method based on ultra-large-scale language model
CN116911929A (en) Advertisement service terminal and method based on big data
CN116663566A (en) Aspect-level emotion analysis method and system based on commodity evaluation
CN113741759B (en) Comment information display method and device, computer equipment and storage medium
Shah et al. A study of various word embeddings in deep learning
CN115186085A (en) Reply content processing method and interaction method of media content interaction content
Gao et al. Sentiment classification of time-sync comments: A semi-supervised hierarchical deep learning method
Kulkarni et al. Applied Generative AI for Beginners
CN115130453A (en) Interactive information generation method and device
Sarkar et al. Sentiment analysis
Prabhumoye Controllable Text Generation and Ethical Implications
CN116958997B (en) Graphic summary method and system based on heterogeneous graphic neural network
CN113255342B (en) Method and system for identifying product name of 5G mobile service
Ivezić et al. Trends and Challenges of Text-to-Image Generation: Sustainability Perspective

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant