WO2021218432A1 - Method and apparatus for interpreting picture book, electronic device and smart robot - Google Patents

Info

Publication number
WO2021218432A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture book
reader
round
questions
interpretation
Prior art date
Application number
PCT/CN2021/080269
Other languages
French (fr)
Chinese (zh)
Inventor
周琦
张冲
吴鹤松
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2021218432A1

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00 Electrically-operated educational appliances
    • G09B5/04 Electrically-operated educational appliances with audible presentation of the material to be studied
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/20 Education

Definitions

  • the present invention relates to the technical field of picture book interpretation, and in particular to a method, device, electronic equipment and intelligent robot for picture book interpretation.
  • the embodiments of the present application provide a method and a device for interpreting picture books, an electronic device, and an intelligent robot.
  • the present application provides a method for interpreting picture books, which is executed by a terminal device. The method includes: determining the current picture book being read by the reader; acquiring historical information of the current picture book, the historical information including the number of times the reader has interpreted the current picture book and the positive answer rate of each round of questions asked of the reader in each past interpretation; and asking the reader a first round of questions, the first round of questions being determined based on the historical information.
  • each round of questions refers to the following: the multiple questions associated with the picture book are divided into N sets according to certain rules and sorted in a certain order; the questions in the next set are asked only after the questions in the previous set have been asked. These sets constitute the round-by-round questions.
  • the number of interpretations of the current picture book by the reader before that and the positive answer rate of each round of questions raised during each interpretation are obtained from the resource library.
  • the reader is asked round questions that match their level of understanding, so as to avoid the first round of questions asked while interpreting the picture book being beyond the reader's comprehension or too simple.
  • the method further includes: receiving the reader's answer to each question in the first round of questions; and, when the positive answer rate of the answers is greater than a set threshold, asking the reader a second round of questions, the questions in the second round being different from the questions in the first round.
  • the method further includes: the sequence of the first round of questions and the second round of questions is set according to the degree of difficulty.
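  • As a minimal sketch of how a starting round could be derived from the historical information, the Python below averages the positive answer rate of each round over past interpretations and skips every round whose average exceeds the threshold. The `History` class, the function name `choose_first_round`, the default threshold of 0.8 and the averaging rule are all illustrative assumptions; the source does not specify the exact selection rule.

```python
from dataclasses import dataclass, field


@dataclass
class History:
    # One list per past interpretation; each inner list holds the positive
    # answer rate (0.0 to 1.0) of every round asked in that interpretation.
    # This layout is an assumption made for illustration.
    sessions: list = field(default_factory=list)


def choose_first_round(history: History, total_rounds: int,
                       threshold: float = 0.8) -> int:
    """Return the 0-based index of the round to ask first (hypothetical rule)."""
    start = 0
    for r in range(total_rounds):
        # Collect the rates recorded for round r in every past interpretation.
        rates = [s[r] for s in history.sessions if r < len(s)]
        # Stop at the first round that was never asked or not yet mastered.
        if not rates or sum(rates) / len(rates) <= threshold:
            break
        start = r + 1
    # Never go past the last (most difficult) round.
    return min(start, total_rounds - 1)
```

    With no history the sketch starts at the easiest round; a reader who has already passed the first round on average starts at the second.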
  • before the acquiring of the historical information of the current picture book, the method further includes: acquiring the identity information of the reader; and the acquiring of the historical information of the current picture book includes: acquiring the historical information of the current picture book corresponding to the identity information.
  • a data package of at least one picture book is stored in the resource library, the data package including the name of the picture book, the page numbers of the picture book, the content corresponding to each page number, at least one round of questions corresponding to the content, and the answer to each question in the at least one round of questions. The at least one picture book includes the current picture book being read by the reader, and the at least one round of questions includes the first round of questions and the second round of questions.
  • the picture book designer enters the text prepared for each picture book into the memory in advance, so that the terminal device can still interpret the picture book for the reader even when it is not connected to the Internet.
  • the method further includes: determining whether the current picture book is stored in a resource library; when the resource library does not contain the current picture book, determining whether the content of the current picture book is the same as the content of a first picture book stored in the resource library; and when the content of the current picture book is the same as the content of the first picture book, asking the reader the round questions corresponding to the first picture book.
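  • The fallback described here can be sketched as a two-step lookup. In the Python below, the resource library is modelled as a plain dictionary keyed by picture book name, and "same content" is modelled as exact string equality; both are simplifying assumptions, and `find_round_questions` is a hypothetical name.

```python
def find_round_questions(book_name, book_content, library):
    """Return the round questions for a book, or None if nothing matches.

    library maps a picture book name to a dict with the keys "content"
    (the book text) and "rounds" (a list of question lists). This layout
    is assumed for illustration only.
    """
    # Step 1: the current picture book itself is stored in the library.
    entry = library.get(book_name)
    if entry is not None:
        return entry["rounds"]
    # Step 2: fall back to a stored picture book with the same content.
    for other in library.values():
        if other["content"] == book_content:
            return other["rounds"]
    # No usable picture book: the device cannot ask round questions.
    return None
```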
  • the method further includes: determining the page number of the current picture book that the reader is reading; and asking the reader the round questions about the content corresponding to the page number.
  • the determining of the current picture book being read by the reader includes: acquiring, through a microphone, voice information of the reader reading the current picture book; and converting the voice information into picture book text, the picture book text being used to determine, from the resource library, the current picture book being read by the reader.
  • the determining of the current picture book being read by the reader includes: acquiring, through a camera, an image of the current picture book being read by the reader; and identifying features in the image to obtain a picture book feature value, the picture book feature value being used to determine, from the resource library, the current picture book being read by the reader.
  • before it is determined that the positive answer rate of the reader's answers to the first round of questions is greater than a set threshold, the method further includes: obtaining, through a microphone, voice information of the reader's answer to a question; and converting the voice information into answer text, the answer text being used to determine, from the resource library, whether the reader's answer to a first question in the first round of questions is correct, the first round of questions including the first question.
  • before it is determined that the positive answer rate of the reader's answers to the first round of questions is greater than a set threshold, the method further includes: acquiring an image of the reader's action or gesture through a camera; and identifying features in the image of the action or gesture to obtain an answer feature value, the answer feature value being used to determine, from the resource library, whether the reader's answer to a second question in the first round of questions is correct, the first round of questions including the second question.
  • the method further includes: incrementing, in the historical information, the number of times the reader has interpreted the current picture book, and storing the positive answer rate of the reader's answers to the questions in each round during this interpretation.
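  • A minimal sketch of this bookkeeping step follows, assuming the historical information is kept in a dictionary keyed by a (reader, picture book) pair. The key layout, the field names `count` and `sessions`, and the function name are hypothetical.

```python
def record_interpretation(history, reader_id, book_id, round_rates):
    """Add one finished interpretation to the historical information.

    history maps (reader_id, book_id) to a dict holding the number of
    interpretations and the per-round positive answer rates of each one.
    """
    entry = history.setdefault((reader_id, book_id),
                               {"count": 0, "sessions": []})
    # Increment the number of interpretations of this picture book.
    entry["count"] += 1
    # Store the positive answer rate of every round asked this time.
    entry["sessions"].append(list(round_rates))
    return entry
```

    Keying the records by reader as well as by picture book keeps the histories of different readers separate.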
  • before the first round of questions is asked to the reader, the method includes: receiving an interpretation instruction, where the interpretation instruction is used to instruct the terminal device to ask the reader the first round of questions.
  • after the terminal device determines the picture book that the reader is currently reading, the reader sometimes does not want that part of the content interpreted, or wants it interpreted after a period of time. The terminal device is therefore controlled by instructions that determine whether and when to interpret, so as to meet the reader's needs.
  • an embodiment of the present application provides a picture book interpretation device, including: a transceiver, a processor, and a memory; the transceiver is used to receive and send data; the memory stores one or more programs, the one or more programs include instructions, and when the instructions are executed by the processor, the electronic device executes each possible implementation solution in the first aspect.
  • an embodiment of the present application provides an electronic device, including: a camera and/or a microphone, a memory, and a processor that executes each possible implementation of the first aspect.
  • an embodiment of the present application provides an intelligent robot, including: a camera and/or a microphone, for receiving voice information or image information of a reader reading a picture book, and voice information or image information of the reader's answer to a question corresponding to the picture book; a memory, for storing the second information of at least one picture book, as well as, for each picture book, the number of times the reader has interpreted the picture book in the past and the positive answer rate of each round of questions asked in each past interpretation; a processor, for processing the voice information or image information obtained by the camera and/or the microphone, determining from the memory the current picture book being read by the reader, and determining the round of questions to ask the reader based on the number of interpretations of the current picture book in the memory and the average of the positive answer rates obtained for each round of questions across interpretations; a speaker, for playing voice to the reader; and a communication unit, for receiving the second information of each picture book and the positive-answer-rate threshold for moving from the current round to the next round.
  • the embodiments of the present application provide a readable storage medium for storing instructions. When the instructions are executed, each possible implementation of the first aspect is realized.
  • the embodiments of the present application provide a computer program device containing instructions, which when running on a terminal, enables each possible implementation of the first aspect to be implemented.
  • FIG. 1 is an architecture diagram of an application system for interpreting picture books provided by an embodiment of the application
  • FIG. 2 is a schematic structural diagram of a terminal device provided by an embodiment of this application.
  • FIG. 3 is a schematic diagram of different gestures representing different numbers provided by an embodiment of the application.
  • FIG. 4 is a schematic diagram of a template for making a picture book provided by an embodiment of the application.
  • FIG. 5 is a schematic diagram of a template for making a picture book provided by an embodiment of the application.
  • Fig. 6 is a schematic diagram of a template for making a picture book provided by an embodiment of the application.
  • FIG. 7 is a flowchart of an interpretation method provided by an embodiment of this application.
  • FIG. 8 is a schematic diagram of an image displayed on a touch screen according to an embodiment of the application.
  • FIG. 9 is a schematic structural diagram of a terminal device provided by an embodiment of the application.
  • FIG. 1 is an architecture diagram of an application system for interpreting picture books provided by an embodiment of the application. As shown in Figure 1, the system architecture includes picture books, terminal equipment and readers.
  • the picture book can be a paper book, or a tablet, kindle, and other devices that can be used by readers to read stories.
  • terminal devices include, but are not limited to, smart devices such as tablet computers and smart phones. For example, it can also include intelligent robots independently developed for various specific business scenarios, and so on.
  • each story in the picture book, a series of questions corresponding to each story, and an answer corresponding to each question are stored. The series of questions corresponding to each story is divided into several rounds according to difficulty, and the terminal device asks the reader the rounds in order from easy to difficult.
  • when the terminal device assists the reader in interpreting the picture book, it first determines the story that the reader is reading and finds the questions and answers corresponding to the story in the database. After the reader finishes reading the story, it asks the reader a round of questions of suitable difficulty, chosen according to the number of times the reader has interpreted the picture book in the past and the positive answer rate in each round, so that the first round of questions asked during an interpretation neither exceeds the reader's comprehension ability nor is too simple.
  • it is then determined whether to ask questions that are more difficult than those of the first round, so as to gradually guide the reader's understanding of the story and improve the reader's reading comprehension skills.
  • Fig. 2 is a schematic structural diagram of a terminal device provided by an embodiment of the application. As shown in FIG. 2, the terminal device includes an input unit 1, an output unit 2, a processing unit 3, a storage unit 4, and an input unit 5.
  • the terminal device needs to be able to perceive the current user's identity, for example by face recognition, voiceprint recognition, or a user account and password, and to obtain the current user's personal information either from user input or through smarter methods (such as inferring the user's gender and age from the face). All personal data generated while the user uses the device is stored and recorded under the user's account.
  • the input unit 1 includes a microphone 11 and a camera 12. Among them, the microphone 11 is used to collect voice information, and the camera 12 is used to collect image information.
  • the terminal device can obtain the voice information of the reader reading the story in the picture book through the microphone 11, so as to determine the story read by the reader.
  • the terminal device may also obtain the voice information of the reader's answer to the question raised by the terminal device through the microphone 11, so as to obtain the reader's answer to the question.
  • the terminal device can obtain the page of the picture book that the reader is reading through the camera 12, and identify the story read by the reader according to the content of the page.
  • the terminal device can also acquire the reader's motion or gesture through the camera 12, so as to recognize the reader's answer to the question based on the reader's motion or gesture.
  • the output unit 2 includes a speaker 21.
  • the terminal device can use the speaker 21 to play recordings of other people reading the story that the reader is reading, and to broadcast the question corresponding to that story, the correct answer to the question, words of encouragement for the reader, and so on.
  • the processing unit 3 includes an automatic speech recognition (ASR) unit 31.
  • ASR technology is a technology that converts human speech into text.
  • the ASR unit 31 cooperates with the microphone 11 to convert the voice information obtained by the microphone 11, namely the reader reading the story and the reader answering a question raised by the terminal device, into the corresponding text. The processing unit 3 then searches the database for the corresponding story or answer according to the obtained text.
  • the processing unit 3 also includes a vision processing unit 32.
  • the vision processing unit 32 cooperates with the camera 12 to process the image information obtained by the camera 12, and then extract the required features in the image.
  • the visual processing unit 32 includes a picture book image recognition unit 321, a picture book click recognition unit 322 and a gesture recognition unit 323.
  • when the picture book content designer enters a picture book in advance, the feature values of the cover and of each inner page image of the entered picture book are extracted, a unique identity document (ID) for the picture book is generated from the feature values, and the picture book ID is associated with the feature values and finally stored in the storage unit 4.
  • the picture book image recognition unit 321 is used, after the camera 12 acquires an image of the page that the reader is reading, to identify feature values such as the current picture book cover and the specific page number, and then to compare them against the storage unit 4 to find the corresponding picture book ID and page ID, thereby determining the picture book that the reader is reading and the current page.
  • the picture book image recognition unit 321 uses a scale-invariant feature transform (SIFT) algorithm to detect or describe local features in the image.
  • the picture book click recognition unit 322 is used for detecting the position of the reader's finger click area by taking a video frame image including the reader's finger through the camera 12.
  • the picture book click recognition unit 322 first preprocesses the collected image (that is, performs processing such as noise reduction on the hand-shaped area in the image and excludes areas with obvious skin color differences); it then extracts the edges in the image, in particular the edge of the convex area (that is, it extracts the image of the finger according to the shape and contour of the finger area); finally, from the collected image and the extracted finger image, it determines the area of the picture book that the reader clicks.
  • the gesture recognition unit 323 is used to recognize the actions or gestures of the reader in the image, and then determine the answer indicated by the reader according to the corresponding gesture.
  • gesture recognition methods include, but are not limited to, recognizing gestures based on geometric features, through gesture edges (such as contours) and gesture area features (such as palm color, area, etc.).
  • the designer of the picture book content can define "1, 2, 3, 4, 5, 6, 7, 8, 9, 10" according to the number of fingers and the specific gestures of the fingers.
  • readers can use gestures to answer questions.
  • the terminal device asks the reader: How many small animals are there on this screen?
  • the reader can use any gesture as shown in Figure 3 as an answer.
  • the gesture recognition unit 323 determines the answer of the reader's reply by recognizing the gesture displayed by the reader.
  • the processing unit 3 also includes a reading result calculation unit 33 and an interpretation round calculation unit 34.
  • the reading result calculation unit 33 is used to calculate the correct rate of the reader after answering all the questions in a round, and then store the calculated result in the storage unit 4.
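  • The correct rate of a round can be sketched as the fraction of questions whose answer matches the stored answer. In the Python below, answers are compared as case-insensitive, whitespace-trimmed strings; that comparison rule, the dictionary layout and the name `correct_rate` are assumptions, since the source does not say how an answer is judged correct.

```python
def correct_rate(reader_answers, stored_answers):
    """Return the fraction of a round's questions answered correctly.

    Both arguments map a question ID to an answer string. A question the
    reader did not answer counts as incorrect.
    """
    if not stored_answers:
        raise ValueError("a round must contain at least one question")

    def normalize(text):
        # Illustrative matching rule: ignore case and surrounding spaces.
        return text.strip().lower()

    correct = sum(
        1 for qid, expected in stored_answers.items()
        if normalize(reader_answers.get(qid, "")) == normalize(expected)
    )
    return correct / len(stored_answers)
```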
  • the interpretation round calculation unit 34 is used to determine the round at which the reader should interpret the picture book this time, according to the correct rate of the reader's answers in the current round, the number of times the reader has interpreted the picture book and similar picture books as recorded in the storage unit 4, and the calculation rule selected when the picture book was entered. All personal data generated by the reader during reading (such as reading times, reading rounds, and Q&A results) is associated with the reader's account, so that the data of different readers is isolated and reading rounds are calculated per reader.
  • the storage unit 4 includes a database 41 and a resource library 42.
  • the database 41 is used to store data processed by the processing unit 3 on voice information and image information.
  • the resource library 42 is used to store the data entered by the designer of the picture book content.
  • the database 41 is divided, according to the type of data stored, into a picture book image feature database 411, a first picture book reading record database 412, and a second picture book reading record database 413.
  • the picture book image feature database 411 is used to extract the image of the cover and each page of each picture book in the resource library 42 and identify the feature value, and then associate the image feature value with the data of the picture book and the corresponding page.
  • the picture book developer extracts the cover and the image of each page of the picture book entered in the resource library 42, recognizes the feature values, and stores them in the picture book image feature database 411; each feature value is then associated with the corresponding cover and page content stored in the resource library 42 and with data such as the questions and answers for each round.
  • the specific association relationship is as follows:
  • Table 1 The association table between the feature values of the cover of the picture book and the image of each page and the data stored in the resource library
  • after the processing unit 3 obtains, through the camera 12, an image of the cover or of a page of the picture book that the reader is reading, the visual processing unit 32 processes the image to obtain its feature value, which is then sent to the picture book image feature database 411 and compared with the feature values stored there. If a stored feature value matches the sent feature value, the processing unit 3 uses the association relationship of the matching feature value to obtain, from the resource library 42, the corresponding picture book or page content and data such as the questions and answers for each round.
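  • The comparison against stored feature values can be sketched as a nearest-neighbour search. The Python below treats each feature value as a fixed-length numeric vector and accepts the closest stored vector if it lies within a distance tolerance. The vector representation, the Euclidean metric, the tolerance and the name `match_page` are all assumptions; real SIFT descriptors are matched per keypoint rather than per image.

```python
import math


def match_page(query, feature_db, max_distance=0.5):
    """Return the (book_id, page_id) whose stored feature is closest.

    feature_db maps (book_id, page_id) to a feature vector of the same
    length as query. Returns None when nothing is close enough, meaning
    the picture book or page is not recognised.
    """
    best_key, best_dist = None, math.inf
    for key, stored in feature_db.items():
        dist = math.dist(query, stored)  # Euclidean distance
        if dist < best_dist:
            best_key, best_dist = key, dist
    return best_key if best_dist <= max_distance else None
```

    The returned pair plays the role of the picture book ID and page ID used to fetch content, questions and answers from the resource library.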
  • the first picture book reading record database 412 is used to record the information of each picture book being interpreted, that is, when the picture book is interpreted, the correct rate of each round calculated by the reading result calculation unit 33 is used to generate a piece of interpretation information.
  • the information includes the correct rate of each round of a picture book in an interpretation process.
  • the format of the interpretation information stored in the first picture book reading record database 412 is as follows:
  • each time the terminal device interprets the picture book for the reader, it generates a piece of interpretation information to record the reader's understanding of the picture book during that interpretation.
  • when the picture book is interpreted again, the interpretation round calculation unit 34 determines, according to the historical interpretation information of the picture book, the round at which to start directly.
  • the second picture book reading record database 413 is used to record the interpretation information of each picture book by type. Compared with the interpretation information stored in the first picture book reading record database 412, the interpretation information stored in the second picture book reading record database 413 adds a type classification: the interpretation information is divided into number learning interpretation information, letter learning interpretation information, etc., as follows:
  • the resource library 42 is used to store at least one picture book.
  • the stored picture book content includes the basic information of the picture book (name, picture book type, subtype, number of pages, cover picture, specific page number picture, etc.), the picture book cover and the images of each page, image feature values, the text and text images of each page, the questions and answers for each round corresponding to each page, the round pass calculation criteria, and other information.
  • the processing unit 3 obtains information related to the picture book for interpretation from the resource library 42 after determining the picture book read by the reader and the page currently being read.
  • the input unit 5 is used to download the content of the picture book and then input it into the resource library 42.
  • the input unit 5 may be a physical interface such as a USB interface, a Type-C interface, etc., or a wireless communication module such as a WiFi module, a Bluetooth module, etc., which is not limited in this application.
  • the picture book developer needs to make the picture book content in the development tool in advance, and then store it in the resource library 42 through the input unit 5.
  • picture book developers make picture book content through a dedicated APP, the cloud, etc. (this application uses an APP as an example). After the APP is opened, it displays the template shown in Figure 4, which includes options for the name of the picture book, the author, the total number of interpretation rounds, the name of the picture book series, the picture book classification, the picture book sub-category, and the cover picture. The picture book developer fills in the options in the template according to the picture book to be entered.
  • each option in the template can be divided into required items and non-required items.
  • Required items need to be filled in by the picture book developer, such as picture book name, author, cover image and other options.
  • Non-required items do not need to be filled in, such as picture book classification, picture book sub-category and other options.
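  • The split between required and non-required items can be sketched as a simple validation step before a picture book is saved. The field names below (`name`, `author`, `cover_image`) are hypothetical stand-ins for the required options named in the text.

```python
# Hypothetical keys for the required options named in the text:
# picture book name, author and cover image.
REQUIRED_FIELDS = ("name", "author", "cover_image")


def missing_required(template):
    """Return the required fields that are absent or empty in a template."""
    return [f for f in REQUIRED_FIELDS if not template.get(f)]
```

    A template with only non-required items left blank passes the check; one without an author does not.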
  • the template includes at least one round of interpretation (this application takes three rounds of interpretation as an example), at least one picture book page (this application takes a picture book with 15 pages as an example), reading content, question and answer interaction and other options.
  • the options on each page include three options: the first interpretation, the second interpretation, and the third interpretation, and each interpretation option includes two options: reading content and Q&A interaction.
  • when the picture book developer produces the content of the first round of interpretation of the first page of the picture book "Dad, Don't Be Afraid", selecting the "Page 1" option from the 15 page options brings up the three interpretation options; selecting the "first round of interpretation" option among them then brings up the two options for reading content and Q&A interaction.
  • the picture book developer uploads the image of the first page of the picture book "Dad, Don't Be Afraid" and the text content recorded on the first page to the "Read aloud" option, and then uploads the questions to be asked and their answers to the "Q&A interaction" option.
  • the options on each page include two options: reading content and Q&A interaction.
  • each of the two options, reading content and Q&A interaction, includes three options: the first interpretation, the second interpretation, and the third interpretation.
  • when the picture book developer makes the first round of interpretation of the first page of the picture book "Dad, Don't Be Afraid", selecting the "Page 1" option from the 15 page options brings up the two options for reading content and Q&A interaction; selecting the "Read aloud" option of the two brings up the three interpretation options; after selecting the "first round of interpretation" option among them, the picture book developer uploads the image of the first page of the picture book "Dad, Don't Be Afraid" and the text content recorded on the first page to the "Read aloud" option; selecting the "Q&A interaction" option of the two likewise brings up the three interpretation options; after selecting the "first round of interpretation" option among them, the picture book developer uploads the image on the first page of the picture book "Dad,
  • the template includes options such as entering the next round of interpretation rules and the average number of forward answers.
  • the "Enter the next round of interpretation rules" option offers a choice between two alternatives: "Method 1: in the first round of interpretation Q&A interaction for this picture book, the number of the reader's positive answers exceeds" a set value, and "Method 2: more than 2 picture books of the same series have been read, and in the first round of interpretation Q&A interaction the average number of the reader's positive answers exceeds" a set value. If the picture book being made is a series picture book, select "Method 1"; if it is a non-series picture book, select "Method 2". Then select the value in the "average number of positive answers" option; the terminal device enters the second round of interpretation only if the positive answer rate of the reader's replies is greater than the set value.
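  • The two rules for entering the next round can be sketched as a single predicate. In the Python below, the function name, the parameter names and the use of plain counts of positive answers are assumptions made for illustration; Method 2 is read as requiring more than two picture books of the same series before the average is considered.

```python
def may_enter_next_round(method, current_positive, series_positives, threshold):
    """Decide whether the reader may move on to the second round.

    method:            1 or 2, as selected when the picture book was made
    current_positive:  positive answers in this book's first round
    series_positives:  first-round positive answers for each picture book
                       of the same series already read (used by method 2)
    threshold:         value the (average) number must exceed
    """
    if method == 1:
        # Method 1: judge this picture book's first round on its own.
        return current_positive > threshold
    if method == 2:
        # Method 2: needs more than 2 books of the series to have been read.
        if len(series_positives) <= 2:
            return False
        average = sum(series_positives) / len(series_positives)
        return average > threshold
    raise ValueError("method must be 1 or 2")
```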
  • the picture book image feature database 411 extracts the images of the cover and of each page of the picture book, identifies the feature values, generates IDs for the cover and for each page, associates each feature value with its ID (the specific relationship is shown in Table 1), and stores the result in the picture book image feature database 411.
  • in the following, an intelligent robot is used as an example of the terminal device.
  • FIG. 7 is a work flow chart of a terminal device provided by an embodiment of the application to assist a reader in interpreting a picture book.
  • Step S701 Determine the current picture book that the reader is reading.
  • when the reader wants the intelligent robot to accompany them in interpreting the story, the reader issues a picture book interpretation instruction to the intelligent robot by voice, key press, or the like, to start the intelligent robot.
  • the intelligent robot determines the current user's identity information (through voiceprint, password, fingerprint, face information, etc.), and then turns on the microphone 11 and/or the camera 12 to obtain the voice of the reader reading the picture book and/or an image of the current page of the picture book that the reader is reading.
  • the microphone 11 of the intelligent robot collects the voice information of the reader, and then the ASR unit 31 in the processing unit 3 converts the collected voice into corresponding text and extracts multiple keywords from the converted text. The keywords are compared with the corresponding keywords of each picture book stored in the resource library 42 in the storage unit 4. If the keywords converted from the collected voice information match the corresponding keywords of a picture book in the resource library 42, it indicates that the reader is reading that picture book, so as to determine the picture book that the reader is reading and the page number corresponding to the content being read.
  • if the keywords converted from the collected voice information have no corresponding keywords in any picture book in the resource library 42, it means that the reader is not "reading the story" or that the resource library 42 does not store the picture book being read, so the intelligent robot cannot interpret the story for the reader.
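The keyword comparison described above can be sketched as a set-overlap lookup. This is an illustrative sketch, not the method of the application: the `ResourceLibrary` layout, the function name `find_book_and_page`, the `min_overlap` parameter, and the sample titles are all assumptions.

```python
from typing import Dict, Optional, Set, Tuple

# Hypothetical resource library layout: title -> page number -> page keywords.
ResourceLibrary = Dict[str, Dict[int, Set[str]]]


def find_book_and_page(
    spoken_keywords: Set[str],
    library: ResourceLibrary,
    min_overlap: int = 2,
) -> Optional[Tuple[str, int]]:
    """Return the (title, page) whose keywords overlap most with the speech.

    Returns None when no page shares at least min_overlap keywords, which
    corresponds to the case where the robot cannot interpret the story.
    """
    best: Optional[Tuple[str, int]] = None
    best_score = 0
    for title, pages in library.items():
        for page, keywords in pages.items():
            score = len(spoken_keywords & keywords)
            if score >= min_overlap and score > best_score:
                best, best_score = (title, page), score
    return best
```

Requiring a minimum overlap is a design choice of this sketch; it keeps a single common word from being treated as a match.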
  • each picture book stored in the resource library 42 is made in the manner shown in Figures 4-6 and the corresponding description content, which will not be repeated here in this application.
  • the camera 12 of the intelligent robot collects images or videos, and then the vision processing unit 32 in the processing unit 3 performs recognition on the collected images (if it is a video, the vision processing unit 32 processes each frame of the video separately) to obtain multiple feature values in the current image. These are then compared with the feature values of the corresponding images of each picture book stored in the picture book image feature value database 411 in the storage unit 4. If the feature values of the captured image or video match the corresponding feature values of a picture book in the resource library, it indicates that the reader is reading that picture book, so as to determine the picture book the reader is reading and the page number corresponding to the content of the picture book.
  • the intelligent robot in the embodiment of the present application can also turn on the microphone 11 and the camera 12 at the same time and obtain voice information and image or video information simultaneously, so as to more accurately determine the picture book the reader is reading and the page number corresponding to the content of the picture book.
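The application does not say how feature values are compared. As one possible illustration, the sketch below matches a captured feature vector against stored vectors by cosine similarity. The similarity measure, the `min_similarity` threshold, the ID strings, and the function names are all assumptions introduced here.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_page_id(
    features: Sequence[float],
    database: Dict[str, Sequence[float]],
    min_similarity: float = 0.9,
) -> Optional[str]:
    """Return the stored cover/page ID most similar to the captured features.

    database maps a hypothetical cover/page ID to its stored feature values.
    Returns None when nothing reaches min_similarity.
    """
    best_id: Optional[str] = None
    best_similarity = min_similarity
    for page_id, stored in database.items():
        similarity = cosine_similarity(features, stored)
        if similarity >= best_similarity:
            best_id, best_similarity = page_id, similarity
    return best_id
```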
  • Step S703 Obtain historical information of the current picture book.
  • Step S705 Ask the reader the first round of questions.
  • After determining the picture book the reader is reading and the page number corresponding to the content of the picture book, the intelligent robot starts the picture book interpretation work.
  • the interpretation round calculation unit 34 in the processing unit 3 obtains data about the historical reading situation of the picture book from the first picture book reading record database 412 of the storage unit 4.
  • when the obtained data contains no record of the picture book, the picture book is being interpreted for the first time: the processing unit 3 obtains the first-round interpretation questions of the picture book from the resource library 42 and then asks the reader these questions through the speaker 21.
  • when the interpretation round calculation unit 34 obtains one piece of data in the following table, the processing unit 3 obtains an average positive answer rate of 60% for the first round of questions, obtains the second-round interpretation questions of the picture book from the resource library 42, and then asks the reader these questions through the speaker 21.
  • when the interpretation round calculation unit 34 obtains two pieces of data in the following table, the picture book has been interpreted twice in history. In one interpretation, the positive answer rate of the reader in the first round was 60% (assuming the positive answer rate required to advance from the first round to the second round is 60%), the positive answer rate in the second round was 10%, and there is no positive answer rate for the third round. In the other interpretation, the positive answer rate of the reader was 60% in the first round and 60% in the second round. The processing unit 3 obtains an average positive answer rate of 60% for the first round of questions and 35% for the second round of questions, obtains the second-round interpretation questions of the picture book from the resource library 42, and then asks the reader these questions through the speaker 21. Other situations can be deduced by analogy.
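The round selection described above (average each round's positive answer rate over past interpretations, then start at the first round whose average has not reached the threshold) can be sketched as follows. This is a sketch under stated assumptions: the record layout, the function names, and the three-round default are hypothetical, and because the worked examples advance at exactly 60% with a 60% threshold, the sketch treats reaching the threshold as sufficient.

```python
from typing import List, Optional, Sequence

# One record per past interpretation: the positive answer rate of each round,
# with None where that round was never reached (hypothetical layout).
Record = Sequence[Optional[float]]


def average_rates(history: Sequence[Record], rounds: int) -> List[Optional[float]]:
    """Average the recorded rate of each round over all past interpretations."""
    averages: List[Optional[float]] = []
    for r in range(rounds):
        rates = [rec[r] for rec in history if r < len(rec) and rec[r] is not None]
        averages.append(sum(rates) / len(rates) if rates else None)
    return averages


def starting_round(
    history: Sequence[Record], rounds: int = 3, threshold: float = 0.6
) -> int:
    """Return the 1-based round whose questions should be asked first."""
    current = 1
    # Only the rounds before the last one can unlock a later round.
    for average in average_rates(history, rounds)[: rounds - 1]:
        if average is None or average < threshold:
            break
        current += 1
    return current
```

With no history the result is round 1; with the two records from the example, `[0.6, 0.1, None]` and `[0.6, 0.6, None]`, the averages are 60% and 35%, so the second round is asked again.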
  • the interpretation round calculation unit 34 in the processing unit 3 obtains data about the historical reading situation of the picture book from the second picture book reading record database 413 of the storage unit 4. Suppose the current reader is studying "English picture books". When the data obtained by the interpretation round calculation unit 34 contains no record of the "English picture book" type for the picture book, it indicates that the "English picture book" type of the picture book is being interpreted for the first time; the processing unit 3 obtains the first-round interpretation questions of the "English picture book" type of the picture book from the resource library 42 and then asks the reader these questions through the speaker 21.
  • when the interpretation round calculation unit 34 obtains four pieces of data in the following table, the "English picture book" type of the picture book has been interpreted twice in history. In the first interpretation, the positive answer rate of the reader in the first round was 50% (assuming that the positive answer rate required to advance from the first round to the second round is 60%), and there is no positive answer rate for the second or third round. In the second interpretation, the positive answer rate of the reader was 70% in the first round and 60% in the second round. The processing unit 3 obtains an average positive answer rate of 60% for the first round of questions and 60% for the second round of questions, obtains the third-round interpretation questions of the "English picture book" type of the picture book from the resource library 42, and then asks the reader these questions through the speaker 21. Other situations are analogous.
  • the reader can let the intelligent robot enter the specified type of the picture book through voice instructions, instructions input on the screen, etc.
  • Step S707 Receive the reader's answer to each question in the first round of questions.
  • Step S709 When the positive answer rate of the answers to the questions is greater than the set threshold, ask the reader the second round of questions.
  • the microphone 11 and/or the camera 12 are turned on for a set time period to obtain the reader's response to the question.
  • the microphone 11 of the intelligent robot collects the voice information of the reader, and then the ASR unit 31 in the processing unit 3 converts the collected voice into corresponding text and extracts multiple keywords from the converted text. The keywords are compared with the answers to the corresponding questions stored in the resource library 42 in the storage unit 4.
  • if the keywords converted from the collected voice information match the keywords of the answer to the corresponding question in the resource library 42, it indicates that the reader's answer is correct. The processing unit 3 can broadcast voices such as "the answer is correct" or "You are great, you answered correctly" to the reader through the speaker 21, so that the reader knows the answer is correct.
  • if the keywords converted from the collected voice information do not match the keywords of the answer to the corresponding question in the resource library 42, it indicates that the reader's answer is wrong. The processing unit 3 can broadcast voices such as "Answer Wrong" or "Think about it again, is there a better answer" to the reader through the speaker 21, so that the reader knows the answer is wrong, and then broadcast the correct answer.
  • the microphone 11 can also be turned on again to allow the reader to answer again before the processing unit 3 broadcasts the correct answer through the speaker 21.
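The spoken-answer check above can be illustrated with a small keyword comparison on the text produced by speech recognition. This is a simplified sketch: the stop-word list, the function names, the rule that one shared keyword is enough, and the exact feedback strings are assumptions, and the speech recognition step itself is outside the sketch.

```python
import re
from typing import Set

# Illustrative stop words only; a real system would use a fuller list.
STOP_WORDS = {"a", "an", "the", "is", "it", "i", "think", "was"}


def extract_keywords(text: str) -> Set[str]:
    """Lower-case the recognized text and keep the non-stop words."""
    words = re.findall(r"[a-z']+", text.lower())
    return {word for word in words if word not in STOP_WORDS}


def check_reply(reply_text: str, answer_keywords: Set[str], correct_answer: str) -> str:
    """Return the feedback sentence to broadcast for a recognized reply."""
    expected = {keyword.lower() for keyword in answer_keywords}
    if extract_keywords(reply_text) & expected:
        return "The answer is correct"
    return f"Answer wrong. The correct answer is: {correct_answer}"
```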
  • the camera 12 of the intelligent robot collects images or videos, and then the vision processing unit 32 in the processing unit 3 performs recognition on the collected images (if it is a video, the vision processing unit 32 processes each frame of the video separately) to obtain multiple feature values in the current image. When the reader's hand and the combination of fingers are identified from the multiple feature values, the specific number the reader is expressing is determined and then compared with the answers to the corresponding questions stored in the resource library 42 in the storage unit 4.
  • if the determined number is the same as the answer to the corresponding question in the resource library 42, it indicates that the reader's answer is correct. The processing unit 3 can broadcast voices such as "the answer is correct" or "you are great, the answer is correct" to the reader through the speaker 21, so that the reader knows the answer is correct. If the determined number is not the same as the answer to the corresponding question in the resource library 42, it indicates that the reader's answer is wrong. The processing unit 3 can broadcast voices such as "Answer Wrong" or "Think about it again, is there a better answer" to the reader through the speaker 21, so that the reader knows the answer is wrong, and then broadcast the correct answer.
  • the intelligent robot in the embodiment of the present application can also turn on the microphone 11 and the camera 12 to work at the same time, and obtain voice information, image or video information at the same time, so as to more accurately determine the answer of the reader.
  • the processing unit 3 may present the question on the touch screen in the form of a picture. For example, the intelligent robot asks the reader "Who do you think will become a bunny in this story?". After understanding the question, the reader can tap objects such as "Daddy Bear" or "Little Bear" on the screen. The processing unit 3 then compares the position tapped by the reader with the answer to the corresponding question stored in the resource library 42 to determine whether the tapped position is correct.
  • if the tapped position is correct, the reader's answer is correct, and the reader is told so through a broadcast from the speaker 21 or a display on the touch screen; if the tapped position is wrong, the reader's answer is wrong, and the reader is told so through a broadcast from the speaker 21 or a display on the touch screen.
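Comparing a tapped position with the stored answer amounts to a hit test against the on-screen regions of the candidate objects. The sketch below assumes rectangular regions in pixel coordinates; the `Region` type, the function names, and the sample layout are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle occupied by one object on the touch screen."""
    left: int
    top: int
    width: int
    height: int

    def contains(self, x: int, y: int) -> bool:
        return (self.left <= x < self.left + self.width
                and self.top <= y < self.top + self.height)


def tapped_object(x: int, y: int, regions: Dict[str, Region]) -> Optional[str]:
    """Return the name of the object under the tap, or None for empty space."""
    for name, region in regions.items():
        if region.contains(x, y):
            return name
    return None


def is_tap_correct(x: int, y: int, regions: Dict[str, Region], answer: str) -> bool:
    """True when the tapped object is the stored answer to the question."""
    return tapped_object(x, y, regions) == answer
```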
  • the reading result calculation unit 33 in the processing unit 3 counts the questions the reader answered correctly in this round and then calculates the correct rate. If the correct rate does not exceed the set threshold, the intelligent robot does not enter the second round of interpretation; if the correct rate exceeds the set threshold, the intelligent robot enters the second round of interpretation.
  • the processing unit 3 will store the picture book, type, interpretation round, and positive answer rate of each round of interpretation in the format of Table 2 and Table 3.
  • subsequent readers can refer to the result of this interpretation to determine the round that they directly enter when re-interpreting.
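Storing one interpretation's results so that later interpretations can consult them might look like the sketch below. The record fields loosely follow the items listed above (picture book, type, round, positive answer rate), but the class names, the in-memory list, and the three-round default are assumptions; the actual layouts of Table 2 and Table 3 are not reproduced here.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Sequence


@dataclass
class InterpretationRecord:
    book: str
    book_type: Optional[str]
    round_rates: List[Optional[float]]  # None for rounds that were not reached


@dataclass
class ReadingRecordStore:
    """In-memory stand-in for a picture book reading record database."""
    records: List[InterpretationRecord] = field(default_factory=list)

    def save(
        self,
        book: str,
        book_type: Optional[str],
        answers_per_round: Sequence[Sequence[bool]],
        total_rounds: int = 3,
    ) -> InterpretationRecord:
        """Compute each round's positive answer rate and append a record."""
        rates: List[Optional[float]] = [
            sum(answers) / len(answers) if answers else None
            for answers in answers_per_round
        ]
        rates += [None] * (total_rounds - len(rates))
        record = InterpretationRecord(book, book_type, rates)
        self.records.append(record)
        return record

    def history(self, book: str, book_type: Optional[str] = None) -> List[InterpretationRecord]:
        """Past records for a book, optionally restricted to one type."""
        return [
            r for r in self.records
            if r.book == book and (book_type is None or r.book_type == book_type)
        ]
```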
  • the processing unit 3 may not need to include the picture book cover when generating the interpretation information. In this way, it is not necessary to determine the interpretation round based on the reading situation of each picture book rigidly. Instead, the interpretation round of the current picture book can be determined based on whether there has been a reading record similar to the picture book.
  • the first picture book reading record database 412 contains one piece of interpretation information for each of the two books, and the second picture book reading record database 413 records a set of interpretation information.
  • the average positive answer rate of reading picture books: (0.7 + 0.8) / 2 = 75%.
  • the picture book interpretation method classifies the questions for interpreting a picture book by degree of difficulty. If the reader is reading the picture book for the first time, the reader is asked the simplest type of questions; if the historical information indicates that the reader has already done several interpretations, the reader is asked the questions of the corresponding round. In this way the reader is intelligently asked reasonable questions and is effectively helped to understand the picture book. After one type of question has been asked, whether to ask the next, more difficult type of question is determined according to the correct rate of the reader's answers, so as to gradually guide the reader to understand the story and improve the reader's reading comprehension ability.
  • FIG. 9 is a schematic structural diagram of a terminal device according to an embodiment of the present invention.
  • the electronic device 900 may be the aforementioned intelligent robot, and includes a sensor 901, a display 902, a processor 903, a memory 904, a communication interface 905, and a bus 906.
  • the processor 903, the memory 904, and the communication interface 905 in the electronic device can establish a communication connection through the bus 906.
  • the sensor 901 is used to obtain the reader's voice information and image or video information, and to send audio and video information.
  • the sensor 901 may include a camera, a microphone, a speaker, and so on.
  • the display 902 is used to display processed data, such as videos and images.
  • the processor 903 may be a central processing unit (CPU).
  • the memory 904 may include a volatile memory, such as a random-access memory (RAM); the memory may also include a non-volatile memory, such as a read-only memory (ROM), flash memory, a hard disk drive (HDD), or a solid state drive (SSD); the memory 904 may also include a combination of the foregoing types of memories.
  • the interpretation methods provided in the foregoing embodiments are all executed by the processor 903. Data such as pictures, voices, and picture book content will be stored in the memory 904.
  • the memory 904 will also be used to store program instructions executed by the processor 903 for implementing the picture book interpretation method described in the foregoing embodiments, and so on.
  • various storage media described herein may represent one or more devices and/or other machine-readable media for storing information.
  • the term "machine-readable medium” may include, but is not limited to, wireless channels and various other media capable of storing, containing, and/or carrying instructions and/or data.
  • the above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
  • when implemented using software, they can be implemented in whole or in part in the form of a computer program product.
  • the computer program product includes one or more computer instructions.
  • the computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • the computer instruction may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • the computer instruction may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired manner (such as coaxial cable, optical fiber, or digital subscriber line (DSL)) or a wireless manner (such as infrared, radio, or microwave).
  • the computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server or data center integrated with one or more available media.
  • the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
  • the size of the sequence numbers of the above-mentioned processes does not indicate the order of execution. The execution order of the processes should be determined by their functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present application.
  • the disclosed system, device, and method can be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • if the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the technical solutions of the embodiments of the present application, in essence, or the part that contributes to the prior art, or a part of the technical solutions, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions to make a computer device (which may be a personal computer, a server, or an access network device, etc.) execute all or part of the steps of the methods described in the embodiments of the present application.
  • the aforementioned storage media include: a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), magnetic disks, optical disks, and other media that can store program code.

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Educational Administration (AREA)
  • Tourism & Hospitality (AREA)
  • Theoretical Computer Science (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present application provides a method and an apparatus for interpreting a picture book, an electronic device, and a smart robot, relating to the field of picture book interpretation technology. Said method comprises: determining the current picture book being read by a reader; and acquiring historical information of the current picture book, and raising a first round of questions with the reader. In the present application, after a picture book being read by a reader is determined, the number of previous interpretations by the reader on the current picture book and a positive answer rate of each round of questions raised during each interpretation are acquired from a resource library, and then a round of questions, matching an understanding degree of the reader, are raised with the reader according to the number of interpretations and the average value of a plurality of positive answer rates of each round of questions, thereby preventing the first round of question raised with the reader during an interpretation of a picture book from going beyond the reader's comprehension ability or being too simple.

Description

Method and apparatus for interpreting a picture book, electronic device, and intelligent robot
This application claims priority to Chinese Patent Application No. 202010365911.X, filed with the China National Intellectual Property Administration on April 30, 2020 and entitled "Method and apparatus for interpreting a picture book, electronic device, and intelligent robot", which is incorporated herein by reference in its entirety.
Technical field
The present invention relates to the technical field of picture book interpretation, and in particular to a method and apparatus for interpreting a picture book, an electronic device, and an intelligent robot.
Background
Listening to stories is an activity that all children love as they grow up, and children also want their parents to be by their side and read together with them. Nowadays most parents cannot tell stories to their children, for reasons such as being too busy with work or leaving the children in the care of grandparents, so many devices that read picture books to children have appeared on the market, such as point-reading machines and picture book reading robots. However, these devices can only play audio; they cannot interact with children through questions and answers, guide children to think, or inspire the reader's imagination.
Summary of the invention
To solve problems in the prior art such as terminal devices being unable to interact with children through questions and answers or guide children to think, the embodiments of the present application provide a method and apparatus for interpreting a picture book, an electronic device, and an intelligent robot.
To achieve the foregoing objectives, the embodiments of the present application adopt the following technical solutions:
In a first aspect, the present application provides a method for interpreting a picture book, executed by a terminal device. The method includes: determining the current picture book being read by a reader; acquiring historical information of the current picture book, where the historical information includes the number of times the reader has previously interpreted the current picture book and the positive answer rate of each round of questions asked in each of those interpretations; and asking the reader a first round of questions, where the first round of questions is determined based on the historical information. Here, rounds of questions means the following: the multiple questions associated with a picture book are divided into N sets according to certain rules, the sets are sorted in a certain order, and the questions of the next set are asked only after the questions of the previous set have been asked, so that the questions form successive rounds.
In the above invention, after the picture book being read by the reader is determined, the number of the reader's previous interpretations of the current picture book and the positive answer rate of each round of questions asked in each interpretation are acquired from the resource library. Then, based on the number of interpretations and the average of the positive answer rates for each round of questions, the reader is asked a round of questions that matches the reader's level of understanding, which prevents the first round of questions asked during an interpretation from being beyond the reader's comprehension or being too simple.
In another possible implementation, the method further includes: receiving the reader's answers to the questions in the first round of questions; and when the positive answer rate of the answers to the questions is greater than a set threshold, asking the reader a second round of questions, where the questions in the second round of questions are different from the questions in the first round of questions.
In the above invention, after the first round of questions has been asked, whether to ask the reader the next, more difficult round of questions is determined based on the positive answer rate of the reader's answers, thereby guiding the reader step by step to understand the story and improving the reader's reading comprehension.
In another possible implementation, the method further includes: the order of the first round of questions and the second round of questions is set according to difficulty.
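The division of a picture book's questions into N rounds ordered by difficulty can be sketched as below. The application only says the questions are divided "according to certain rules"; the numeric difficulty score, the nearly equal round sizes, and the function name `split_into_rounds` are assumptions made for this illustration.

```python
from typing import List, Sequence, Tuple


def split_into_rounds(
    questions: Sequence[Tuple[str, int]], n_rounds: int
) -> List[List[str]]:
    """Sort (text, difficulty) pairs by difficulty and cut them into rounds.

    Earlier rounds hold easier questions; round sizes differ by at most one.
    """
    if n_rounds < 1:
        raise ValueError("n_rounds must be at least 1")
    ordered = [text for text, _ in sorted(questions, key=lambda q: q[1])]
    size, extra = divmod(len(ordered), n_rounds)
    rounds: List[List[str]] = []
    start = 0
    for i in range(n_rounds):
        end = start + size + (1 if i < extra else 0)
        rounds.append(ordered[start:end])
        start = end
    return rounds
```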
In another possible implementation, before the acquiring of the historical information of the current picture book, the method further includes: acquiring identity information of the reader. The acquiring of the historical information of the current picture book includes: acquiring the historical information of the current picture book corresponding to the identity information.
In the above invention, by identifying the reader's identity information and referring to the historical information related to that reader, a round of questions matching the reader's level of understanding can be asked more accurately.
In another possible implementation, a data package of at least one picture book is stored in the resource library. The data package includes the name of the picture book, the page numbers of the picture book, the content corresponding to each page number, at least one round of questions corresponding to the content, and the answers to the questions in the at least one round of questions. The at least one picture book includes the current picture book being read by the reader, and the at least one round of questions includes the first round of questions and the second round of questions.
In the above invention, picture book designers enter the prepared text of each picture book into the memory in advance, so that the terminal device can still interpret picture books for the reader when it is not connected to a network.
In another possible implementation, after the determining of the current picture book that the reader is reading, the method further includes: determining whether the current picture book is stored in the resource library; when the resource library does not contain the current picture book, determining whether the content of the current picture book is the same as the content of a first picture book already stored in the resource library; and when the content of the current picture book is the same as the content of the first picture book, asking the reader the round of questions corresponding to the first picture book.
In the above invention, there is a huge number of picture books on the market, and some picture books have the same content but differ in form because of different publishers, different layouts, and so on. Picture book designers therefore do not need to enter every edition of a picture book with the same content into the memory. They only need to produce one set of picture book information based on the content and enter it into the memory, which reduces both the designers' workload and the storage space used in the memory.
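The fallback from an unknown edition to a stored picture book with the same content can be sketched as a lookup by content fingerprint. The application does not state how "same content" is judged; hashing the whitespace- and case-normalized text, as well as the function names and the sample titles, are assumptions of this sketch.

```python
import hashlib
from typing import Dict, List, Optional


def content_fingerprint(text: str) -> str:
    """Hash the text after lower-casing and collapsing whitespace."""
    normalized = " ".join(text.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def find_questions(
    title: str,
    content: str,
    questions_by_title: Dict[str, List[str]],
    title_by_fingerprint: Dict[str, str],
) -> Optional[List[str]]:
    """Return stored questions for the book, or for a same-content book."""
    if title in questions_by_title:
        return questions_by_title[title]
    same_content_title = title_by_fingerprint.get(content_fingerprint(content))
    if same_content_title is None:
        return None
    return questions_by_title.get(same_content_title)
```

An exact hash only catches editions whose text is identical after normalization; editions with reworded text would need a looser comparison.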
在另一个可能的实现中,所述方法还包括:确定所述阅读者正在阅读的所述当前绘本的页码;向所述阅读者提出与所述页码对应的内容的轮次问题。In another possible implementation, the method further includes: determining the page number of the current picture book that the reader is reading; and asking the reader a question about the turn of the content corresponding to the page number.
上述发明中,由于绘本的内容比较多,阅读者不可能一次性将一本绘本在一次阅读中阅读完,所以可以根据绘本的页码,将对绘本解读的问题分成N部分,在阅读者阅读完一页内容后,就向阅读者提问问题,从而更好的协助阅读者解读绘本。In the above invention, due to the relatively large content of picture books, it is impossible for readers to read a picture book in one reading at a time. Therefore, the problem of interpretation of picture books can be divided into N parts according to the page number of the picture book. After one page of content, ask the reader questions to better assist the reader in interpreting the picture book.
在另一个可能的实现中,所述确定阅读者正在阅读的当前绘本,包括:通过麦克风获取所述阅读者正在阅读的所述当前绘本的语音信息;将所述语音信息转换成绘本文字,所述绘本文字用于从所述资源库中确定所述阅读者正在阅读的当前绘本。在另一个可能的实现中,所述确定阅读者正在阅读的当前绘本,包括:通过摄像头获取所述阅读者正在阅读的所述当前绘本的图像;识别所述图像中特征,得到绘本特征值,所述绘本特征值用于从所述资源库中确定所述阅读者正在阅读的当前绘本。在另一个可能的实现中,在确定所述阅读者答复第一轮次问题的正向答案率大于设定阈值之前,所述方法还包括:通过麦克风获取所述阅读者答复问题的语音信息;将所述语音信息转换成答案文字,所述答案文字用于从所述资源库中确定所述阅读者答复所述第一轮次问题中第一问题的答案是否正确,所述第一轮次问题包括所述第一问题。In another possible implementation, the determining the current picture book being read by the reader includes: acquiring the voice information of the current picture book being read by the reader through a microphone; converting the voice information into picture book text, so The picture book text is used to determine the current picture book being read by the reader from the resource library. In another possible implementation, the determining the current picture book being read by the reader includes: acquiring the image of the current picture book being read by the reader through a camera; identifying features in the image to obtain the picture book feature value, The picture book feature value is used to determine the current picture book being read by the reader from the resource library. In another possible implementation, before it is determined that the positive answer rate of the reader's answer to the first round of questions is greater than a set threshold, the method further includes: obtaining voice information of the reader's answer to the question through a microphone; The voice information is converted into answer text, and the answer text is used to determine from the resource database whether the reader’s answer to the first question in the first round of questions is correct, and the first round The problem includes the first problem.
In another possible implementation, before it is determined that the positive answer rate of the reader's answers to the first round of questions is greater than a set threshold, the method includes: acquiring, through a camera, an action or gesture of the reader; and identifying features in the image of the reader's action or gesture to obtain an answer feature value, where the answer feature value is used to determine, from the resource library, whether the reader's answer to a second question in the first round of questions is correct, the first round of questions including the second question.
In another possible implementation, the method further includes: incrementing by one, in the historical information, the number of times the reader has interpreted the current picture book, and storing the positive answer rate of the reader's answers to each round of questions during this interpretation.
In another possible implementation, before the first round of questions is put to the reader, the method includes: receiving an interpretation instruction, where the interpretation instruction is used to instruct that the first round of questions be put to the reader.
In the above invention, after the terminal device determines the picture book the reader is currently reading, the reader sometimes does not want that content interpreted, or wants it interpreted later. An instruction is therefore used to control whether and when the terminal device performs interpretation, so as to meet the reader's needs.
In a second aspect, an embodiment of this application provides a picture book interpretation apparatus, including a transceiver, a processor, and a memory. The transceiver is configured to receive and send data. The memory stores one or more programs; the one or more programs include instructions, and when the instructions are executed by the processor, the electronic device is caused to execute each possible implementation of the first aspect.
In a third aspect, an embodiment of this application provides an electronic device, including: a camera and/or a microphone, a memory, and a processor that executes each possible implementation of the first aspect.
In a fourth aspect, an embodiment of this application provides a smart robot, including: a camera and/or a microphone, configured to receive voice information or image information of a reader reading a picture book, and voice information or image information of the reader's answers to questions corresponding to the picture book; a memory, configured to store second information of at least one picture book and, for each picture book, the number of times the reader has historically interpreted the picture book and the positive answer rate of each round of questions put during each of the reader's historical interpretations; a processor, configured to process the voice information or image information acquired by the camera and/or the microphone, then determine from the memory the current picture book being read by the reader, and determine the round of questions to put to the reader according to the number of interpretations of the current picture book stored in the memory and the mean of the positive answer rates obtained for each round of questions across those interpretations; a speaker, configured to play speech to the reader; and a communication unit, configured to receive the second information of each picture book and the positive answer rate threshold for advancing from the current round to the next round.
In a fifth aspect, an embodiment of this application provides a readable storage medium configured to store instructions that, when executed, cause each possible implementation of the first aspect to be implemented.
In a sixth aspect, an embodiment of this application provides a computer program device containing instructions that, when run on a terminal, cause each possible implementation of the first aspect to be implemented.
Description of the drawings
The following briefly introduces the accompanying drawings needed for describing the embodiments or the prior art.
FIG. 1 is an architecture diagram of an application system for interpreting picture books according to an embodiment of this application;
FIG. 2 is a schematic structural diagram of a terminal device according to an embodiment of this application;
FIG. 3 is a schematic diagram of different gestures representing different numbers according to an embodiment of this application;
FIG. 4 is a schematic diagram of a template for making a picture book according to an embodiment of this application;
FIG. 5 is a schematic diagram of a template for making a picture book according to an embodiment of this application;
FIG. 6 is a schematic diagram of a template for making a picture book according to an embodiment of this application;
FIG. 7 is a flowchart of an interpretation method according to an embodiment of this application;
FIG. 8 is a schematic diagram of an image displayed on a touch screen according to an embodiment of this application;
FIG. 9 is a schematic structural diagram of a terminal device according to an embodiment of this application.
Detailed description of embodiments
The technical solutions in the embodiments of this application are described below with reference to the accompanying drawings in the embodiments of this application.
FIG. 1 is an architecture diagram of an application system for interpreting picture books according to an embodiment of this application. As shown in FIG. 1, the system architecture includes a picture book, a terminal device, and a reader.
The picture book may be a paper book, or a device such as a tablet or Kindle on which the reader can read stories.
In the embodiments of this application, the terminal device includes, but is not limited to, smart devices such as tablet computers and smartphones; it may also be, for example, a smart robot developed independently for a specific business scenario. The terminal device stores the stories in the picture book, together with a series of questions corresponding to each story and the answer to each question. The questions for each story are divided into several rounds by difficulty, and the terminal device puts the rounds to the reader in order from easy to difficult. When assisting the reader in interpreting a picture book, the terminal device first determines the story the reader is reading and retrieves the corresponding questions and answers from the database. After the reader finishes the story, the terminal device selects a round of matching difficulty according to the number of times the reader has previously interpreted the picture book and the positive answer rate in each round, so that the first round of questions put to the reader in a given interpretation is neither beyond the reader's comprehension nor too simple. In addition, after a round of questions has been asked, the terminal device determines from the correct rate of the reader's answers whether to go on to the next, more difficult round, thereby guiding the reader step by step toward understanding the story and improving the reader's reading comprehension.
FIG. 2 is a schematic structural diagram of a terminal device according to an embodiment of this application. As shown in FIG. 2, the terminal device includes an input unit 1, an output unit 2, a processing unit 3, a storage unit 4, and an entry unit 5.
The terminal device needs to be able to perceive the identity of the current user, for example through face recognition, voiceprint recognition, or a user account and password, and to obtain the current user's personal information either from user input or in a smarter way (for example, recognizing the user's gender and age from the face). All personal data generated while the user uses the device is stored under the user's account.
The input unit 1 includes a microphone 11 and a camera 12. The microphone 11 is configured to collect voice information, and the camera 12 is configured to collect image information.
In the embodiments of this application, the microphone 11 serves two purposes. The terminal device can use the microphone 11 to acquire voice information of the reader reading a story in the picture book, so as to determine which story the reader is reading. The terminal device can also use the microphone 11 to acquire voice information of the reader replying to a question put by the terminal device, so as to obtain the reader's answer to the question.
The camera 12 likewise serves two purposes. The terminal device can use the camera 12 to capture the page of the picture book the reader is reading and identify the story from the page content. The terminal device can also use the camera 12 to capture the reader's actions or gestures, so as to recognize the reader's answer to a question from those actions or gestures.
The output unit 2 includes a speaker 21. In the embodiments of this application, the terminal device can use the speaker 21 to play a recording of another person reading the story the reader is reading, to announce the questions corresponding to that story and the correct answers to those questions, to speak words of encouragement to the reader, and so on.
The processing unit 3 includes an automatic speech recognition (ASR) unit 31. ASR is a technology that converts human speech into text. In the embodiments of this application, the ASR unit 31 works with the microphone 11: it converts the voice information acquired by the microphone 11, both the reader's reading of the story and the reader's replies to questions put by the terminal device, into the corresponding text, and the processing unit 3 then uses that text to look up the corresponding story or answer in the database.
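The text lookup described above can be sketched as follows. This is only an illustrative sketch, not the method of this application: it assumes the ASR output is already available as a string, and the names `RESOURCE_LIBRARY`, `identify_book`, and `min_score`, the toy story texts, and the word-overlap score are all hypothetical.

```python
# Hypothetical resource library: picture book ID -> story text (toy data).
RESOURCE_LIBRARY = {
    "book-001": "the little bear walked into the dark forest with his dad",
    "book-002": "three small pigs built houses of straw sticks and bricks",
}


def identify_book(transcript, library=RESOURCE_LIBRARY, min_score=0.6):
    """Return the ID of the story that best covers the transcript words, or None."""
    words = set(transcript.lower().split())
    if not words:
        return None
    best_id, best_score = None, 0.0
    for book_id, text in library.items():
        # Score = fraction of transcript words that also occur in the story text.
        score = len(words & set(text.lower().split())) / len(words)
        if score > best_score:
            best_id, best_score = book_id, score
    # Reject weak matches so that unrelated speech does not select a book.
    return best_id if best_score >= min_score else None
```

A real device would likely need a matching scheme that tolerates recognition errors; the word-overlap score here is chosen only because it is short and easy to follow.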
The processing unit 3 further includes a visual processing unit 32. In the embodiments of this application, the visual processing unit 32 works with the camera 12: it processes the image information acquired by the camera 12 and extracts the required features from the image. The visual processing unit 32 includes a picture book image recognition unit 321, a picture book click recognition unit 322, and a gesture recognition unit 323.
When entering a picture book in advance, the picture book content designer extracts feature values from the images of the cover and of each inside page of the picture book, generates a unique identity document (ID) for the picture book from the feature values, associates the picture book ID with the feature values, and finally stores them in the storage unit 4.
The picture book image recognition unit 321 is configured to, after the camera 12 captures an image of the page the reader is reading, identify feature values such as the cover and the specific page of the current picture book, and then compare them against the storage unit 4 to find the corresponding picture book ID and page ID, thereby determining which picture book the reader is reading and which page of it is currently open. In one possible embodiment, the picture book image recognition unit 321 uses the scale-invariant feature transform (SIFT) algorithm to detect and describe local features in the image.
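The comparison of captured feature values against stored ones might look like the sketch below. It is an assumption-laden illustration: the application does not state how descriptors are compared, so a nearest-neighbour ratio test (a technique commonly paired with SIFT) is swapped in here, the descriptors are toy 2-D tuples rather than real 128-dimensional SIFT vectors, and all names (`ratio_test_matches`, `identify_page`, `PAGES`) are hypothetical.

```python
import math


def ratio_test_matches(query, stored, ratio=0.75):
    """Count query descriptors whose nearest stored descriptor is clearly
    closer than the second nearest (nearest-neighbour ratio test)."""
    matches = 0
    for q in query:
        dists = sorted(math.dist(q, s) for s in stored)
        if len(dists) >= 2 and dists[0] < ratio * dists[1]:
            matches += 1
    return matches


def identify_page(query, page_descriptors, min_matches=2):
    """Return the page ID with the most ratio-test matches, or None."""
    best_id, best = None, 0
    for page_id, stored in page_descriptors.items():
        n = ratio_test_matches(query, stored)
        if n > best:
            best_id, best = page_id, n
    return best_id if best >= min_matches else None


# Toy stored descriptors keyed by a hypothetical page ID.
PAGES = {
    "book-001/cover": [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)],
    "book-001/page-1": [(50.0, 50.0), (60.0, 50.0), (50.0, 60.0)],
}
```

For example, a captured image whose descriptors lie close to the stored cover descriptors would be resolved to `"book-001/cover"`, while descriptors far from every stored page yield `None`.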
The picture book click recognition unit 322 is configured to detect the position of the area the reader's finger is pointing at, using video frames captured by the camera 12 that include the reader's finger. In one possible embodiment, the picture book click recognition unit 322 first preprocesses the captured image (that is, it processes the hand-shaped region of the image, for example by reducing noise and excluding regions whose color differs markedly from skin tone); it then performs edge extraction on the image to obtain the edge of the protruding region (that is, it extracts the image of the finger according to the shape and contour of the finger region); finally, it determines the area of the picture book the reader is pointing at from the captured image and the extracted finger image.
The gesture recognition unit 323 is configured to recognize the reader's action or gesture in the image and then determine the answer the reader is expressing from the corresponding gesture. Gesture recognition methods include, but are not limited to, recognition based on geometric features, on the edges of the gesture (such as its contour), and on features of the gesture region (such as palm color and area).
For example, as shown in FIG. 3, the picture book content designer can define "1, 2, 3, 4, 5, 6, 7, 8, 9, 10" according to the number of fingers shown and specific finger gestures. During picture book interpretation, the reader can answer questions with gestures. For instance, the terminal device asks the reader: "How many kinds of small animals are there in this picture?" The reader can answer with any one of the gestures shown in FIG. 3. The gesture recognition unit 323 determines the reader's answer by recognizing the gesture the reader shows.
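Once a gesture has been recognized, checking it against the expected answer reduces to a table lookup. The sketch below assumes the recognizer emits a string label per gesture; the labels `gesture_1` to `gesture_10` and the function name are hypothetical and stand in for whatever the gesture recognition unit actually outputs.

```python
# Hypothetical mapping from recognized gesture label to the number it represents.
GESTURE_TO_NUMBER = {f"gesture_{n}": n for n in range(1, 11)}


def check_gesture_answer(gesture_label, expected_number):
    """Return True if the recognized gesture encodes the expected number."""
    number = GESTURE_TO_NUMBER.get(gesture_label)
    # Unknown gestures map to None and therefore count as incorrect.
    return number is not None and number == expected_number
```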
The processing unit 3 further includes a reading result calculation unit 33 and an interpretation round calculation unit 34. In the embodiments of this application, the reading result calculation unit 33 is configured to calculate the correct rate of the reader's answers once the reader has answered all the questions in a round, and then to store the calculated result in the storage unit 4.
The interpretation round calculation unit 34 is configured to determine which round to use when interpreting the picture book for the reader, according to the correct rate of the reader's answers in the current round, the number of times recorded in the storage unit 4 that the reader has already interpreted this picture book and picture books of the same kind, and the calculation rule selected when the picture book was entered. All personal data generated while the reader reads (such as the number of readings, reading rounds, and question-and-answer results) is associated with the reader's account, so that the data of different readers is kept separate and reading rounds are calculated individually for each reader.
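The within-session part of this logic, computing the correct rate for a round and deciding whether to advance, can be sketched as below. The function names are hypothetical, answers are assumed to be recorded as booleans, and stopping when the threshold is not exceeded is an assumption; the application only states that advancing requires the rate to be greater than the threshold.

```python
def positive_answer_rate(answers):
    """Fraction of questions in a round answered correctly (answers are booleans)."""
    return sum(answers) / len(answers) if answers else 0.0


def next_round(current_round, answers, threshold, total_rounds):
    """Return the next round number, or None if interpretation should not advance."""
    passed = positive_answer_rate(answers) > threshold  # strictly greater than
    if passed and current_round < total_rounds:
        return current_round + 1
    return None
```

For example, three correct answers out of four give a rate of 0.75, which exceeds a threshold of 0.7, so `next_round(1, [True, True, False, True], 0.7, 3)` advances to round 2.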
The storage unit 4 includes a database 41 and a resource library 42. The database 41 stores the data produced when the processing unit 3 processes voice information and image information. The resource library 42 stores the data entered by the picture book content designer.
For example, the database 41 is divided, according to the type of data stored, into a picture book image feature database 411, a first picture book reading record database 412, and a second picture book reading record database 413.
The picture book image feature database 411 is used to extract the images of the cover and of each page of every picture book in the resource library 42, identify their feature values, and associate each image feature value with the data of the picture book and of the corresponding page. In one possible embodiment, the picture book developer extracts the images of the cover and of each page of a picture book entered in the resource library 42, identifies the feature values, stores them in the picture book image feature database 411, and then associates each feature value with the corresponding data stored in the resource library 42, such as the content of the cover and pages and the questions and answers for each round. The specific association is as follows:
Table 1: Association between the feature values of the picture book cover and page images and the data stored in the resource library
Figure PCTCN2021080269-appb-000001
After the processing unit 3 acquires, through the camera 12, an image of the cover or of a page of the picture book the reader is reading, the visual processing unit 32 processes the image to obtain its feature value, which is then sent to the picture book image feature database 411 and compared with the feature values stored there. If a stored feature value matches the one sent, the processing unit 3 can follow the association of the matching stored feature value to obtain from the resource library 42 the content of the corresponding picture book or page, together with the questions, answers, and other data for each round.
The first picture book reading record database 412 is used to record information about each interpretation of each picture book. That is, when a picture book is interpreted, a piece of interpretation information is generated from the correct rate of each round as calculated by the reading result calculation unit 33; the interpretation information contains the correct rate of each round for one picture book in one interpretation. In one possible embodiment, the interpretation information stored in the first picture book reading record database 412 has the following format:
Table 2: Relationship between the picture book in the interpretation information and the correct rate of the answers in each round
Figure PCTCN2021080269-appb-000002
Each time the terminal device finishes interpreting a picture book for the reader, it generates a piece of interpretation information that records how well the reader understood the picture book during that interpretation. When the picture book is later interpreted again, the interpretation round calculation unit 34 uses the picture book's historical interpretation information to determine which round to enter directly.
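One way the starting round could be derived from the historical interpretation information is sketched below. The exact rule is not given in this passage, so the rule used here is an assumption: start at the first round whose mean historical positive answer rate does not exceed the threshold, or that has never been asked. The record layout (one dict per past interpretation, mapping round number to rate) and the name `starting_round` are hypothetical.

```python
from statistics import mean


def starting_round(history, threshold, total_rounds):
    """Pick the round to enter directly when a picture book is interpreted again.

    history: list of dicts, one per past interpretation, mapping
             round number -> positive answer rate in that interpretation.
    """
    for rnd in range(1, total_rounds + 1):
        rates = [record[rnd] for record in history if rnd in record]
        # Assumed rule: stay at the first round never asked or not yet mastered.
        if not rates or mean(rates) <= threshold:
            return rnd
    # Every round has been mastered on average; repeat the hardest round.
    return total_rounds
```

With no history the function returns round 1, which matches the easy-to-difficult ordering described for a first interpretation.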
The second picture book reading record database 413 is used to record interpretation information for each type within each picture book. Compared with the interpretation information stored in the first picture book reading record database 412, the interpretation information stored in the second picture book reading record database 413 adds a classification by type, dividing the interpretation information into, for example, number-learning interpretation information, letter-learning interpretation information, and so on, as follows:
Table 3: Relationship between the picture book in the interpretation information and the correct rate of the answers in each round
Figure PCTCN2021080269-appb-000003
The resource library 42 is used to store at least one picture book. The stored picture book content includes the picture book's basic information (name, picture book type, subtype, number of pages, cover picture, pictures of specific pages, and so on), the images of the cover and of each page, the feature values of the images, the text and images of each page, the questions and answers for each round corresponding to each page, the criteria for calculating whether a round is passed, and other information. When the reader interprets a picture book, the processing unit 3 determines the picture book the reader is reading and the page currently open, and then obtains from the resource library 42 the information about that picture book needed for interpretation.
The entry unit 5 is used to download picture book content and enter it into the resource library 42. The entry unit 5 may be a physical interface such as a USB interface or a Type-C interface, or a wireless communication module such as a WiFi module or a Bluetooth module; this application does not limit this.
Before the terminal device can assist a reader in interpreting a picture book, the picture book developer needs to produce the picture book content in advance in a development tool and then store it in the resource library 42 through the entry unit 5.
The following describes, with further reference to the terminal device shown in FIG. 2, how a picture book developer makes a picture book. The children's picture book "Dad, Don't Be Afraid" (《爸爸,别怕》) is used as an example; this application is not limited to it.
The picture book developer produces picture book content through a dedicated app, the cloud, or similar means (this application uses an app as an example). When the app is opened for production, it displays the template shown in FIG. 4. The template includes options such as picture book name, author, total number of interpretation rounds, picture book series name, picture book category, picture book subcategory, and cover picture, and the picture book developer fills in each option according to the picture book to be entered.
For example, as shown in FIG. 4, the options in the template are divided into required and optional items. Required items, such as picture book name, author, and cover picture, must be filled in by the picture book developer; optional items, such as picture book category and picture book subcategory, may be left blank.
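A check for the required items could be sketched as follows. The field keys (`name`, `author`, `cover_image`, `category`) are hypothetical identifiers chosen for the options named above; the actual template in FIG. 4 may contain further required items that are not listed in the text.

```python
# Hypothetical keys for the required template options named in the text.
REQUIRED_FIELDS = ("name", "author", "cover_image")


def missing_required(form):
    """Return the required fields that are absent or empty in the submitted form."""
    return [field for field in REQUIRED_FIELDS if not form.get(field)]
```

A form that leaves the author blank and omits the cover picture would yield `["author", "cover_image"]`, while optional items such as the category are never reported.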
After filling in the options in the template shown in FIG. 4, the picture book developer clicks the "Start entering picture book interpretation content" option to open the template shown in FIG. 5. This template includes options for at least one round of interpretation (this application uses three rounds as an example), at least one picture book page (this application uses a 15-page picture book as an example), read-aloud content, and question-and-answer interaction.
For example, as shown in FIG. 5, in one case the options for each page include three options, first interpretation, second interpretation, and third interpretation, and each interpretation option includes two options, read-aloud content and question-and-answer interaction. When the picture book developer produces the first-round interpretation content for page 1 of "Dad, Don't Be Afraid", selecting "Page 1" from the 15 page options brings up the three interpretation options; selecting "First round of interpretation" from these then brings up the read-aloud content and question-and-answer interaction options. The picture book developer uploads the image of page 1 and the text printed on page 1 to the "Read-aloud content" option, and then uploads the questions to be asked and their answers to the "Question-and-answer interaction" option.
In another case, the options for each page include two options, read-aloud content and question-and-answer interaction, and each of these includes three options, first interpretation, second interpretation, and third interpretation. When the picture book developer produces the first-round interpretation content for page 1 of "Dad, Don't Be Afraid", selecting "Page 1" from the 15 page options brings up the read-aloud content and question-and-answer interaction options. Selecting "Read-aloud content" brings up the three interpretation options; after selecting "First round of interpretation" from these, the picture book developer uploads the image of page 1 and the text printed on page 1 to the "Read-aloud content" option. Selecting "Question-and-answer interaction" likewise brings up the three interpretation options; after selecting "First round of interpretation" from these, the picture book developer uploads the image of page 1 and the questions to be asked about page 1, with their answers, to the "Question-and-answer interaction" option.
Similarly, the picture book developer produces the interpretation content for the other rounds and other pages of "Dad, Don't Be Afraid" in the same way as described above for the first round of page 1.
After filling in the options for every page and every round of interpretation in the template shown in FIG. 5, the picture book developer clicks the "Finish interpretation" option to open the template shown in FIG. 6. This template includes options such as the rule for entering the next round of interpretation and the average number of positive answers.
For example, as shown in FIG. 6, the "Rule for entering the next round of interpretation" option offers two mutually exclusive choices: "Method 1: in the first-round interpretation Q&A interaction for this picture book, the reader's number of positive answers exceeds" and "Method 2: more than 2 picture books in the same series have been read, and in the first-round interpretation Q&A interaction, the reader's average number of positive answers exceeds". If the picture book being made is part of a series, "Method 1" is selected; if it is not part of a series, "Method 2" is selected. A positive answer rate value is then chosen in the "Average number of positive answers" option. The terminal device enters the second round of interpretation only if the reader's positive answer rate is greater than the value set.
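The two rules can be expressed as a single predicate, sketched below under stated assumptions. The text does not say over what the "average" in Method 2 is taken; this sketch assumes it is the mean of the first-round positive answer values for the previously read picture books of the same series. The function name and parameters are hypothetical, and `threshold` stands for the value chosen in the template, whether it is expressed as a count or as a rate.

```python
def may_enter_second_round(method, threshold, first_round_positive=0.0,
                           series_first_round_positives=()):
    """Decide whether the second round of interpretation may start.

    method 1: this book's first-round positive answers must exceed threshold.
    method 2: more than 2 same-series books must have been read, and the mean
              of their first-round positive answers must exceed threshold
              (assumed reading of the rule).
    """
    if method == 1:
        return first_round_positive > threshold
    if method == 2:
        values = list(series_first_round_positives)
        if len(values) <= 2:
            return False
        return sum(values) / len(values) > threshold
    raise ValueError("method must be 1 or 2")
```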
After filling in the options in the template shown in Figure 6, the picture book developer clicks the "Submit" option, which indicates that production of the picture book "Dad, Don't Be Afraid" is complete. The picture book developer can upload the picture book to a cloud server, and the reader can then download it through the input unit 5 as needed.
When the picture book is downloaded into the resource library 42 through the input unit 5, the picture book image feature value database 411 extracts the images of the cover and of each page of the picture book, identifies their feature values, generates an ID for the cover and for each page, associates each feature value with the corresponding ID (the specific relationship is shown in Table 1), and stores the result in the picture book image feature value database 411.
The technical solutions of the embodiments of this application are described in more detail below with reference to the terminal device shown in Figure 2 (taking an intelligent robot as an example of the terminal device).
Figure 7 is a flowchart of how the terminal device provided by an embodiment of this application assists a reader in interpreting a picture book.
Step S701: Determine the current picture book that the reader is reading.
When the reader wants the intelligent robot to accompany them in interpreting a story, the reader issues a picture book interpretation instruction to the intelligent robot by voice, key press, or similar means, so that the robot starts working. After receiving the instruction, the intelligent robot determines the identity of the current user (by voiceprint, password, fingerprint, face information, or the like) and then turns on the microphone 11 and/or the camera 12 to capture the voice of the reader reading the picture book and/or an image of the current page of the picture book being read.
In one possible embodiment, the microphone 11 of the intelligent robot collects the reader's voice information; the ASR unit 31 in the processing unit 3 converts the collected voice into text, extracts multiple keywords from the converted text, and compares them with the corresponding keywords of each picture book stored in the resource library 42 in the storage unit 4. If the keywords obtained from the collected voice match the corresponding keywords of a picture book in the resource library 42, the reader is reading that picture book, and both the picture book and the page number corresponding to the content being read are thereby determined. If no picture book in the resource library 42 has keywords matching those obtained from the collected voice, either the reader is not "reading a story" or the resource library 42 does not store the picture book being read, and the intelligent robot cannot interpret the story for the reader.
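The keyword comparison described above can be sketched as follows. This is a minimal illustration rather than the implementation of this application: the function name `identify_page`, the nested dictionary layout of the library, and the `min_overlap` cutoff are assumptions introduced here, and the ASR step is assumed to have already produced a list of keywords.

```python
def identify_page(spoken_keywords, library, min_overlap=2):
    """Return (book_title, page_number) for the best keyword match, or None.

    library maps book title -> {page number -> set of keywords}.
    min_overlap is a hypothetical cutoff for how many keywords must agree.
    """
    spoken = set(spoken_keywords)
    best, best_score = None, 0
    for title, pages in library.items():
        for page_no, keywords in pages.items():
            score = len(spoken & set(keywords))
            if score > best_score:
                best, best_score = (title, page_no), score
    # No page shares enough keywords: the book is unknown or no story is being read.
    return best if best_score >= min_overlap else None


library = {
    "Dad, Don't Be Afraid": {
        1: {"bear", "dad", "forest"},
        2: {"rabbit", "night", "dark"},
    }
}
print(identify_page(["bear", "forest", "walk"], library))
print(identify_page(["car"], library))
```

Returning `None` corresponds to the case in which the robot cannot interpret the story for the reader.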
Each picture book stored in the resource library 42 is produced in the manner shown in Figures 4 to 6 and described in the corresponding text, which is not repeated here.
In one possible embodiment, the camera 12 of the intelligent robot captures an image or video; the vision processing unit 32 in the processing unit 3 recognizes the captured image (for video, the vision processing unit 32 processes each frame separately) to obtain multiple feature values of the current image, and compares them with the feature values of the corresponding images of each picture book stored in the picture book image feature value database 411 in the storage unit 4. If the feature values of the captured image or video match the corresponding feature values of a picture book in the resource library, the reader is reading that picture book, and both the picture book and the page number corresponding to the content being read are thereby determined. If no picture book in the resource library has feature values matching those of the captured image or video, either the reader is not "reading a story" or the resource library 42 does not store the picture book being read, and the intelligent robot cannot interpret the story for the reader.
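The feature value comparison above can be illustrated with a nearest-neighbour lookup. The application does not specify how feature values are extracted or compared, so this sketch assumes that each page is represented by a fixed-length numeric vector and that Euclidean distance with a hypothetical `max_distance` cutoff decides a match; the page IDs and the name `match_page` are also illustrative.

```python
import math


def match_page(features, index, max_distance=0.5):
    """Return the ID of the stored page closest to `features`, or None.

    index maps page ID -> reference feature vector (same length as `features`).
    max_distance is an assumed cutoff beyond which no page is considered a match.
    """
    best_id, best_dist = None, math.inf
    for page_id, reference in index.items():
        dist = math.dist(features, reference)  # Euclidean distance
        if dist < best_dist:
            best_id, best_dist = page_id, dist
    return best_id if best_dist <= max_distance else None


index = {"book1-cover": (0.0, 0.0), "book1-page1": (1.0, 1.0)}
print(match_page((0.9, 1.1), index))
print(match_page((5.0, 5.0), index))
```

For video input, the same lookup would be applied to each frame separately, as the text describes.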
Of course, the intelligent robot in the embodiments of this application can also turn on the microphone 11 and the camera 12 at the same time and, by obtaining voice information together with image or video information, determine more accurately the picture book being read and the page number corresponding to the content being read.
Step S703: Obtain the historical information of the current picture book.
Step S705: Ask the reader the first round of questions.
After determining the picture book being read and the page number corresponding to the content being read, the intelligent robot starts the picture book interpretation.
In one possible embodiment, the interpretation round calculation unit 34 in the processing unit 3 obtains data on the historical reading of the picture book from the first picture book reading record database 412 of the storage unit 4. When the data obtained by the interpretation round calculation unit 34 contains no record of the picture book having been interpreted, the picture book is being interpreted for the first time; the processing unit 3 then obtains the first-round interpretation questions of the picture book from the resource library 42 and asks the reader these questions through the speaker 21. When the interpretation round calculation unit 34 obtains one record as shown in the following table,
Table 4: Relationship between the picture book in the interpretation information and the correct answer rate in each round
Figure PCTCN2021080269-appb-000004
this indicates that the picture book has been interpreted once before: in that interpretation, the reader's positive answer rate in the first round was 60% (assuming that the positive answer rate required to move from the first round to the second round is 60%), and there is no positive answer rate for the second or third round. The processing unit 3 obtains an average positive answer rate of 60% for the first-round questions, then obtains the second-round interpretation questions of the picture book from the resource library 42 and asks the reader these questions through the speaker 21. When the interpretation round calculation unit 34 obtains two records as shown in the following table,
Table 5: Relationship between the picture book in the interpretation information and the correct answer rate in each round
Figure PCTCN2021080269-appb-000005
this indicates that the picture book has been interpreted twice before. In the first interpretation, the reader's positive answer rate was 60% in the first round (assuming that the positive answer rate required to move from the first round to the second round is 60%) and 10% in the second round, with no positive answer rate for the third round. In the second interpretation, the reader's positive answer rate was 60% in the first round and 60% in the second round, again with no positive answer rate for the third round. The processing unit 3 obtains an average positive answer rate of 60% for the first-round questions and of 35% for the second-round questions ((10% + 60%) / 2). Because the second-round average is below the required rate, the processing unit 3 obtains the second-round interpretation questions of the picture book from the resource library 42 and asks the reader these questions through the speaker 21. Other cases follow by analogy.
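The round selection in the examples above can be sketched as follows. The record layout (one dictionary per past interpretation, mapping round number to a positive answer rate in whole percent) and the name `starting_round` are assumptions made for illustration. Note one further assumption: the worked examples treat an average equal to the 60% requirement as sufficient to advance, so this sketch compares with `>=` in order to reproduce them.

```python
def starting_round(records, threshold=60, max_round=3):
    """Pick the interpretation round to start from, given past sessions.

    records: list of dicts, one per past interpretation, mapping
             round number -> positive answer rate in percent.
    A round is passed when the average rate over all sessions that
    reached it is at least `threshold` (assumed from the worked examples).
    """
    current = 1
    while current < max_round:
        rates = [session[current] for session in records if current in session]
        if not rates:
            break  # this round has never been attempted
        if sum(rates) / len(rates) < threshold:
            break  # average too low: repeat this round
        current += 1
    return current


print(starting_round([]))                                # no history
print(starting_round([{1: 60}]))                         # one past session
print(starting_round([{1: 60, 2: 10}, {1: 60, 2: 60}]))  # two past sessions
```

With no history the sketch starts at round 1; with the one-session and two-session histories described above it starts at round 2 in both cases, since the second-round average of 35% is below the requirement.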
In one possible embodiment, the interpretation round calculation unit 34 in the processing unit 3 obtains data on the historical reading of the picture book from the second picture book reading record database 413 of the storage unit 4. Suppose the current reader is studying in the "English picture book" type. When the data obtained by the interpretation round calculation unit 34 contains no record of the picture book having been interpreted in the "English picture book" type, this type of the picture book is being interpreted for the first time; the processing unit 3 then obtains the first-round interpretation questions of the "English picture book" type of the picture book from the resource library 42 and asks the reader these questions through the speaker 21. When the interpretation round calculation unit 34 obtains four records as shown in the following table,
Table 6: Relationship between the picture book in the interpretation information and the correct answer rate in each round
Figure PCTCN2021080269-appb-000006
Figure PCTCN2021080269-appb-000007
this indicates that the "English picture book" type of the picture book has been interpreted twice before. In the first interpretation, the reader's positive answer rate in the first round was 50% (assuming that the positive answer rate required to move from the first round to the second round is 60%), and there is no positive answer rate for the second or third round. In the second interpretation, the reader's positive answer rate was 70% in the first round and 60% in the second round, with no positive answer rate for the third round. The processing unit 3 obtains an average positive answer rate of 60% for the first-round questions ((50% + 70%) / 2) and of 60% for the second-round questions, then obtains the third-round interpretation questions of the "English picture book" type of the picture book from the resource library 42 and asks the reader these questions through the speaker 21. Other cases follow by analogy.
The reader can have the intelligent robot enter a specified type of the picture book by a voice instruction, an instruction entered on the screen, or similar means.
Step S707: Receive the reader's answers to the questions in the first round of questions.
Step S709: When the positive answer rate of the answers to the questions is greater than a set threshold, ask the reader the second round of questions.
After playing a question through the speaker 21, the intelligent robot turns on the microphone 11 and/or the camera 12 for a set period of time to capture the reader's reply to the question.
In one possible embodiment, the microphone 11 of the intelligent robot collects the reader's voice information; the ASR unit 31 in the processing unit 3 converts the collected voice into text, extracts multiple keywords from the converted text, and compares them with the answer to the corresponding question stored in the resource library 42 in the storage unit 4. If the keywords obtained from the collected voice match the keywords of the answer to the corresponding question in the resource library 42, the reader's answer is correct, and the processing unit 3 can play a voice message such as "Correct answer" or "You are great, that is correct" through the speaker 21 so that the reader knows the answer was correct. If the keywords do not match, the reader's answer is wrong, and the processing unit 3 can play a voice message such as "Wrong answer" or "Think again, is there a better answer?" through the speaker 21 so that the reader knows the answer was wrong, and then play the correct answer.
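The answer check and spoken feedback described above might look like the following sketch. It assumes the ASR step has already produced a transcript, that the stored answer is a list of single-word keywords, and that an answer counts as correct only when every keyword appears; the function names and the matching rule are illustrative assumptions, not the method of this application.

```python
import re


def is_correct(transcript, answer_keywords):
    """True if every expected keyword occurs as a word in the transcript."""
    words = set(re.findall(r"\w+", transcript.lower()))
    return all(keyword.lower() in words for keyword in answer_keywords)


def feedback(transcript, answer_keywords, correct_answer):
    """Return the sentence the robot would speak in response to an answer."""
    if is_correct(transcript, answer_keywords):
        return "You are great, that is correct!"
    return f"Think again. The correct answer is: {correct_answer}"


print(feedback("I think it is Little Bear!", ["little", "bear"], "Little Bear"))
print(feedback("Daddy Bear", ["little", "bear"], "Little Bear"))
```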
In addition, when the reader's answer is wrong, the microphone 11 can be turned on again so that the reader can answer again; if the reader has not given the correct answer within a set number of attempts, the processing unit 3 then plays the correct answer through the speaker 21.
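The retry behaviour can be sketched as a bounded loop. The callables `get_answer` (capture one reply) and `check` (decide whether it is correct) and the default of three attempts are hypothetical stand-ins for the microphone capture and answer comparison described above.

```python
def ask_with_retries(get_answer, check, max_attempts=3):
    """Let the reader answer up to `max_attempts` times.

    Returns (answered_correctly, attempts_used). When the result is
    (False, max_attempts), the caller would play the correct answer.
    """
    for attempt in range(1, max_attempts + 1):
        if check(get_answer()):
            return True, attempt
    return False, max_attempts


replies = iter(["cat", "dog", "bear"])
print(ask_with_retries(lambda: next(replies), lambda reply: reply == "bear"))
```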
In one possible embodiment, if the reader answers with a gesture, the camera 12 of the intelligent robot captures an image or video; the vision processing unit 32 in the processing unit 3 recognizes the captured image (for video, the vision processing unit 32 processes each frame separately) to obtain multiple feature values of the current image, determines from these feature values the number of fingers shown and the finger combination, and thereby determines the specific number the reader is expressing, which is then compared with the answer to the corresponding question stored in the resource library 42 in the storage unit 4. If the determined number is the same as the answer to the corresponding question in the resource library 42, the reader's answer is correct, and the processing unit 3 can play a voice message such as "Correct answer" or "You are great, that is correct" through the speaker 21 so that the reader knows the answer was correct. If the determined number differs from the answer, the reader's answer is wrong, and the processing unit 3 can play a voice message such as "Wrong answer" or "Think again, is there a better answer?" through the speaker 21 so that the reader knows the answer was wrong, and then play the correct answer.
Of course, the intelligent robot in the embodiments of this application can also turn on the microphone 11 and the camera 12 at the same time and, by obtaining voice information together with image or video information, determine the reader's answer more accurately.
In one possible embodiment, as shown in Figure 8, if the intelligent robot has a touch screen, the processing unit 3 can present the question on the touch screen in the form of a picture. For example, the intelligent robot asks the reader "Who do you think will turn into a little rabbit in this story?". After hearing the question, the reader can tap the position of "Daddy Bear", "Little Bear", or another object on the screen; the processing unit 3 then compares the tapped position with the answer to the corresponding question stored in the resource library 42 to determine whether the tapped position is correct. If the tapped position is correct, the reader's answer is correct, and the reader is told so by a voice message through the speaker 21 or by a message shown on the touch screen; if the tapped position is wrong, the reader's answer is wrong, and the reader is told so in the same way.
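Comparing a tapped position with the stored answer amounts to a hit test against labelled screen regions. The rectangular `Region` type, the pixel coordinates, and the labels below are assumptions for illustration; the application does not state how on-screen objects are delimited.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle on the touch screen, in pixels (assumed layout)."""
    label: str
    x: int
    y: int
    width: int
    height: int

    def contains(self, px, py):
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


def tapped_label(regions, px, py):
    """Return the label of the first region containing the tap, or None."""
    for region in regions:
        if region.contains(px, py):
            return region.label
    return None


regions = [Region("Daddy Bear", 0, 0, 100, 100), Region("Little Bear", 100, 0, 100, 100)]
answer = "Little Bear"
print(tapped_label(regions, 150, 50) == answer)
```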
After the intelligent robot has asked the reader all the questions of the first round, the reading result calculation unit 33 in the processing unit 3 counts the questions the reader answered correctly in this round and calculates the correct answer rate. If the correct answer rate does not exceed the set threshold, the intelligent robot does not enter the second interpretation round; if it exceeds the set threshold, the intelligent robot enters the second interpretation round.
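The end-of-round decision can be written in a few lines. This sketch follows the wording above, in which the rate must strictly exceed the threshold; the function names and the representation of results as a list of booleans are assumptions.

```python
def correct_rate(results):
    """Fraction of questions answered correctly; `results` is a list of bools."""
    return sum(results) / len(results) if results else 0.0


def enters_next_round(results, threshold):
    """True only when the correct answer rate strictly exceeds the threshold."""
    return correct_rate(results) > threshold


round_one = [True] * 7 + [False] * 3  # 7 of 10 questions answered correctly
print(enters_next_round(round_one, threshold=0.6))
```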
In addition, after the intelligent robot finishes this interpretation, the processing unit 3 stores information such as the picture book interpreted, its type, the rounds interpreted, and the positive answer rate of each round in the first picture book reading record database 412 and the second picture book reading record database 413, in the formats of Table 2 and Table 3, so that when the reader interprets the picture book again, the result of this interpretation can be used to determine which round to enter directly.
Furthermore, because a very large number of picture books are on the market, different picture books may share some of the same stories, the same picture book may be published by different publishers, or the same picture book may be presented in different languages. The processing unit 3 therefore need not include information such as the picture book cover when generating the interpretation information. In this way, the interpretation round need not be determined rigidly from the reading history of each individual picture book; instead, the interpretation round of the current picture book can be determined according to whether there is a reading record for a similar picture book.
Furthermore, take Tatsuya Miyanishi's Superman series of picture books (three books in total) as an example. The series includes the three picture books "Addition Superman and Arithmetic Starman", "Fantasy Superman", and "The Righteous Man". Assume that the resource library 42 stores these three picture books, produced in the manner shown in Figures 4 to 6 and described in the corresponding text (with "Method 2" selected for the "Rule for entering the next interpretation round" option). Suppose that, following the foregoing rules, the reader has completed the first round of "Addition Superman and Arithmetic Starman" and the first round of "The Righteous Man" but has not yet read "Fantasy Superman", and that the reader answered 7 of the 10 first-round questions of "Addition Superman and Arithmetic Starman" positively and 8 of the first-round questions of "The Righteous Man" positively. At this point, the first picture book reading record database 412 records one piece of interpretation information for each of the two books, and the second picture book reading record database 413 records one set of interpretation information, in which the average positive answer rate is the average over the already-read picture books of the series: (0.7 + 0.8) / 2 = 75%.
When the reader starts reading "Fantasy Superman" for the first time, the record in the basic picture book information shows that "Fantasy Superman" belongs to the Superman series. According to the reading round calculation rule, the interpretation information in the second picture book reading record database 413 is queried. The interpretation information shows that the Superman series has reached the required number of already-read picture books and that, in the first-round interpretation Q&A interactions of the already-read picture books of the series, the reader's average positive answer rate is 75% > 60%, so the intelligent robot starts the interpretation directly from the second round.
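The series rule in this example can be sketched as below. The function names and the `min_books_read` parameter are assumptions; in particular, the rule text says "more than 2" picture books while the example proceeds with exactly two read, so this sketch treats two already-read books as sufficient in order to reproduce the example.

```python
def series_average(first_round_rates):
    """Mean first-round positive answer rate over the already-read books."""
    return sum(first_round_rates) / len(first_round_rates)


def starting_round_in_series(first_round_rates, threshold=0.6, min_books_read=2):
    """Start at round 2 when enough series books were read and the average
    first-round rate exceeds the threshold; otherwise start at round 1."""
    if len(first_round_rates) < min_books_read:
        return 1
    return 2 if series_average(first_round_rates) > threshold else 1


# 7 of 10 and 8 of 10 first-round questions answered positively.
rates = [7 / 10, 8 / 10]
print(starting_round_in_series(rates))
```

Under these assumptions the unread third book of the series starts at round 2, matching the outcome described above.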
The picture book interpretation method provided by the embodiments of this application classifies the questions used to interpret a picture book by difficulty. When a reader reads the picture book for the first time, the reader is asked the simplest type of questions; when the historical records show that the reader has interpreted the picture book several times, the reader is asked the questions of the corresponding round. The reader is thus asked suitable questions in an intelligent way, which effectively helps the reader understand the picture book. After the questions of one type have been asked, the correct rate of the reader's answers determines whether questions of the next, more difficult type are asked, so that the reader's understanding of the story is guided step by step and the reader's reading comprehension improves.
Figure 9 is a schematic structural diagram of a terminal device provided by an embodiment of this application. As shown in Figure 9, the terminal device 900 may be the intelligent robot described above and includes a sensor 901, a display 902, a processor 903, a memory 904, a communication interface 905, and a bus 906. The processor 903, the memory 904, and the communication interface 905 in the terminal device can establish communication connections through the bus 906.
The sensor 901 is used to obtain the reader's voice information and image or video information, and to output audio and video information. The sensor 901 may include a camera, a microphone, a speaker, and so on.
The display 902 is used to display processed data, such as video and images.
The processor 903 may be a central processing unit (CPU).
The memory 904 may include volatile memory, such as random-access memory (RAM); it may also include non-volatile memory, such as read-only memory (ROM), flash memory, a hard disk drive (HDD), or a solid state drive (SSD); the memory 904 may also include a combination of the above types of memory.
The interpretation methods provided in the foregoing embodiments are all executed by the processor 903. Data such as pictures, voice, and picture book content are stored in the memory 904. In addition, the memory 904 is also used to store the program instructions that the processor 903 executes to implement the picture book interpretation method described in the foregoing embodiments.
A person of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in the embodiments disclosed herein can be implemented by electronic hardware, or by a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and the design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the embodiments of this application.
In addition, the various aspects or features of the embodiments of this application can be implemented as a method, an apparatus, or an article of manufacture using standard programming and/or engineering techniques. The term "article of manufacture" as used in this application covers a computer program accessible from any computer-readable device, carrier, or medium. For example, computer-readable media may include, but are not limited to: magnetic storage devices (for example, hard disks, floppy disks, or magnetic tapes), optical discs (for example, compact discs (CD) and digital versatile discs (DVD)), smart cards, and flash memory devices (for example, erasable programmable read-only memory (EPROM), cards, sticks, or key drives). In addition, the various storage media described herein may represent one or more devices and/or other machine-readable media for storing information. The term "machine-readable medium" may include, but is not limited to, wireless channels and various other media capable of storing, containing, and/or carrying instructions and/or data.
The above embodiments may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of this application are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another; for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wired means (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or wireless means (for example, infrared, radio, or microwave). The computer-readable storage medium may be any usable medium that a computer can access, or a data storage device such as a server or data center that integrates one or more usable media. The usable medium may be a magnetic medium (for example, a floppy disk, hard disk, or magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
应当理解的是,在本申请实施例的各种实施例中,上述各过程的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that in the various embodiments of the embodiments of the present application, the size of the sequence number of the above-mentioned processes does not mean the order of execution. The execution order of the processes should be determined by their functions and internal logic, and should not be dealt with. The implementation process of the embodiments of the present application constitutes any limitation.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, apparatuses, and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through interfaces, apparatuses, or units, and may be electrical, mechanical, or of another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
If the functions are implemented in the form of software functional units and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application, in essence, or the part that contributes to the prior art, or a part of the technical solutions, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, an access network device, or the like) to perform all or some of the steps of the methods described in the embodiments of this application. The aforementioned storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The foregoing describes only specific implementations of the embodiments of this application, but the protection scope of the embodiments of this application is not limited thereto. Any variation or replacement that a person skilled in the art could readily conceive within the technical scope disclosed in the embodiments of this application shall fall within the protection scope of the embodiments of this application.

Claims (15)

  1. A method for interpreting a picture book, performed by a terminal device, wherein the method comprises:
    determining a current picture book that a reader is reading;
    obtaining historical information of the current picture book, wherein the historical information comprises the number of times the current picture book has previously been interpreted with the reader and the positive-answer rate of each round of questions posed during each of those interpretations; and
    posing a first round of questions to the reader, wherein the first round of questions is determined according to the historical information.
  2. The method according to claim 1, wherein the method further comprises:
    receiving the reader's answers to the questions in the first round of questions; and
    when the positive-answer rate of the answers to those questions is greater than a set threshold, posing a second round of questions to the reader, wherein the questions in the second round differ from the questions in the first round.
  3. The method according to claim 2, wherein:
    the order of the first round of questions and the second round of questions is set according to their degree of difficulty.
  4. The method according to any one of claims 1 to 3, wherein before the obtaining of the historical information of the current picture book, the method further comprises: obtaining identity information of the reader; and
    the obtaining of the historical information of the current picture book comprises: obtaining the historical information of the current picture book that corresponds to the identity information.
  5. The method according to any one of claims 1 to 4, wherein after the determining of the current picture book that the reader is reading, the method further comprises:
    determining whether the current picture book is stored in a resource library;
    when the resource library does not contain the current picture book, determining whether the content of the current picture book is the same as the content of a first picture book already stored in the resource library; and
    when the content of the current picture book is the same as the content of the first picture book, posing to the reader the rounds of questions corresponding to the first picture book.
  6. The method according to any one of claims 1 to 5, wherein the method further comprises:
    determining the page number of the current picture book that the reader is reading; and
    posing to the reader a round of questions on the content corresponding to the page number.
  7. The method according to any one of claims 1 to 6, wherein the determining of the current picture book that the reader is reading comprises:
    acquiring, through a microphone, voice information of the current picture book that the reader is reading; and
    converting the voice information into picture book text, wherein the picture book text is used to determine, from the resource library, the current picture book that the reader is reading.
  8. The method according to any one of claims 1 to 6, wherein the determining of the current picture book that the reader is reading comprises:
    acquiring, through a camera, an image of the current picture book that the reader is reading; and
    identifying features in the image to obtain a picture book feature value, wherein the picture book feature value is used to determine, from the resource library, the current picture book that the reader is reading.
  9. The method according to claim 1, wherein the method further comprises:
    incrementing by one, in the historical information, the number of times the current picture book has been interpreted with the reader, and storing the positive-answer rate of the reader's answers to each round of questions during the present interpretation.
  10. The method according to any one of claims 1 to 9, wherein before the posing of the first round of questions to the reader, the method comprises:
    receiving an interpretation instruction, wherein the interpretation instruction is used to instruct that the first round of questions be posed to the reader.
  11. An apparatus for interpreting a picture book, comprising: a transceiver, a processor, and a memory, wherein:
    the transceiver is configured to receive and send data; and
    the memory stores one or more programs, the one or more programs comprise instructions, and when the instructions are executed by the processor, the electronic device is caused to perform the method according to any one of claims 1 to 10.
  12. An electronic device, comprising: a camera and/or a microphone, a memory, and a processor that performs the method according to claims 1 to 10.
  13. A smart robot, comprising:
    a camera and/or a microphone, configured to receive voice information or image information of a reader reading a picture book, and voice information or image information of the reader's answers to questions corresponding to the picture book;
    a memory, configured to store a data package of at least one picture book, as well as, in the history of each picture book, the number of times the current picture book has previously been interpreted with the reader and the positive-answer rate of each round of questions posed during each of those interpretations;
    a processor, configured to process the voice information or image information acquired by the camera and/or the microphone, determine from the memory the current picture book that the reader is reading, and determine which round of questions to pose to the reader according to the number of interpretations of the current picture book in the memory and the mean of the positive-answer rates obtained for each round of questions across the interpretations;
    a speaker, configured to play speech to the reader; and
    a communication unit, configured to receive the data package of each picture book and the positive-answer-rate threshold for advancing from the current round to the next round.
  14. A readable storage medium, configured to store instructions, wherein when the instructions are executed, the method according to any one of claims 1 to 10 is implemented.
  15. A computer program device comprising instructions which, when run on a terminal, cause the terminal to perform the method according to any one of claims 1 to 10.
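
The round-selection logic recited in claims 1, 2, 9 and 13 can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The names `BookHistory`, `record_session`, `choose_starting_round` and `should_advance` are hypothetical, and the policy of skipping any round whose mean historical positive-answer rate exceeds the threshold is an assumption; the claims state only that the round is determined from the interpretation count and the per-round mean.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Dict, List


@dataclass
class BookHistory:
    """Hypothetical per-reader history for one picture book."""
    interpretation_count: int = 0
    # rates_by_round[r] lists the positive-answer rate of round r,
    # one entry per past interpretation in which round r was asked.
    rates_by_round: Dict[int, List[float]] = field(default_factory=dict)

    def record_session(self, session_rates: Dict[int, float]) -> None:
        """Add one interpretation and store its per-round rates (claim 9)."""
        self.interpretation_count += 1
        for round_no, rate in session_rates.items():
            self.rates_by_round.setdefault(round_no, []).append(rate)


def positive_answer_rate(answers: List[bool]) -> float:
    """Fraction of answers judged positive; 0.0 when there are no answers."""
    return sum(answers) / len(answers) if answers else 0.0


def choose_starting_round(history: BookHistory, threshold: float,
                          last_round: int) -> int:
    """Assumed policy: start at the first round whose historical mean
    positive-answer rate does not exceed the threshold (claims 1 and 13)."""
    round_no = 1
    while round_no < last_round:
        rates = history.rates_by_round.get(round_no)
        if not rates or mean(rates) <= threshold:
            break
        round_no += 1
    return round_no


def should_advance(answers: List[bool], threshold: float) -> bool:
    """Move to the next round only when the rate is strictly greater
    than the threshold (claim 2)."""
    return positive_answer_rate(answers) > threshold
```

In this sketch a reader with no history always starts at round 1, and a reader who has consistently answered round 1 well starts at round 2, which matches the ordering by difficulty in claim 3 if lower round numbers hold the easier questions.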
PCT/CN2021/080269 2020-04-30 2021-03-11 Method and apparatus for interpreting picture book, electronic device and smart robot WO2021218432A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010365911.XA CN111613100A (en) 2020-04-30 2020-04-30 Interpretation and drawing method and device, electronic equipment and intelligent robot
CN202010365911.X 2020-04-30

Publications (1)

Publication Number Publication Date
WO2021218432A1 true WO2021218432A1 (en) 2021-11-04

Family

ID=72203096

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/080269 WO2021218432A1 (en) 2020-04-30 2021-03-11 Method and apparatus for interpreting picture book, electronic device and smart robot

Country Status (2)

Country Link
CN (1) CN111613100A (en)
WO (1) WO2021218432A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114554079A (en) * 2022-01-11 2022-05-27 浙江大华技术股份有限公司 Intelligent service management method and intelligent service management system

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111613100A (en) * 2020-04-30 2020-09-01 华为技术有限公司 Interpretation and drawing method and device, electronic equipment and intelligent robot
CN113420131A (en) * 2021-06-11 2021-09-21 洪恩完美(北京)教育科技发展有限公司 Reading guide method and device for children picture book and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102289960A (en) * 2011-07-28 2011-12-21 赵冬 Interactive children's electronic book system
CN103680222A (en) * 2012-09-19 2014-03-26 镇江诺尼基智能技术有限公司 Question-answer interaction method for children stories
CN107316507A (en) * 2016-04-26 2017-11-03 它它(上海)信息科技有限公司 A kind of children paint this reading auxiliary system
US20170337841A1 (en) * 2016-05-20 2017-11-23 Creative Styles LLC Interactive multimedia story creation application
CN109940627A (en) * 2019-01-29 2019-06-28 北京光年无限科技有限公司 It is a kind of towards the man-machine interaction method and system of drawing this reading machine people
CN111613100A (en) * 2020-04-30 2020-09-01 华为技术有限公司 Interpretation and drawing method and device, electronic equipment and intelligent robot

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6101367A (en) * 1999-09-20 2000-08-08 Luciano; Philip P. Combination question-answer book and answer display
US8277221B2 (en) * 2004-12-15 2012-10-02 Justin Clarke Gomez Tagle, legal representative Talking plush novelty toy and interactive story book for teaching a reader desired behavioral patterns to protect oneself from sexual predators
CN104281847B (en) * 2013-07-12 2017-10-03 步步高教育电子有限公司 A kind of reading method, device and equipment
CN108509136A (en) * 2018-04-12 2018-09-07 山东音为爱智能科技有限公司 A kind of children based on artificial intelligence paint this aid reading method
CN108845786A (en) * 2018-05-31 2018-11-20 北京智能管家科技有限公司 Intelligent reading partner method, apparatus, equipment and storage medium
CN109637207B (en) * 2018-11-27 2020-09-01 曹臻祎 Preschool education interactive teaching device and teaching method
CN109858391A (en) * 2019-01-11 2019-06-07 北京光年无限科技有限公司 It is a kind of for drawing the man-machine interaction method and device of robot
CN110060524A (en) * 2019-04-30 2019-07-26 广东小天才科技有限公司 The method and reading machine people that a kind of robot assisted is read

Also Published As

Publication number Publication date
CN111613100A (en) 2020-09-01

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21797199

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21797199

Country of ref document: EP

Kind code of ref document: A1