CN113626013A - Automatic interpretation method and device for slides - Google Patents

Automatic interpretation method and device for slides Download PDF

Info

Publication number
CN113626013A
CN113626013A CN202110898884.7A CN202110898884A CN113626013A CN 113626013 A CN113626013 A CN 113626013A CN 202110898884 A CN202110898884 A CN 202110898884A CN 113626013 A CN113626013 A CN 113626013A
Authority
CN
China
Prior art keywords
current
read
script
text
explanation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110898884.7A
Other languages
Chinese (zh)
Inventor
杨永志
徐世波
陈小卫
孙建
赵伟峰
柯宏发
王保顺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peoples Liberation Army Strategic Support Force Aerospace Engineering University
Original Assignee
Peoples Liberation Army Strategic Support Force Aerospace Engineering University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peoples Liberation Army Strategic Support Force Aerospace Engineering University filed Critical Peoples Liberation Army Strategic Support Force Aerospace Engineering University
Priority to CN202110898884.7A priority Critical patent/CN113626013A/en
Publication of CN113626013A publication Critical patent/CN113626013A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/31Programming languages or programming paradigms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/42Syntactic analysis
    • G06F8/427Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/06Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • G09B5/065Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Business, Economics & Management (AREA)
  • Acoustics & Sound (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

The invention discloses an automatic explanation method and a device of a slide, wherein the method comprises the following steps: writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: acquiring a current explanation script under the condition that a current text to be read and a current command to be executed are received and an automatic explanation request for the current slide to be explained is received; identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script; and sequentially reading the current text to be read and executing the current command to be executed. In the process, the current explanation script is written for the current slide to be explained in advance, the reading of the text to be read aloud in the explanation script and the execution of the operation instruction corresponding to the command to be executed in the explanation script are completed, the corresponding function is realized based on the operation instruction, the automatic explanation of the slide to be explained is realized, and the learning of the related knowledge in the slide is completed.

Description

Automatic interpretation method and device for slides
Technical Field
The invention relates to the technical field of data processing, in particular to an automatic explanation method and device for slides.
Background
The turnover class translation (Flipped class or Inverted class) can also be translated into 'reverse class', which means that the time inside and outside the class is readjusted to transfer the decision right of learning from the teacher to the students. In this mode of teaching, the students can concentrate more on active project-based learning, co-study the challenges of localization or globalization and other real-world facing problems, and gain a deeper understanding, at the precious time in the class. Teachers do not take time in classes to teach related knowledge, students can learn the related knowledge before classes to finish independent learning, and the traditional classes adopt computer slides and teacher explanation to finish the teaching of the related knowledge.
Therefore, it is highly desirable to provide an automatic interpretation method of slides to help students to complete learning of related knowledge in slides, thereby supporting the implementation of a flip classroom form: and after the teacher automatically explains the teaching, the teacher answers questions and organizes the practical practice and operation of the student. The automatic explanation technology minimizes the manufacturing overhead of the turnover classroom, and has the advantages of lecture increment evolution, teacher burden reduction and the like.
Disclosure of Invention
In view of the above, the present invention provides an automatic interpretation method and apparatus for slides, which are used to support the realization of a flip classroom form, and the automatic interpretation technology minimizes the manufacturing overhead of the flip classroom, and has the advantages of evolving lecture material increment, reducing the burden of instructors, and the like. The specific scheme is as follows:
an automatic interpretation method of a slide show, comprising:
writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: the current text to be read and the current command to be executed;
under the condition that an automatic explanation request for the slide to be explained currently is received, acquiring the current explanation script;
identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script;
and sequentially reading the current text to be read and executing the current command to be executed.
Optionally, the method for identifying the current explanation script and determining the current text to be read and the current command to be executed in the current explanation script includes:
traversing the current explanation script and sequentially identifying the current explanation script;
and determining a part between the starting identifier and the ending identifier in the current explanation script as a current command to be executed, and determining the rest as a current text to be read, wherein the number of the current scripts to be read is at least one, and the number of the current commands to be executed is at least one.
Optionally, the method for sequentially reading the current text to be read and executing the current command to be executed includes:
sequentially analyzing the current explanation script;
if the current text to be read is analyzed in the current explanation script, reading the current text to be read based on a text-to-speech conversion technology;
and if the current command to be executed in the current explanation script is analyzed, traversing a preset command set, searching a method corresponding to the current command to be executed, and executing an operation instruction corresponding to the method.
The above method, optionally, further includes:
adding a control interface;
and controlling and displaying any slide to be explained based on the control interface.
The above method, optionally, further includes:
acquiring the current text to be read aloud;
and synchronously displaying the current text to be read in a preset position in the current slide to be explained while reading the text to be read.
An automatic interpretation apparatus of a slide show, comprising:
the compiling module is used for compiling a current explaining script for a current slide to be explained in advance, wherein the current explaining script comprises the following components: the current text to be read and the current command to be executed;
the first acquisition module is used for acquiring the current explanation script under the condition of receiving an automatic explanation request for the current slide to be explained;
the determining module is used for identifying the current explanation script and determining the current text to be read and the current command to be executed in the current explanation script;
and the reading and executing module is used for sequentially reading the current text to be read and executing the current command to be executed.
The above apparatus, optionally, the determining module includes:
the identification unit is used for traversing the current explanation script and sequentially identifying the current explanation script;
and the determining unit is used for determining a part between the starting identifier and the ending identifier in the current explanation script as a current command to be executed, and determining the rest of the current explanation script as a current text to be read, wherein the number of the current scripts to be read is at least one, and the number of the current commands to be executed is at least one.
The above apparatus, optionally, the reading and executing module includes:
the analysis unit is used for sequentially analyzing the current explanation scripts;
the reading unit is used for reading the current text to be read based on a text-to-speech conversion technology if the current text to be read in the current explanation script is analyzed;
and the execution unit is used for traversing a preset command set if the current command to be executed in the current explanation script is analyzed, searching a method corresponding to the current command to be executed, and executing an operation instruction corresponding to the method.
The above apparatus, optionally, further comprises:
the adding module is used for adding a control interface;
and the control module is used for controlling and displaying any slide to be explained based on the control interface.
The above apparatus, optionally, further comprises:
the second acquisition module is used for acquiring the current text to be read aloud;
and the display module is used for synchronously displaying the current text to be read in the preset position of the current slide to be explained while reading the text to be read.
Compared with the prior art, the invention has the following advantages:
the invention discloses an automatic explanation method and a device of a slide, wherein the method comprises the following steps: writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: acquiring a current explanation script under the condition that a current text to be read and a current command to be executed are received and an automatic explanation request for the current slide to be explained is received; identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script; and sequentially reading the current text to be read and executing the current command to be executed. In the process, the current explanation script is written for the current slide to be explained in advance, the reading of the text to be read aloud in the explanation script and the execution of the operation instruction corresponding to the command to be executed in the explanation script are completed, the corresponding function is realized based on the operation instruction, the automatic explanation of the slide to be explained is realized, and the learning of the related knowledge in the slide is completed.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flowchart of an automatic interpretation method of slides according to an embodiment of the present invention;
fig. 2 is a block diagram of an automatic interpretation apparatus for slides according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention discloses an automatic explanation method and a device for slides, which are used for realizing the automatic explanation of contents in the slides, students can finish the independent learning of knowledge through the automatic explanation method in the specific realization process facing a turnover classroom, and after the automatic explanation, the students answer questions and organize the practical practice and operation of the students. The automatic explanation technology minimizes the manufacturing overhead of the turnover classroom, and has the advantages of lecture increment evolution, teacher burden reduction and the like. Therefore, the present invention provides an automatic interpretation method of a slide show, the execution flow of the method is shown in fig. 1, and the method comprises the steps of;
s101, writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: the current text to be read and the current command to be executed;
in the embodiment of the present invention, an explanation script is written in advance based on experience or specific conditions, where the current explanation script includes a current text to be presented and a current command to be executed, where the text to be presented is an explanation of knowledge in a slide to be explained, and the current command to be executed is an operation instruction executed in an explanation process, for example, a suitable pause is performed, an animation in the slide is displayed, or other documents or application programs are opened, and the current explanation text may be in a form of a file that is independent from the slide to be explained, and the file is associated with the corresponding slide by identification, or may be associated with a remark page of the slide to be explained.
Furthermore, at least one slide to be explained currently is arranged, and all slides are sequentially arranged according to the explanation sequence.
S102, acquiring the current explanation script under the condition of receiving an automatic explanation request for the current slide to be explained;
in the embodiment of the present invention, when an automatic explanation instruction is received, a current explanation script of the current slide to be explained is determined based on a pre-established association relationship, where the pre-established association relationship may be established based on an identifier and a remark page or in other manners, and the automatic explanation instruction may be implemented by a user by clicking a button or a related play key, which is not specifically limited in the embodiment of the present invention.
S103, identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script;
in the embodiment of the present invention, the current interpretation script is traversed, and the content included in the current interpretation script is sequentially recognized, preferably, because the current interpretation script only includes the current text to be read aloud and the current command to be executed, and the current text to be read aloud is at least one, each current text to be read aloud may be the same or different, the current command to be executed is at least one, each current command to be executed may be the same or different, in writing the current interpretation script, a start identifier and an end identifier are assigned for each current command to be executed in the current interpretation script, the start identifier is used as the start, the end identifier is used as the end, the part included between the start identifier and the end identifier is used as the current command to be executed, and the part between each executed command is used as the current text to be read aloud, in the embodiment of the present invention, specific existence forms of the start identifier and the end identifier are not limited.
For example, the current explanation script is illustrated, and assuming that the starting identifier is [ and the ending identifier is ], the current explanation script is: a, [ command 1 ] [ command 2 ] [ BC [ command 3 ], wherein ABC in the current explanation script is the current text to be read aloud, and command 1, command 2 and command are the current commands to be executed
And S104, sequentially reading the current text to be read and executing the current command to be executed.
In the embodiment of the present invention, after the current text to be read and the current command to be executed are determined, for a speech interpretation portion, the current text to be read is recognized and converted into speech by a speech reading engine in a text-to-speech conversion technology tts (texttospeech), and a specific conversion principle is as follows:
text analysis, namely performing linguistic analysis on the current text to be read aloud, and performing lexical, grammatical and semantic analysis on the current text to be read aloud sentence by sentence to determine the low-level structure of the sentence and the composition of each word phoneme, including text break, word segmentation, polyphone processing, digital processing, abbreviation processing and the like.
And voice synthesis, namely extracting the single character or phrase corresponding to the processed current text to be read from a voice synthesis library, and converting the linguistic description into a speech waveform.
Prosodic processing-synthesized Speech quality (Qualityof Synthetic Speech) refers to the quality of Speech output by a Speech synthesis system, and is generally subjectively evaluated in terms of intelligibility (or intelligibility), naturalness, and coherence. Clarity is the percentage of meaningful words that are correctly heard; the naturalness is used for evaluating whether the tone quality of the synthesized voice is close to the voice of a person and whether the tone of the synthesized word is natural; coherence is used to evaluate whether a synthesized sentence is fluent.
Because com. speak a given text; setSpeed, setting the reading speed.
For the current command to be executed, in the embodiment of the present invention, if the current command to be executed in the current interpretation script is analyzed, a preset command set is traversed, a method corresponding to the current command to be executed is searched, and an operation instruction corresponding to the method is executed. If the current command to be executed is a command supported by the openoffice, calling an API of the openoffice to realize a corresponding function; if the command is java, the corresponding function is realized through the java, and if the script starting mode of OpenOffice is adopted, the starting of the automatic explanation software facing the turnover classroom is finished by the BeanShell script. The script content is as follows:
Figure BDA0003196044850000071
Figure BDA0003196044850000081
Figure BDA0003196044850000091
in the embodiment of the present invention, the current command to be executed is a bearer shell script, and an execution command for starting a slide show is taken as an example:
Figure BDA0003196044850000092
the effect is to call the command manager, get the PPT Proxy object, and then call the startpresentation method of the object to start the presentation. The preset command set is given as follows. These commands can all be invoked as needed in the current interpretation script. The preset command set includes: CANCEL (), DL (long), DL1(), DL2(), DL3(), EXITFUYIN (), goto (int), NE (), NOOP (), NS (), PAUSE (strings), PE (), PLAY (), PS (), q _ jsapispeak (String), q _ pars (slope), q _ pars (dark, etc.), SPEED (slope), SPEED (String), SPEED (slope, String), SPEED (slope ), SPEED (slope ), srwidth (slope ), srid (slope ), srid (slope, slope (2), and slope (slope, slope; hello World "you so" where the SPEED command is called to set the reading SPEED to 160 words per minute. Note that, in "[ is ] are commands, and in these commands, the parameter brackets, sentence semicolons, and the like use english half-angle symbols.
The invention discloses an automatic explaining method of a slide, which comprises the following steps: writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: acquiring a current explanation script under the condition that a current text to be read and a current command to be executed are received and an automatic explanation request for the current slide to be explained is received; identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script; and sequentially reading the current text to be read and executing the current command to be executed. In the process, the current explanation script is written for the current slide to be explained in advance, the reading of the text to be read aloud in the explanation script and the execution of the operation instruction corresponding to the command to be executed in the explanation script are completed, the corresponding function is realized based on the operation instruction, the automatic explanation of the slide to be explained is realized, and the learning of the related knowledge in the slide is completed.
In the embodiment of the present invention, the method further includes: adding a control interface, and controlling and displaying any slide to be explained based on the control interface, wherein the control interface (ControlFrame) is a simple window and is internally provided with buttons with various control functions. After clicking the buttons, the corresponding operation is completed by sending corresponding commands to the command manager. And the control interface manager controls the generation and display of the ControlFrame. The control interface can complete the functions of Current, Following, ALL, Cancel, Resume and subtitle.
Further, acquiring the current text to be read aloud; and synchronously displaying the current text to be read at a preset position in the current slide to be explained while reading the text to be read to realize a subtitle function, wherein the subtitle function is to construct a JFrame without a title bar and display the subtitle text in a given format.
Furthermore, JNA is adopted in the embodiment of the invention to realize the monitoring of mouse and keyboard events in the system range. The JNA class library uses a small native class library sub to dynamically call native code. The programmer only needs to describe the structure and some basic properties of the method of the native code to be called next using a specific java interface. This saves a lot of configuration and compilation code to adapt to multiple platforms. Because all calls are interfaces in the common jar package provided by the JNA.
On the basis that the JNA intercepts mouse and keyboard events, a set of Event and EventListener mechanisms are simply defined and used for supporting a user to flexibly register Event response codes. On the basis of the method, the following shortcut keys and commands are supported:
start slide show (Alt + G, ASCII code 71), execute command: PLAY ();
read all pages (Alt + a, ASCII code 65), execute the command: q _ spaakallslides ();
read subsequent page (Alt + F, ASCII code 70), execute command: q _ spaakfollowslides ();
reading current page (Alt + C, ASCII code 67), executing command: q _ spackcurrentslide ();
pause the interpretation (Alt + K, ASCII code 75), execute the command: PAUSE ();
continuing with the explanation (Alt + R, ASCII code 82), the command is executed: RESUME ();
cancel all the narrated content (Alt + X, ASCII code 88), execute the command: CANCEL ();
display/closed caption (Alt + S, ASCII code 83), execution command: SWITCHSRT ();
in the embodiment of the invention, the automatic explanation software facing the turnover classroom is designed based on the method, and the software is in OpenOffice (compatible with LibreOffice, the same below) and exists in a plug-in form of OpenOffice. Openoffice (libreoffice) is the latest version available 12 months later in 2016. The automatic explanation software for the roll-over class is developed in Java language, and therefore, it is required to correctly configure JDK (version is greater than JDK1.6, 32-bit x86) used by OpenOffice.
The development environment is as follows:
an integrated development environment: eclipse Mars and above;
a source code warehouse: git;
a Maven warehouse: nexus.
In the embodiment of the present invention, the automatic interpretation software mainly includes 7 classes: the IPPTproxy interface is used to abstract the slideshow software so that the system can support more slideshow software in the future. BshOpenOfficeProxy is realized by IPPTProxy of OpenOffice, and obtains some necessary information through a BeanShell script supported by OpenOffice, so that the function defined in the IPPTProxy is realized for other modules.
CommandMan, a command manager, for managing and executing individual instructions. By representing the instructions in the form of a benshell script, the user may be allowed to execute and extend various types of commands. CommandMan simultaneously holds the implementation object of IPPTProxy, so that other modules can access the functions of the PPT player through the CommandMan.
The mouse KeyboardMan is a mouse and keyboard manager and is used for managing mouse and keyboard operations in a system range, packaging the operations into events by monitoring the mouse and keyboard operations and supporting an event monitor. And finally, the user registers the hot key and executes the corresponding instruction. Note that: the common Java code can only monitor the keyboard and mouse events within the range of the interface control (such as a window), and once the control loses Focus (Focus), the monitoring cannot be continued. MouseKeyboardMan is used to listen for mouse and keyboard events within the scope of the system, regardless of which interface is in focus.
SRTMan, a caption manager, a window for controlling the display of captions, a GUICONroller, a control interface manager and a control main interface (ControlFrame) window.
TTSMan, TTS manager, used to manage the speech reading engine, using com.cloudggarden.jsapi package to directly call various speech libraries installed in Windows system, supporting SAPI 5.
Wherein the PPT control interface (IPPTProxy) defines the following functions:
startPresentin, start the slide show.
isPresenting, judges whether or not a show is being performed.
getSlideCount, the number of slides in the file is obtained.
getcurrentshowslidindex, the slide index value of the current show is obtained.
gatherNoteOfShowslides, remarked words of slides within a specified range in the presentation are obtained, and calling the method can cause the start of the showing.
getNoteOfShowSlide, get the remark text of the specified index slide in the presentation, and invoking this method will result in starting the show.
gatherNoteOfslides, the remark words of the slides in the specified range are obtained, and the slide show cannot be started by calling the method.
getNoteOfSlide, obtains the remark text of the specified index slide, and calls the method to not start the slide show.
nextEffect,Show next effect of this slide in presentation。
nextSlide,Show next slide in presentation。
previousEffect,Show previous effect of this slide in presentation。
previousSlide,Show previous slide in presentation。
gotoSlide,goto given slide in presentation。
onExiting, the system is ready to exit, reclaiming the used resources.
Once the functions are provided for a certain PPT player, the automatic explanation software facing a turnover classroom can complete the acquisition of remarks and the control of explanation.
The BshOpenOfficeProxy is supported by a BeanShell script of OpenOffice, and three main objects are obtained: xscriptinvoccioncontext, XModel and XDesktop, the UNO interface of OpenOffice can be used to access the relevant objects: XPresentation supply, XPresentation2, XDrawPages supply, thereby implementing the function defined in IPPTProxy. These classes and their functions refer to the definitions in UNO and OpenOffice.
Wherein, the command manager (Command Man) has a command queue (Command queue: mainQueue) inside, and the command queue can include command items (ICommand QueueItem). ICommandQueueItem has two subclasses: CommandQueueItem and CommandQueue, the former for encapsulating a particular command and the latter meaning that it is clear that nesting between that queues is possible, thus forming nested queues. Nested queues may allow a user to undo a command sequence within a specified range in the future, thereby enhancing control functionality.
After the command manager is started, a command execution thread is started. The command execution thread constantly acquires a command item from the mainQueue (invokes the advanced command method) and then executes it. The command in each command item is a bearer shell script. The CommandMan internally creates a BeanShell interpreter (and executes the command script described later), and then submits the command to the interpreter for execution.
In the embodiment of the invention, in order to further enhance the function, a static webpage server and a WebSocket message forwarding server are provided, so that the mobile phone, the head-wearing VR/AR glasses and other devices can be supported. The static webpage server responds to the HTTP GET request of the designated port by monitoring the port and returns the static webpage file data of the request, thereby supporting the access to the static webpage. The extension provides three access interfaces: a void settrack (String path) for setting a root directory of the web page; starting an HTTP web server by using a bootstart (streaming path, int port) with parameters of a web root directory and a monitoring port number; void stop (), stop HTTP web server. The WebSocket message forwarding server supports different clients to broadcast messages through webpages based on WebSocket. The module will monitor all available network card designated ports of the host, receive incoming text messages, and broadcast them to all connected clients. List < String > getHostIps () to obtain the IP addresses of all network cards bound by the WebSocket message forwarding server; list < String > start (int port), start WebSocket message forwarding server on the appointed port, return all IP addresses of network cards bound; and (4) the void stop (), and the WebSocket message forwarding server is stopped. void bcastMessage (String msg), broadcasts a text message. Messages beginning with ": bsh:" will be sent to the command manager and interpreted for execution (not to the respective client).
Based on the static webpage server and the WebSocket message forwarding server, the mobile phone, the head-mounted VR/AR glasses and other devices can be supported to receive information such as voice in PPT, and meanwhile, control commands can be transmitted. For example, in a web page where a button such as "show" may send ". bsh: PLAY (); "etc. command, when WebSocket message forwarding server receives it, it is forwarded to command manager and interpreted and executed (show command).
Due to the adoption of the WebSocket technology, the client can customize the interface according to the preference of the equipment and the user, for example, the interface of the AR equipment MadGaze is simplified as follows:
when devices such as MadGaze are used, instant explanation content prompting can be realized, so that a speaker can leave a platform and move freely under the condition of wearing the glasses. The same effect can be achieved by using a mobile phone and the like.
Based on the above method for automatically explaining a slide, an embodiment of the present invention further provides an automatic explaining apparatus, where an execution flow of the automatic explaining apparatus is shown in fig. 2, and the method includes:
a writing module 201, a first obtaining module 202, a determining module 203 and a reading and executing module 204.
Wherein,
the compiling module 201 is configured to compile a current interpretation script for a current slide to be interpreted in advance, where the current interpretation script includes: the current text to be read and the current command to be executed;
the first obtaining module 202 is configured to obtain the current explanation script under the condition that an automatic explanation request for the current slide to be explained is received;
the determining module 203 is configured to identify the current explanation script, and determine the current text to be read and the current command to be executed in the current explanation script;
the reading and executing module 204 is configured to read the current text to be read and execute the current command to be executed in sequence.
The invention discloses an automatic explaining device of a slide, comprising: writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: acquiring a current explanation script under the condition that a current text to be read and a current command to be executed are received and an automatic explanation request for the current slide to be explained is received; identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script; and sequentially reading the current text to be read and executing the current command to be executed. In the process, the current explanation script is written for the current slide to be explained in advance, the reading of the text to be read aloud in the explanation script and the execution of the operation instruction corresponding to the command to be executed in the explanation script are completed, the corresponding function is realized based on the operation instruction, the automatic explanation of the slide to be explained is realized, and the learning of the related knowledge in the slide is completed.
In this embodiment of the present invention, the determining module 203 includes:
an identification unit 205 and a determination unit 206.
Wherein,
the identifying unit 205 is configured to traverse the current explanation script and sequentially identify the current explanation script;
the determining unit 206 is configured to determine a portion between the start identifier and the end identifier in the current explanation script as a current command to be executed, and determine the rest as a current text to be read aloud, where the number of the current scripts to be read aloud is at least one, and the number of the current commands to be executed is at least one.
In this embodiment of the present invention, the reading and executing module 204 includes:
a parsing unit 207, a reading unit 208 and an execution unit 209.
Wherein,
the parsing unit 207 is configured to parse the current explanation scripts in sequence;
the reading unit 208 is configured to, if the current text to be read is analyzed in the current interpretation script, read the current text to be read based on a text-to-speech conversion technology;
the execution unit 209 is configured to traverse a preset command set if the current to-be-executed command in the current interpretation script is analyzed, search for a method corresponding to the current to-be-executed command, and execute an operation instruction corresponding to the method.
In the embodiment of the present invention, the apparatus further includes:
an add module 210 and a control module 211.
Wherein,
the adding module 210 is configured to add a control interface;
the control module 211 is configured to control display of any slide to be explained based on the control interface.
In the embodiment of the present invention, the apparatus further includes:
a second acquisition module 212 and a presentation module 213.
Wherein,
the second obtaining module 212 is configured to obtain the current text to be read aloud;
the display module 213 is configured to read the text to be read and simultaneously display the current text to be read at a preset position in the current slide to be explained.
It should be noted that, in the present specification, the embodiments are all described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments may be referred to each other. For the device-like embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The method and the device for automatically explaining the slide provided by the invention are described in detail, a specific example is applied in the text to explain the principle and the implementation mode of the invention, and the description of the embodiment is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (10)

1. An automatic interpretation method of a slide, comprising:
writing a current explanation script for a current slide to be explained in advance, wherein the current explanation script comprises: the current text to be read and the current command to be executed;
under the condition that an automatic explanation request for the slide to be explained currently is received, acquiring the current explanation script;
identifying the current explanation script, and determining the current text to be read and the current command to be executed in the current explanation script;
and sequentially reading the current text to be read and executing the current command to be executed.
2. The method of claim 1, wherein identifying the current interpretation script, determining the current to-be-spoken text and the current to-be-executed command in the current interpretation script comprises:
traversing the current explanation script and sequentially identifying the current explanation script;
and determining a part between the starting identifier and the ending identifier in the current explanation script as a current command to be executed, and determining the rest as a current text to be read, wherein the number of the current scripts to be read is at least one, and the number of the current commands to be executed is at least one.
3. The method of claim 1, wherein reading the current text to be read and executing the current command to be executed in sequence comprises:
sequentially analyzing the current explanation script;
if the current text to be read is analyzed in the current explanation script, reading the current text to be read based on a text-to-speech conversion technology;
and if the current command to be executed in the current explanation script is analyzed, traversing a preset command set, searching a method corresponding to the current command to be executed, and executing an operation instruction corresponding to the method.
4. The method of claim 1, further comprising:
adding a control interface;
and controlling and displaying any slide to be explained based on the control interface.
5. The method of claim 1, further comprising:
acquiring the current text to be read aloud;
and synchronously displaying the current text to be read in a preset position in the current slide to be explained while reading the text to be read.
6. An apparatus for automatically interpreting a slide, comprising:
the compiling module is used for compiling a current explaining script for a current slide to be explained in advance, wherein the current explaining script comprises the following components: the current text to be read and the current command to be executed;
the first acquisition module is used for acquiring the current explanation script under the condition of receiving an automatic explanation request for the current slide to be explained;
the determining module is used for identifying the current explanation script and determining the current text to be read and the current command to be executed in the current explanation script;
and the reading and executing module is used for sequentially reading the current text to be read and executing the current command to be executed.
7. The apparatus of claim 6, wherein the determining module comprises:
the identification unit is used for traversing the current explanation script and sequentially identifying the current explanation script;
and the determining unit is used for determining a part between the starting identifier and the ending identifier in the current explanation script as a current command to be executed, and determining the rest of the current explanation script as a current text to be read, wherein the number of the current scripts to be read is at least one, and the number of the current commands to be executed is at least one.
8. The apparatus of claim 6, wherein the read and execute module comprises:
the analysis unit is used for sequentially analyzing the current explanation scripts;
the reading unit is used for reading the current text to be read based on a text-to-speech conversion technology if the current text to be read in the current explanation script is analyzed;
and the execution unit is used for traversing a preset command set if the current command to be executed in the current explanation script is analyzed, searching a method corresponding to the current command to be executed, and executing an operation instruction corresponding to the method.
9. The apparatus of claim 6, further comprising:
the adding module is used for adding a control interface;
and the control module is used for controlling and displaying any slide to be explained based on the control interface.
10. The apparatus of claim 6, further comprising:
the second acquisition module is used for acquiring the current text to be read aloud;
and the display module is used for synchronously displaying the current text to be read in the preset position of the current slide to be explained while reading the text to be read.
CN202110898884.7A 2021-08-04 2021-08-04 Automatic interpretation method and device for slides Pending CN113626013A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110898884.7A CN113626013A (en) 2021-08-04 2021-08-04 Automatic interpretation method and device for slides

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110898884.7A CN113626013A (en) 2021-08-04 2021-08-04 Automatic interpretation method and device for slides

Publications (1)

Publication Number Publication Date
CN113626013A true CN113626013A (en) 2021-11-09

Family

ID=78383002

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110898884.7A Pending CN113626013A (en) 2021-08-04 2021-08-04 Automatic interpretation method and device for slides

Country Status (1)

Country Link
CN (1) CN113626013A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114363529A (en) * 2021-12-08 2022-04-15 上海浦东发展银行股份有限公司 Video generation method, device, computer equipment, storage medium and product

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5640590A (en) * 1992-11-18 1997-06-17 Canon Information Systems, Inc. Method and apparatus for scripting a text-to-speech-based multimedia presentation
CN105679120A (en) * 2016-01-29 2016-06-15 右江民族医学院 Method for making standard mandarin speech micro-courseware based on TTS technology
CN108989776A (en) * 2017-06-05 2018-12-11 深圳市领芯者科技有限公司 Slideshow control method, device, storage medium and handheld terminal
CN109167822A (en) * 2018-08-14 2019-01-08 众安信息技术服务有限公司 A kind of internet of things equipment control method and system based on block chain
CN109509464A (en) * 2017-09-11 2019-03-22 珠海金山办公软件有限公司 It is a kind of text to be read aloud the method and device for being recorded as audio
CN111246288A (en) * 2020-01-22 2020-06-05 福建天泉教育科技有限公司 Method and terminal for demonstrating PPTX document in video mode
CN111538851A (en) * 2020-04-16 2020-08-14 北京捷通华声科技股份有限公司 Method, system, device and storage medium for automatically generating demonstration video

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5640590A (en) * 1992-11-18 1997-06-17 Canon Information Systems, Inc. Method and apparatus for scripting a text-to-speech-based multimedia presentation
CN105679120A (en) * 2016-01-29 2016-06-15 右江民族医学院 Method for making standard mandarin speech micro-courseware based on TTS technology
CN108989776A (en) * 2017-06-05 2018-12-11 深圳市领芯者科技有限公司 Slideshow control method, device, storage medium and handheld terminal
CN109509464A (en) * 2017-09-11 2019-03-22 珠海金山办公软件有限公司 It is a kind of text to be read aloud the method and device for being recorded as audio
CN109167822A (en) * 2018-08-14 2019-01-08 众安信息技术服务有限公司 A kind of internet of things equipment control method and system based on block chain
CN111246288A (en) * 2020-01-22 2020-06-05 福建天泉教育科技有限公司 Method and terminal for demonstrating PPTX document in video mode
CN111538851A (en) * 2020-04-16 2020-08-14 北京捷通华声科技股份有限公司 Method, system, device and storage medium for automatically generating demonstration video

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114363529A (en) * 2021-12-08 2022-04-15 上海浦东发展银行股份有限公司 Video generation method, device, computer equipment, storage medium and product

Similar Documents

Publication Publication Date Title
RU2352979C2 (en) Synchronous comprehension of semantic objects for highly active interface
RU2349969C2 (en) Synchronous understanding of semantic objects realised by means of tags of speech application
KR101066732B1 (en) Dynamic help including available speech commands from content contained within speech grammars
TW201921267A (en) Method and system for generating a conversational agent by automatic paraphrase generation based on machine translation
McTear et al. Voice application development for Android
US11907665B2 (en) Method and system for processing user inputs using natural language processing
US11650834B1 (en) Leveraging digital footprints in software development life cycle
US20110041052A1 (en) Markup language-based authoring and runtime environment for interactive content platform
Santiago et al. Building cognitive applications with IBM Watson services: Volume 6 speech to text and text to speech
Kita et al. Implementation of voice user interfaces to enhance users’ activities on Moodle
CN113626013A (en) Automatic interpretation method and device for slides
CN113850898B (en) Scene rendering method and device, storage medium and electronic equipment
Muhammad et al. Development of English conversation practice app with artificial intelligence & speech recognition
Tomko et al. Towards efficient human machine speech communication: The speech graffiti project
RU2344492C2 (en) Dynamic support of pronunciation for training in recognition of japanese and chinese speech
CN109710221B (en) General simulation training system
KR100447667B1 (en) Interactive Language Teaching System consisted of Dolls and Voice Recognizing Computer
KR102098377B1 (en) Method for providing foreign language education service learning grammar using puzzle game
US20240119854A1 (en) Method for Automatically Converting a Text string to an Interactive Video Experience
CN112017487A (en) Flat Flash learning system based on artificial intelligence
JP2006236037A (en) Voice interaction content creation method, device, program and recording medium
Pathak et al. Designing a multilingual virtual agent capable of interacting with uneducated people for automated data collection
CN111104118A (en) AIML-based natural language instruction execution method and system
Rozga et al. Applying our learnings: Alexa skills kit
Mátis et al. Voice Recognition Based Automated Teleprompter Application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination