CN112704485A

CN112704485A - Pronunciation perception system, perception method, device, electronic equipment and storage medium

Info

Publication number: CN112704485A
Application number: CN202011613607.9A
Authority: CN
Inventors: 王屹尊
Original assignee: Shenzhen United Imaging Research Institute of Innovative Medical Equipment
Current assignee: Shenzhen United Imaging Research Institute of Innovative Medical Equipment
Priority date: 2020-12-30
Filing date: 2020-12-30
Publication date: 2021-04-27

Abstract

The embodiment of the invention discloses a pronunciation sensing system, a sensing method, a sensing device, electronic equipment and a storage medium. The pronunciation perception system includes: the device comprises a trigger device and a prompt device; when the trigger device is triggered, the prompt device sends prompt information to prompt a scanning technician in the scanning operation room that a currently scanned person has a pronunciation requirement; the scanning equipment is arranged in a scanning room, and the scanning room is isolated from the scanning operation room. The pronunciation requirement of a scanned person is sensed in time, and a scanning technician is reminded.

Description

Pronunciation perception system, perception method, device, electronic equipment and storage medium

Technical Field

The present invention relates to the field of medical speech, and in particular, to a pronunciation sensing system, a sensing method, a sensing device, an electronic device, and a storage medium.

Background

In MRI (Magnetic Resonance Imaging) scanning clinical work, communication between a scanning technician and a scanned person is crucial. Therefore, voice interactive devices are essential in MRI scanning systems.

Voice interactive devices have 3 major uses in MRI scanning: firstly, an alarm function: the scanning can be stopped in time when the scanned person has an emergency demand; secondly, communication function: for example, in some functional task state scans, the scanning technician needs to make the scanned person perform a specific action (e.g., open eyes, make a fist, move legs, recognize a figure, etc.) or remind the scanned person to hold his breath, keep his or her body still, relax, etc. in a conventional scan. Meanwhile, the scanned person may have urgent needs, such as requiring to know the remaining scanning time, blowing nose, dizziness, numbness, temperature, and other needs to communicate with the scanning technician; thirdly, the function of reducing blood pressure: research shows that the system can communicate with a scanning technician effectively in time in advance, so that anxiety and psychological pressure of a scanned person can be reduced obviously, the scanned person can better match with scanning tasks through warm polite guidance, such as keeping motionless scanning requirements, and in addition, an entertainment system based on voice and video is also developed widely to improve scanning experience of the scanned person.

However, in the actual workflow, the scanner technician often chooses to minimize the voice input of the scanner end due to the loud noise between scans, and therefore the scanner technician cannot immediately detect the voice input while the scanner and the scanner technician are speaking.

Disclosure of Invention

The embodiment of the invention provides a pronunciation sensing system, a sensing method, a sensing device, electronic equipment and a storage medium, which realize the purposes of sensing the pronunciation requirement of a scanned person in time and reminding a scanning technician.

In a first aspect, an embodiment of the present invention provides a pronunciation perception system, where the system includes:

the device comprises a trigger device and a prompt device;

when the trigger device is triggered, the prompt device sends prompt information to prompt a scanning technician in the scanning operation room that a currently scanned person has a pronunciation requirement;

the scanning equipment is arranged in a scanning room, and the scanning room is isolated from the scanning operation room.

In a second aspect, an embodiment of the present invention further provides a pronunciation perception method, where the method includes:

detecting whether a trigger device is triggered and/or detecting whether a voice signal of a current scanned person is identified;

if the trigger device is triggered or the voice signal of the current scanned person is detected, controlling the prompt device to send prompt information to prompt the current scanned person to have a pronunciation requirement;

the trigger device is arranged at a first associated position of the scanning equipment, the prompt device is arranged at a set position of a scanning operation room, the scanning equipment is arranged in the scanning room, and the scanning room and the scanning operation room are arranged in an isolated mode.

In a third aspect, an embodiment of the present invention further provides a pronunciation sensing apparatus, where the apparatus includes:

the detection module is used for detecting whether the trigger device is triggered and/or detecting whether the voice signal of the current scanned person is identified;

the control module is used for controlling the prompting device to send out prompting information to prompt the current scanned person to have pronunciation requirements if the triggering device is triggered or the voice signal of the current scanned person is detected;

In a fourth aspect, an embodiment of the present invention further provides an electronic device, where the electronic device includes:

one or more processors;

a memory for storing one or more programs;

when executed by the one or more processors, cause the one or more processors to implement the pronunciation perception method steps as described in embodiments of the invention.

In a fifth aspect, the embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the pronunciation perception method steps according to the embodiment of the present invention.

The pronunciation perception system provided by the embodiment of the invention comprises: the device comprises a trigger device and a prompt device; when the trigger device is triggered, the prompt device sends prompt information to prompt a scanning technician in the scanning operation room that a currently scanned person has a pronunciation requirement; the scanning equipment is arranged in a scanning room, and the scanning room is isolated from the scanning operation room. The pronunciation perception system provided by the embodiment of the invention realizes the purposes of timely perceiving the pronunciation requirement of the scanned person and reminding the scanning technician.

Drawings

In order to more clearly illustrate the technical solutions of the exemplary embodiments of the present invention, a brief description is given below of the drawings used in describing the embodiments. It should be clear that the described figures are only views of some of the embodiments of the invention to be described, not all, and that for a person skilled in the art, other figures can be derived from these figures without inventive effort.

Fig. 1 is a schematic structural diagram of a pronunciation sensing system according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of a pronunciation perception system of an MRI apparatus according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of a pronunciation sensing system according to a second embodiment of the present invention;

FIG. 4 is a schematic diagram of an alarm ball and a voice recognition device provided in a second embodiment of the present invention;

fig. 5 is a schematic structural diagram of another pronunciation perception system according to a second embodiment of the present invention;

fig. 6 is a schematic flowchart of a pronunciation sensing method according to a third embodiment of the present invention;

fig. 7 is a schematic structural diagram of a pronunciation sensing device according to a fourth embodiment of the present invention;

fig. 8 is a schematic flowchart of an electronic device according to a fifth embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some of the structures related to the present invention are shown in the drawings, not all of the structures.

Before discussing exemplary embodiments in more detail, it should be noted that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart may describe the steps as a sequential process, many of the steps can be performed in parallel, concurrently or simultaneously. In addition, the order of the various steps may be rearranged. The process may be terminated when its steps are completed, but may have additional steps not included in the figure. The processes may correspond to methods, functions, procedures, subroutines, and the like.

Example one

Fig. 1 is a schematic structural diagram of a pronunciation sensing system according to an embodiment of the present invention. Referring to fig. 1, the pronunciation perception system includes: a triggering device 110 and a prompting device 120. The triggering device 110 is arranged at a first relevant position of the scanning device, the prompting device 120 is arranged at a set position of a scanning operation room, and when the triggering device 110 is triggered, the prompting device 120 sends out prompting information to prompt a scanning technician in the scanning operation room that the current scanned person has a pronunciation requirement; the scanning equipment is arranged in the scanning room, and the scanning room is isolated from the scanning operation room.

The scanning device may be a medical device having a scanning function, such as CT (Computer Tomography), MRI (Magnetic Resonance Imaging), DR (Digital X-ray radiography System), ultrasound Imaging, or molecular Imaging. It should be noted that the scanning device is disposed in the scanning room, as shown in fig. 2, the upper half of the drawing is the scanning room, the scanning device is disposed in the scanning room, and the person to be scanned completes scanning in the scanning room; the lower part of the figure is a scanning operation room, the scanning operation room and the scanning room are arranged in an isolated mode, electronic equipment for controlling scanning equipment is placed in the scanning operation room, the electronic equipment can monitor the state of the scanning equipment and pictures in the scanning room, and can control the action of the scanning equipment and play sound signals in the scanning room.

The first related position in this embodiment may be a position close to the set body part of the scanned person, or a position above the set body part of the scanned person, that is, a position that can be easily touched by the scanned person during the scanning process. The set body part may be a part such as a hand, an elbow, or an eye of the scanned person. For example, as shown in fig. 2, the scanning device is an MRI device, a person to be scanned needs to lie on an examination table during scanning, and the first relevant position may be a position near a hand on a bed board of the examination table; or the magnetic body can be arranged at a position above the head of the scanned person; it may be disposed above the middle point of the eyes and hands of the person to be scanned on the magnet.

The triggering device 110 in this embodiment is used to be triggered by the scanned person when the scanned person has a need of pronunciation. It should be noted that the pronunciation requirement may be an urgent pronunciation requirement of the scanned person during the scanning process, for example, it is desired to terminate the scanning; or the communication sound requirement between the scanned person and the scanning technician during the scanning process, such as dizziness, convenience or understanding of the remaining scanning time. Specifically, after the person to be scanned generates a pronunciation requirement, the person to be scanned may be notified of the pronunciation requirement by triggering the triggering device 110 at the first associated position of the scanning apparatus, so as to remind the scanning technician to turn up the loudness of the sound signal recorded between the scans.

Optionally, the triggering device 110 includes a triggering button, and when the current scanned person has a need of pronunciation, the triggering button is pressed to trigger the triggering button.

The trigger button may be an on-off button, that is, the trigger button is triggered after the scanner presses the trigger button. The trigger button may also be a capacitive or inductive button that is triggered when the scanned person places a hand or elbow on the trigger button. In this embodiment, the scanned person can express that the scanned person has the pronunciation requirement by pressing the trigger button, and the timely perception of the pronunciation requirement of the scanned person is realized.

It is considered that if the trigger button is disposed on the magnet or the examination table of the existing scanning apparatus, the structural change of the existing scanning apparatus is large, and meanwhile, there may be a case that the trigger button is not compatible with other structures of the scanning apparatus. Optionally, the trigger button is provided on an alarm ball of the MRI apparatus.

The alarm ball is the original alarm device in the MRI device, and when the scanned person has an emergency (such as blowing nose, dizziness, discomfort, defecation and the like) and needs to stop scanning, the alarm ball can be pressed to trigger an alarm. As shown in fig. 2, the reference numeral 3 is an alarm ball. In this embodiment, the trigger button may be located on a surface of the alarm ball, such as the thumb end of the surface of the alarm ball. For example, the label 1 in fig. 2 is a trigger button, the trigger button is disposed on the surface of the alarm ball, and the person to be scanned can press the trigger button on the alarm ball to trigger the trigger button. In this embodiment, the trigger button may be disposed on the alarm ball of the MRI apparatus, so as to avoid excessive modification of the structure of the scanning apparatus, and facilitate triggering by a scanner, thereby quickly implementing application of the trigger button of this embodiment to the existing scanning apparatus.

In the present embodiment, the prompting device 120 is used to prompt the scanning technician that the current scanned person has a requirement for pronunciation. The prompting message emitted by the prompting device 120 can be an audio prompting message and/or a visual prompting message. The audio prompt information includes, but is not limited to, set audio information and set voice information, for example, the set audio information may be a droplet sound, and the set voice information may be "the scanned person has a need for pronunciation". The visual cue may be a display-enabled cue, such as a flashing sensor light. Specifically, the prompting device 120 emits an audio prompt message and/or a visual prompt message when the triggering device 110 is triggered, so as to prompt the scanning technician that the currently scanned person has a need to pronounce.

Optionally, the prompting device 120 includes a sensing light that is illuminated when the triggering device 110 is triggered.

Wherein the induction lamp is used to light up when the triggering device 110 is triggered. The number of the induction lamps can be one or more. Taking the number of the induction lamps as 1 as an example, for example, the reference numeral 5 in fig. 2 is an induction lamp. Specifically, when the trigger device 110 is not triggered, the induction lamp is in an off state, and when the trigger device 110 is triggered, the induction lamp is in an on state. It should be noted that the color of the induction lamp when it is lighted may be a color having a prominent visual effect, such as red, green, etc. In another embodiment, the sensing lamp may be in a green normally-on state when the triggering device 110 is not triggered, and the sensing lamp may be in a red flashing state when the triggering device 110 is triggered. In the present embodiment, the prompting device 120 includes a sensing lamp, and when the triggering device 110 is triggered, the sensing lamp is turned on, so as to realize visual prompt for the scanning technician.

In the embodiment, when the triggering device 110 is triggered by the scanned person, the prompting device 120 sends out the prompting message, and at this time, the scanning technician in the scanning operation room can know that the scanned person has a need for pronunciation, so that the loudness of the sound signal in the scanning operation room and/or the sound receiving volume of the microphone of the scanned person can be increased to clearly identify the sound of the scanned person. It should be noted that a volume adjustment knob, such as the mark 6 in fig. 2, may be provided in the scanning operation room, and the scanning technician may increase the loudness of the sound signal in the scanning operation room and/or the sound receiving volume of the microphone of the scanned person by using the volume adjustment knob after receiving the prompting message sent by the prompting device 120.

The pronunciation perception system provided by the embodiment comprises: the device comprises a trigger device and a prompt device; when the trigger device is triggered, the prompt device sends prompt information to prompt a scanning technician in the scanning operation room that a currently scanned person has a pronunciation requirement; the scanning equipment is arranged in a scanning room, and the scanning room is isolated from the scanning operation room. The pronunciation requirement of a scanned person is sensed in time, and a scanning technician is reminded.

Example two

Fig. 3 is a schematic structural diagram of a pronunciation sensing system according to a second embodiment of the present invention. On the basis of the above embodiment, the pronunciation sensing system in this embodiment further includes a voice recognition device to realize dual sensing of the pronunciation requirement of the scanned person based on the voice recognition device and the trigger device, and improve accuracy and flexibility of sensing the pronunciation requirement of the scanned person.

Referring to fig. 3, the pronunciation perception system includes: a triggering device 310, a prompting device 320 and a speech recognition device 330. The triggering device 310 is arranged at a first relevant position of the scanning device, the prompting device 320 is arranged at a set position of the scanning operation room, and when the triggering device 310 is triggered, the prompting device 320 sends out prompting information to prompt a scanning technician in the scanning operation room that the current scanned person has a pronunciation requirement; the scanning equipment is arranged in the scanning room, and the scanning room and the scanning operation room are arranged in an isolated mode; the voice recognition device 330 is disposed at a second associated position of the scanning device, and is linked with the prompting device 320 for receiving and recognizing the voice signal of the current scanned person, and when the voice signal of the current scanned person is recognized, the prompting device 320 sends out a prompting message to prompt a scanning technician in the scanning operation room that the current scanned person has a requirement for pronunciation.

The second relevant position may be a position close to or above the mouth of the scanned person, for example, a position above the mouth of the scanned person on the MRI apparatus, as indicated by reference numeral 4 in fig. 2. Or may be a location on the microphone of the person being scanned. Specifically, the voice recognition device 330 is disposed at a second associated position of the scanning apparatus to receive the voice signal of the current scanned person and recognize the voice signal.

The voice recognition device 330 in this embodiment may be a device that can recognize voice information of a scanned person and is insensitive to device noise of the scanning apparatus. The voice signal of the current scanned person received by the voice recognition device comprises a voice signal and equipment noise, and the voice recognition device can recognize the voice information of the current scanned person through a preset voice recognition algorithm, such as a hidden markov model algorithm, a dynamic time warping algorithm and other voice recognition algorithms; the voice information of the current scanned person can be identified through a preset neural network model. When the voice signal of the current scanned person is recognized, the prompting device 320 sends out a prompting message to prompt the scanning technician that the current scanned person has a need of pronunciation.

In this embodiment, the voice information of the scanned person may be collected in advance and input to the voice recognition device, so that the voice recognition device analyzes and learns the voice characteristics of the scanned person in advance, and thus when the voice information of the current scanned person is received, the voice information of the current scanned person may be recognized from the voice information containing the device noise. In an embodiment, the pre-collection of the voice information of the scanned person may be completed by an alarm ball, as shown in fig. 4, a trigger button 401, a recording collector 402 and a recording emitter 403 may be provided on the alarm ball, so that the scanned person may pre-collect the voice signal of the scanned person by the recording collector 402 after pressing the trigger device or when pressing the trigger device 401 for a long time, and transmit the pre-collected voice signal to the voice recognition device by the recording emitter 403, so that the voice recognition device may complete the early analysis and extraction of the voice feature of the scanned person, and may accurately distinguish the voice information of the scanned person from the device noise of the scanning device. Record collector 402 and record emitter 403 may be integrated together, such as tag 2 in FIG. 2.

In one embodiment, as shown in FIG. 4, the speech recognition device includes a recording receiver 404, a speech receiver 405, a speech converter 406, and a speech analyzer 407. The recording receiver 404 is configured to receive a pre-collected voice signal sent by the recording transmitter 403 on the alarm ball; the voice receiver 405 is used for receiving the voice signal of the current scanned person; the voice converter 406 is used for converting the voice signal into voice data that can be analyzed by the voice analyzer 407; the voice analyzer 407 is used to analyze characteristics of the voice data. Specifically, the pre-collected voice signal passes through the recording receiver 404, the voice converter 406 and the voice analyzer 407, and then the pre-analysis and extraction of the voice feature of the scanned person are completed; the voice signal of the current scanned person passes through the voice receiver 405, the voice converter 406 and the voice analyzer 407, and then the voice signal of the current scanned person is recognized, and when the voice signal of the current scanned person is recognized, the prompting device sends out prompting information.

The teaching process of the first use of the alarm ball and the voice recognition device by the scanned person is introduced below, the psychological pressure of the scanned person can be reduced through the teaching process, and the pre-collection of the voice signal of the scanned person can be realized:

the scan technician says to the scanned person: "you are good, if you have physical discomfort or emergency, please press the alarm ball, and the scanning will stop after pressing the ball. If you want to speak with me, please press the trigger button on the alarm ball. Please try me, press the trigger button on the alarm ball and say it-doctor, i just moved ". The scanned person presses the trigger button and repeats: "doctor, i just moved". The scan technician says to the scanned person: "good, then start scanning and continue to press the trigger button if you want to speak".

In the teaching process, the words spoken by the technician can be collected by the recording collector 402 of the alarming ball after the scanner presses the trigger button to repeatedly scan, and the words are sent to the voice recognition device by the recording emitter 403 of the alarming ball, so that the voice recognition device can analyze and learn the voice characteristics of the scanner.

It should be noted that the triggering device 310 and the voice recognition device 330 in this embodiment may be used to trigger the prompting device 320 to send out the prompting message at the same time, or may be selected from the triggering device 310 and the voice recognition device 330 to trigger the prompting device 320 to send out the prompting message. Correspondingly, referring to a schematic structural diagram of a pronunciation perception system shown in fig. 5, the pronunciation perception system includes: speech recognition means 510 and prompting means 520; the voice recognition device 510 is arranged at a second relevant position of the scanning device, the prompting device 520 is arranged at a set position of the scanning operation room, the voice recognition device 510 and the prompting device 520 are arranged in a linkage manner and used for receiving and recognizing a voice signal of a current scanned person, and when the voice signal of the current scanned person is recognized, the prompting device 520 sends out prompting information to prompt a scanning technician in the scanning operation room that the current scanned person has a pronunciation requirement; the scanning equipment is arranged in a scanning room, and the scanning room is isolated from the scanning operation room. For example, for a speech recognition device with higher recognition accuracy, the pronunciation perception system may cancel the setting of the trigger device, and only include the prompting device and the speech recognition device, so that the prompting device issues the prompt message when the speech recognition device recognizes the speech signal of the current scanned person. However, for a speech recognition device with low recognition accuracy, the pronunciation perception system can comprise a triggering device, a speech recognition device and a prompting device, but the pre-analysis process of the speech recognition device is simplified; the voice recognition device can also be cancelled, and only comprises the triggering device and the prompting device, so that when the triggering device is triggered, the prompting device sends out prompting information.

The pronunciation perception system provided by the embodiment is characterized in that the voice recognition device is arranged at the second relevant position of the scanning device and is linked with the prompt device, so that the voice signal of the current scanned person is received and recognized, and when the voice signal of the current scanned person is recognized, the prompt device sends prompt information to prompt a scanning technician in a scanning operation room that the current scanned person has pronunciation requirements, thereby realizing double perception based on the voice recognition device and the trigger device and improving the accuracy and flexibility of perception of the pronunciation requirements of the scanned person.

Optionally, the pronunciation sensing system further comprises a volume adjusting device, and the volume adjusting device is used for controlling the sound receiving volume to increase when the trigger device is triggered; wherein, the larger the sound receiving volume is, the clearer the sound transmitted from the scanning room to the scanning operation room is.

The volume adjusting device is connected with the trigger device and used for automatically controlling the sound receiving volume of the microphone to be increased when the trigger device is triggered, so that the sound transmitted from the scanning room to the scanning operation room is clearer. In the embodiment, the volume adjusting device is arranged to automatically control the sound receiving volume to be increased when the triggering device is triggered, so that the manual adjustment of a scanning technician is not needed, the automation from sensing the sound production requirement to increasing the sound receiving volume is realized, and meanwhile, the situation that the scanning technician does not increase the sound receiving volume in time can be avoided.

EXAMPLE III

Fig. 6 is a flowchart of a pronunciation sensing method according to a third embodiment of the present invention, which is applicable to prompt a scanning technician that a current scanned person has a requirement for pronunciation, and is particularly applicable to prompt the scanning technician that the current scanned person has a requirement for pronunciation when a triggering device is detected to be triggered by the scanned person or a voice signal of the current scanned person is recognized. Wherein explanations of the same or corresponding terms as those of the above-described embodiments are omitted.

As shown in fig. 6, the pronunciation sensing method provided by this embodiment includes the following steps:

s610, whether the triggering device is triggered or not is detected, and/or whether the voice signal of the current scanned person is recognized or not is detected.

S620, if the trigger device is triggered or the voice signal of the current scanned person is detected, controlling the prompt device to send prompt information to prompt the current scanned person to have pronunciation requirements; the trigger device is arranged at a first associated position of the scanning equipment, the prompt device is arranged at a set position of the scanning operation room, the scanning equipment is arranged in the scanning room, and the scanning room and the scanning operation room are arranged in an isolated mode.

Optionally, if the triggering device is triggered, the volume adjusting device is controlled to increase the sound receiving volume; wherein, the larger the sound receiving volume is, the clearer the sound transmitted from the scanning room to the scanning operation room is.

Through setting up the volume adjustment device to when trigger device is triggered, automatic control radio reception volume increases, need not the manual regulation of scanning technician, has realized the automation from perception pronunciation demand to increase radio reception volume, simultaneously, can avoid appearing the situation that the scanning technician does not in time increase radio reception volume.

According to the technical scheme of the embodiment, whether the trigger device is triggered and/or whether the voice signal of the current scanned person is identified is detected, and when the trigger device is triggered or the voice signal of the current scanned person is detected, the prompt device is controlled to send the prompt information to prompt a scanning technician in a scanning operation room that the current scanned person has a pronunciation requirement, so that the pronunciation requirement of the scanned person is sensed in time, and the scanning technician is reminded.

Example four

Fig. 7 is a schematic structural diagram of a pronunciation sensing device according to a fourth embodiment of the present invention, which is applicable to prompt a scanning technician that a current scanned person has a requirement for pronunciation, and is particularly applicable to prompt the scanning technician that the current scanned person has a requirement for pronunciation when a triggering device is detected to be triggered by the scanned person or a voice signal of the current scanned person is recognized, where the device specifically includes: a detection module 710 and a control module 720.

A detecting module 710, configured to detect whether a triggering device is triggered and/or detect whether a voice signal of a current scanned person is recognized;

the control module 720 is used for controlling the prompting device to send out prompting information to prompt the current scanned person to have a pronunciation requirement if the triggering device is triggered or a voice signal of the current scanned person is detected; the trigger device is arranged at a first associated position of the scanning equipment, the prompt device is arranged at a set position of the scanning operation room, the scanning equipment is arranged in the scanning room, and the scanning room and the scanning operation room are arranged in an isolated mode.

According to the technical scheme of the embodiment, whether the trigger device is triggered or not is detected through the detection module, and/or whether the voice signal of the current scanned person is identified or not is detected, when the trigger device is triggered or the voice signal of the current scanned person is detected through the control module, the prompt device is controlled to send prompt information to prompt a scanning technician located in a scanning operation room that the current scanned person has a pronunciation requirement, so that the pronunciation requirement of the scanned person is timely sensed, and the scanning technician is prompted.

The pronunciation sensing device provided by the embodiment of the invention can execute the pronunciation sensing method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.

It should be noted that, the units and modules included in the system are merely divided according to functional logic, but are not limited to the above division as long as the corresponding functions can be realized; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the embodiment of the invention.

EXAMPLE five

Fig. 8 is a schematic structural diagram of an electronic device according to a fifth embodiment of the present invention. FIG. 8 illustrates a block diagram of an exemplary electronic device 12 suitable for use in implementing embodiments of the present invention. The electronic device 12 shown in fig. 8 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiment of the present invention. The device 12 is typically a pronunciation-aware electronic device.

As shown in FIG. 8, electronic device 12 is embodied in the form of a general purpose computing device. The components of electronic device 12 may include, but are not limited to: one or more processors or processing units 16, a memory 28, and a bus 18 that couples the various components (including the memory 28 and the processing unit 16).

Bus 18 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, such architectures include, but are not limited to, an Industry Standard Architecture (ISA) bus, a Micro Channel Architecture (MCA) bus, an enhanced ISA bus, a Video Electronics Standards Association (VESA) local bus, and a Peripheral Component Interconnect (PCI) bus.

Electronic device 12 typically includes a variety of computer-readable media. Such media may be any available media that is accessible by electronic device 12 and includes both volatile and nonvolatile media, removable and non-removable media.

Memory 28 may include computer device readable media in the form of volatile Memory, such as Random Access Memory (RAM) 30 and/or cache Memory 32. The electronic device 12 may further include other removable/non-removable, volatile/nonvolatile computer storage media. By way of example only, the storage device 34 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 8, and commonly referred to as a "hard drive"). Although not shown in FIG. 8, a magnetic disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a Compact disk-Read Only Memory (CD-ROM), a Digital Video disk (DVD-ROM), or other optical media) may be provided. In these cases, each drive may be connected to bus 18 by one or more data media interfaces. Memory 28 may include at least one program product 40, with program product 40 having a set of program modules 42 configured to carry out the functions of embodiments of the invention. Program product 40 may be stored, for example, in memory 28, and such program modules 42 include, but are not limited to, one or more application programs, other program modules, and program data, each of which examples or some combination may comprise an implementation of a network environment. Program modules 42 generally carry out the functions and/or methodologies of the described embodiments of the invention.

Electronic device 12 may also communicate with one or more external devices 14 (e.g., keyboard, mouse, camera, etc., and display), one or more devices that enable a user to interact with electronic device 12, and/or any devices (e.g., network card, modem, etc.) that enable electronic device 12 to communicate with one or more other computing devices. Such communication may be through an input/output (I/O) interface 22. Also, the electronic device 12 may communicate with one or more networks (e.g., a Local Area Network (LAN), Wide Area Network (WAN), and/or a public Network such as the internet) via the Network adapter 20. As shown, the network adapter 20 communicates with other modules of the electronic device 12 via the bus 18. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, Redundant processing units, external disk drive Arrays, disk array (RAID) devices, tape drives, and data backup storage devices, to name a few.

The processor 16 executes various functional applications and data processing by executing programs stored in the memory 28, for example, implementing the pronunciation perception method provided by the above-mentioned embodiment of the present invention, including:

Of course, those skilled in the art will understand that the processor may also implement the technical solution of the pronunciation perception method provided by any embodiment of the present invention.

EXAMPLE six

A sixth embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the pronunciation perception method steps provided in any embodiment of the present invention, where the method includes:

Computer storage media for embodiments of the invention may employ any combination of one or more computer-readable media. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.

A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.

Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Computer program code for carrying out operations for embodiments of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).

It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims

1. A pronunciation perception system, comprising: the device comprises a trigger device and a prompt device;

2. The pronunciation perception system according to claim 1, wherein the triggering mechanism includes a trigger button that is pressed when a request for pronunciation is made by a current scannable person, so that the trigger button is triggered.

3. The pronunciation perception system according to claim 2, wherein the trigger button is disposed on an alarm ball of the MRI apparatus, and the current scanner triggers an alarm by pressing the alarm ball.

4. The pronunciation perception system according to claim 1, wherein the prompting device includes a sensor light that is illuminated when the triggering device is triggered.

5. The pronunciation perception system according to any one of claims 1-4, further comprising: the volume adjusting device is used for controlling the sound receiving volume to increase when the trigger device is triggered;

wherein, the larger the sound receiving volume is, the clearer the sound transmitted from the scanning room to the scanning operation room is.

6. A pronunciation perception system, comprising: a voice recognition device and a prompting device;

the voice recognition device is arranged at a second relevant position of the scanning equipment, the prompting device is arranged at a set position of the scanning operation room, the voice recognition device and the prompting device are arranged in a linkage manner and are used for receiving and recognizing a voice signal of a current scanned person, and when the voice signal of the current scanned person is recognized, the prompting device sends out prompting information to prompt a scanning technician positioned in the scanning operation room that the current scanned person has a pronunciation requirement;

7. The pronunciation perception system according to claim 6, further comprising: the trigger device is arranged at a third related position of the scanning equipment, and when the trigger device is triggered, the prompt device sends out the prompt information.

8. The pronunciation perception system according to claim 7, wherein the triggering mechanism includes a trigger button that is pressed when a request for pronunciation is made by a current scannable person, so that the trigger button is triggered.

9. The pronunciation perception system according to claim 8, wherein the trigger button is disposed on an alarm ball of the MRI apparatus, and the current scanner triggers an alarm by pressing the alarm ball.

10. The pronunciation perception system according to any one of claims 6 to 9, wherein the prompting device includes a sensor light that is illuminated when the voice recognition device recognizes a voice signal of a currently scanned person.

11. The pronunciation perception system according to any one of claims 6-9, further comprising: the volume adjusting device is used for controlling the sound receiving volume to increase when the trigger device is triggered;

12. A method for pronunciation perception, comprising:

13. An utterance sensing apparatus, comprising:

14. An electronic device, characterized in that the electronic device comprises:

one or more processors;

a memory for storing one or more programs;

when executed by the one or more processors, cause the one or more processors to implement the pronunciation perception method steps as claimed in claim 7.

15. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the pronunciation perception method steps as claimed in claim 7.