CN112307249A - Audio information playing method and device - Google Patents

Info

Publication number
CN112307249A
Authority
CN
China
Prior art keywords
hearing test
audio information
preset
playing
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010147898.0A
Other languages
Chinese (zh)
Inventor
Inventor name not disclosed
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing ByteDance Network Technology Co Ltd
Original Assignee
Beijing ByteDance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing ByteDance Network Technology Co Ltd filed Critical Beijing ByteDance Network Technology Co Ltd
Priority to CN202010147898.0A priority Critical patent/CN112307249A/en
Publication of CN112307249A publication Critical patent/CN112307249A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/635Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

Embodiments of the disclosure provide an audio information playing method and device. Image information of a test question is first obtained; in response to detecting that the image information contains a hearing test question, audio information corresponding to the hearing test question is obtained; and the audio information is then played according to a preset playing mode. Because the audio information is obtained from the hearing test question the user is currently working on, it is acquired and played for the user automatically, which saves user operations and time and can improve the user's learning efficiency.

Description

Audio information playing method and device
Technical Field
The embodiment of the disclosure relates to the technical field of computers, in particular to an audio information playing method and device.
Background
As children's learning and daily activities become richer, language and hearing tests are often required during study. Parents generally have to find the audio content that corresponds to a test question themselves. For example, a parent scans the two-dimensional code associated with the test question using a mobile phone to obtain the corresponding audio content and then plays it to the child.
Disclosure of Invention
The embodiment of the disclosure provides an audio information playing method and device.
In a first aspect, an embodiment of the present disclosure provides an audio information playing method, where the method includes: acquiring image information of a test question; in response to detecting that the image information contains a hearing test question, acquiring audio information corresponding to the hearing test question; and playing the audio information corresponding to the hearing test question according to a preset playing mode.
In some embodiments, acquiring the audio information corresponding to the hearing test question includes: in response to finding the hearing test question in a preset hearing test resource library, acquiring the audio information corresponding to the hearing test question from the preset hearing test resource library.
In some embodiments, acquiring the audio information corresponding to the hearing test question further includes: in response to not finding the hearing test question in the preset hearing test resource library, pushing to the user prompt information for acquiring image information of the answer to the hearing test question; in response to determining that the user has performed the corresponding operation according to the prompt information, acquiring the image information of the answer to the hearing test question; and recognizing the image information of the answer and generating audio information of the recognized answer to the hearing test question.
In some embodiments, playing the audio information corresponding to the hearing test question according to a preset playing mode includes: acquiring the type of the hearing test question and determining the preset playing mode corresponding to that type; and playing the audio information corresponding to the hearing test question according to the determined preset playing mode.
In some embodiments, the types of hearing test question include a conversation type, and the preset playing mode corresponding to the conversation type includes alternately outputting the audio of the conversation content of different conversation participants, using a different timbre for each participant.
In a second aspect, an embodiment of the present disclosure provides an audio information playing apparatus, including: a first acquisition unit configured to acquire image information of a test question; a second acquisition unit configured to acquire, in response to detecting that the image information contains a hearing test question, audio information corresponding to the hearing test question; and a playing unit configured to play the audio information corresponding to the hearing test question according to a preset playing mode.
In some embodiments, the second acquisition unit is further configured to: in response to finding the hearing test question in a preset hearing test resource library, acquire the audio information corresponding to the hearing test question from the preset hearing test resource library.
In some embodiments, the second acquisition unit is further configured to: in response to not finding the hearing test question in the preset hearing test resource library, push to the user prompt information for acquiring image information of the answer to the hearing test question; in response to determining that the user has performed the corresponding operation according to the prompt information, acquire the image information of the answer to the hearing test question; and recognize the image information of the answer and generate audio information of the recognized answer to the hearing test question.
In some embodiments, the playing unit is further configured to: acquire the type of the hearing test question and determine the preset playing mode corresponding to that type; and play the audio information corresponding to the hearing test question according to the determined preset playing mode.
In some embodiments, the types of hearing test question include a conversation type, and the preset playing mode corresponding to the conversation type includes alternately outputting the audio of the conversation content of different conversation participants, using a different timbre for each participant.
In a third aspect, an embodiment of the present disclosure provides an electronic device, including: one or more processors; a storage device having one or more programs stored thereon; when the one or more programs are executed by the one or more processors, the one or more processors implement the audio information playing method as described in any of the embodiments of the first aspect.
In a fourth aspect, embodiments of the present disclosure provide a computer-readable medium on which a computer program is stored, the computer program, when executed by a processor, implementing the audio information playing method as described in any one of the embodiments of the first aspect.
According to the audio information playing method and device provided by the embodiments of the disclosure, image information of a test question is first acquired; then, in response to detecting that the image information contains a hearing test question, audio information corresponding to the hearing test question is acquired; and finally the audio information is played according to a preset playing mode.
Drawings
Other features, objects and advantages of the disclosure will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is an exemplary system architecture diagram in which one embodiment of the present disclosure may be applied;
FIG. 2 is a flow diagram of one embodiment of an audio information playing method according to an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of an application scenario of an audio information playing method according to an embodiment of the present disclosure;
FIG. 4 is an exemplary flow chart for obtaining audio information corresponding to a hearing test question according to an embodiment of the present disclosure;
FIG. 5 is a schematic structural diagram of one embodiment of an audio information playback device according to an embodiment of the present disclosure;
FIG. 6 is a schematic structural diagram of an electronic device suitable for use in implementing embodiments of the present disclosure.
Detailed Description
The present disclosure is described in further detail below with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant disclosure and are not limiting of the disclosure. It should be noted that, for the convenience of description, only the parts relevant to the related disclosure are shown in the drawings.
It should be noted that, in the present disclosure, the embodiments and features of the embodiments may be combined with each other without conflict. The present disclosure will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
Fig. 1 illustrates an exemplary system architecture 100 of an audio information playing method and an audio information playing apparatus to which embodiments of the present disclosure may be applied.
As shown in fig. 1, the system architecture 100 may include terminal devices 104, 105, a network 106, and servers 101, 102, 103. The network 106 serves as a medium for providing communication links between the terminal devices 104, 105 and the servers 101, 102, 103. Network 106 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
A user may interact with the servers 101, 102, 103 via the network 106 via the terminal devices 104, 105 to receive or transmit information or the like. The terminal devices 104, 105 may have various applications installed thereon, such as reading-type applications, data analysis applications, online learning applications, instant messaging tools, social platform software, search-type applications, shopping-type applications, data processing applications, and the like.
The terminal devices 104, 105 may be hardware or software. When a terminal device is hardware, it may be any of various electronic devices that have a display screen and support communication with the server, including but not limited to a smart phone, a tablet computer, a laptop computer, a desktop computer, and the like. When a terminal device is software, it can be installed in the electronic devices listed above, and it may be implemented as multiple pieces of software or software modules, or as a single piece of software or software module, which is not particularly limited herein.
The terminal devices 104 and 105 may be terminals having an image capturing function and a voice/image prompting function (for example, a voice device with a screen and voice interaction, or an intelligent desk lamp or intelligent learning table with a screen and a camera). The captured images may be processed locally on the terminal devices 104 and 105 or sent to a server for processing. Alternatively, the terminal devices 104 and 105 may obtain images from an image capturing device installed at the user's learning location and either process the images and play the audio information locally, or have the server process the images and then play the audio information according to the server's processing result.
The servers 101, 102, 103 may be servers that provide various services, such as background servers that receive requests sent by terminal devices with which communication connections are established. The background server can receive and analyze the request sent by the terminal device, and generate a processing result.
The server may be hardware or software. When the server is hardware, it may be any of various electronic devices that provide services to the terminal device. When the server is software, it may be implemented as multiple pieces of software or software modules that provide services to the terminal device, or as a single piece of software or software module, which is not particularly limited herein.
It should be noted that the audio information playing method provided by the embodiments of the present disclosure may be executed by the terminal devices 104 and 105 or by the servers 101, 102 and 103. Accordingly, the audio information playing apparatus may be provided in the terminal devices 104, 105 or in the servers 101, 102, 103.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to fig. 2, a flow 200 of one embodiment of an audio information playback method in accordance with the present disclosure is shown. The audio information playing method comprises the following steps:
Step 210, obtaining image information of a test question.
In this step, the execution subject of the audio information playing method may obtain the image information by shooting in real time or by reading it from memory, and the image information may include the test question the user is currently working on. In an exemplary scenario, the user studies at a desk equipped with an intelligent desk lamp that has a camera. The execution subject receives the user's instruction to turn on the intelligent desk lamp, for example the spoken command "I want to take a hearing test", and the camera on the intelligent desk lamp then starts to shoot images of the test question the user is currently working on.
Step 220, in response to detecting that the image information contains a hearing test question, acquiring audio information corresponding to the hearing test question.
In this step, after acquiring the image that includes the test question, the execution subject recognizes the test question in the image using an image recognition method and obtains the content of the test question contained in the image information, such as the question text and the question type. The execution subject then judges, from the recognized content, whether the current test questions include a hearing test question. When a hearing test question is detected, the execution subject may acquire the corresponding audio information from a resource library according to the detected hearing test question, or may generate the corresponding audio information from the content of the answer page.
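As a minimal sketch of the detection part of step 220, the following Python assumes the image has already been converted to text by some image recognition step (not shown) and uses a simple keyword heuristic to decide which recognized questions are hearing test questions. The keyword list and the names `RecognizedQuestion` and `contains_hearing_test_question` are hypothetical; the disclosure does not specify how the judgment is made.

```python
from dataclasses import dataclass

# Hypothetical cue words; the disclosure does not prescribe a detection rule.
HEARING_KEYWORDS = ("listen", "listening", "hearing", "dictation")


@dataclass
class RecognizedQuestion:
    """Content recognized from the image: question text and an optional type label."""
    text: str
    question_type: str = ""


def contains_hearing_test_question(questions):
    """Return the recognized questions that look like hearing test questions."""
    hits = []
    for q in questions:
        haystack = f"{q.text} {q.question_type}".lower()
        if any(keyword in haystack for keyword in HEARING_KEYWORDS):
            hits.append(q)
    return hits
```

An empty result would mean that no hearing test question was detected, in which case the remaining steps would not be triggered.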
Step 230, playing the audio information corresponding to the hearing test question according to a preset playing mode.
In this step, after obtaining the audio information corresponding to the hearing test question, the execution subject may play it according to a preset playing mode. The preset playing mode is set in advance according to the type of the hearing test question or the type of the audio information, and refers to the way the audio information is output, such as voice reading or voice conversation. As an example, after acquiring the audio information, the execution subject analyzes its content to determine its type, for example that the audio information is a text paragraph. The execution subject then determines from this type that the corresponding playing mode is voice reading, and outputs the audio information to the user in the voice reading mode.
In some optional implementations of this embodiment, playing the audio information corresponding to the hearing test question according to a preset playing mode in step 230 may be performed as follows:
Step 1, acquiring the type of the hearing test question and determining the preset playing mode corresponding to that type.
In this step, the execution subject may recognize the content of the hearing test question in the image information using an image recognition method and determine the type of the hearing test question from the recognized content. The execution subject then determines the preset playing mode for that type according to a preset correspondence between hearing test question types and playing modes.
When the type of the hearing test question is the conversation type, the corresponding preset playing mode includes alternately outputting the audio of the conversation content of the different conversation participants, using a different timbre for each participant. When the type of the hearing test question is the article type, the corresponding preset playing mode includes outputting the audio of the article using a single timbre, and so on.
Step 2, playing the audio information corresponding to the hearing test question according to the preset playing mode.
In this step, after determining the audio information corresponding to the hearing test question and the corresponding preset playing mode, the execution subject plays the audio information for the user in that mode. As an example, if the execution subject determines that the hearing test question is of the conversation type, it outputs the corresponding audio information alternately in a male voice and a female voice.
In this implementation, the execution subject determines the playing mode from the type of the hearing test question, so that the output audio better suits the question type and the audio output modes are more diverse.
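The correspondence between question type and playing mode described in steps 1 and 2 can be sketched as a lookup table plus a voice assignment. This is an illustration only: the table `PLAY_MODES`, the function `plan_playback`, the voice labels, and the (speaker, text) segment format are assumptions, and actual speech output is left out.

```python
from itertools import cycle

# Hypothetical type-to-mode table; names are illustrative only.
PLAY_MODES = {"dialogue": "alternating_voices", "article": "single_voice"}


def plan_playback(question_type, segments, voices=("male", "female")):
    """Return (voice, text) pairs describing how the audio would be output.

    segments: list of (speaker, text) tuples in spoken order.
    """
    mode = PLAY_MODES.get(question_type, "single_voice")
    if mode == "single_voice":
        # Article-like content: one timbre for everything.
        return [(voices[0], text) for _, text in segments]
    # Dialogue: each distinct speaker gets a voice, taken from the pool in rotation.
    voice_pool = cycle(voices)
    assigned = {}
    plan = []
    for speaker, text in segments:
        if speaker not in assigned:
            assigned[speaker] = next(voice_pool)
        plan.append((assigned[speaker], text))
    return plan
```

With two speakers and the default voices, the turns of a dialogue come out alternately in the male and female voice, matching the example in the text; unknown question types fall back to a single voice in this sketch.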
With continuing reference to FIG. 3, FIG. 3 is a schematic diagram of an application scenario of the audio information playing method according to the present embodiment. In the application scenario of FIG. 3, a user is studying at a desk equipped with the intelligent desk lamp 310, which may be provided with the camera 320. The intelligent desk lamp 310 receives the user's instruction to turn it on: for example, the user says "start hearing test", the intelligent desk lamp 310 receives the user's voice through its voice interaction device and turns on automatically, and the camera 320 then starts to shoot images of the test question the user is currently working on. After receiving an image that includes the current test question, the intelligent desk lamp 310 recognizes the image using an image recognition method to obtain the content of the test question. The intelligent desk lamp 310 then judges from the recognized content whether it contains a hearing test question and, if so, starts to acquire the audio information corresponding to the recognized hearing test question, where the audio information covers the hearing test question and its answer. Finally, the intelligent desk lamp 310 plays the acquired audio information to the user through the voice interaction device according to a preset playing mode.
The audio information playing method provided by the embodiments of the disclosure acquires image information of a test question, acquires audio information corresponding to a hearing test question in response to detecting that the image information contains the hearing test question, and finally plays the audio information according to a preset playing mode. Whether a hearing test question is present can thus be judged from the current test question, and the corresponding audio information can be acquired automatically and played for the user. This avoids the situation in which a user who is a child cannot acquire the audio information independently, so a parent no longer needs to obtain it for the child, which improves the child's learning efficiency and saves the parent's operations and time.
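The overall flow 200 (steps 210 to 230) can be summarized in a short Python sketch. The four callables passed in are hypothetical placeholders for image capture, detection, audio acquisition, and playback; the disclosure does not define such an interface, and the function name `play_hearing_test_audio` is an assumption.

```python
def play_hearing_test_audio(capture_image, detect_hearing_question, get_audio, play):
    """Run steps 210-230 once; return True if audio was played."""
    image = capture_image()                      # step 210: obtain image information
    question = detect_hearing_question(image)    # step 220: detect a hearing test question
    if question is None:
        return False                             # nothing to play for this image
    audio = get_audio(question)                  # step 220: acquire the matching audio
    play(audio)                                  # step 230: play in the preset mode
    return True
```

Passing the stages in as parameters keeps the sketch neutral about whether each stage runs on the terminal device or on a server.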
In some optional implementations of this embodiment, referring to FIG. 4, acquiring the audio information corresponding to the hearing test question in step 220 of the method flow 200 may be performed as follows:
Step 410, in response to finding the hearing test question in the preset hearing test resource library, acquiring the audio information corresponding to the hearing test question from the preset hearing test resource library.
In this step, the execution subject may search the hearing test resource library using the content of the hearing test question and judge whether the library stores a hearing test question with the same content. When the execution subject finds the same hearing test question in the library, it determines that the library stores both the hearing test question and its corresponding audio information, and then retrieves that audio information from the library.
In this implementation, the execution subject obtains the audio information by searching the hearing test resource library, so the audio information is acquired automatically and the user does not need to obtain it through other operations, which improves the user's learning efficiency and saves user operations.
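A minimal sketch of the lookup in step 410 follows, assuming the resource library can be modeled as an in-memory mapping from question text to an audio reference. The class `HearingTestRepository`, the `normalize` helper, and the idea of normalizing text before comparison are assumptions added for illustration; the disclosure only requires finding a stored question with the same content.

```python
import re


def normalize(text):
    """Lowercase and collapse punctuation/whitespace so small recognition differences matter less."""
    return re.sub(r"[\W_]+", " ", text.lower()).strip()


class HearingTestRepository:
    """Hypothetical in-memory stand-in for the preset hearing test resource library."""

    def __init__(self):
        self._audio_by_question = {}

    def add(self, question_text, audio_ref):
        self._audio_by_question[normalize(question_text)] = audio_ref

    def find_audio(self, question_text):
        """Return the stored audio reference, or None if the question is absent."""
        return self._audio_by_question.get(normalize(question_text))
```

A `None` result corresponds to the case in which the hearing test question is not found in the library.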
In some optional implementations of this embodiment, with further reference to FIG. 4, acquiring the audio information corresponding to the hearing test question in step 220 of the method flow 200 may further be performed as follows:
Step 420, in response to not finding the hearing test question in the preset hearing test resource library, pushing to the user prompt information for acquiring image information of the answer to the hearing test question.
In this step, the execution subject does not find the hearing test question in the hearing test resource library and therefore determines that the library does not contain the corresponding audio information. The execution subject may then push prompt information to the user by voice playback, video playback, text display, or the like. The prompt information tells the user that the execution subject needs to acquire an image of the answer to the hearing test question; for example, the prompt may be "Please turn to the answer page so that I can collect the hearing test audio" or "Please find the content of the corresponding answer page".
Step 430, in response to determining that the user has performed the corresponding operation according to the prompt information, acquiring image information of the answer to the hearing test question.
In this step, after the execution subject pushes the prompt information, the user can find the answer page corresponding to the hearing test question according to the prompt. The execution subject can determine through real-time shooting whether the user has found the answer, and when it determines from the captured content that the user has performed the corresponding operation, it shoots the image information of the answer to the hearing test question through the camera. As an example, the execution subject plays the prompt "Please turn to the answer page so that I can collect the hearing test audio" to the user, starts shooting the user's actions in real time, and judges from the captured video whether the user is looking for the answer page as prompted. When it determines that the user has turned to the corresponding answer page, the execution subject shoots the answer content on that page through the camera to obtain the image information of the answer to the hearing test question.
Step 440, recognizing the image information of the answer to the hearing test question and generating audio information of the recognized answer.
In this step, after acquiring the image information of the answer to the hearing test question, the execution subject may perform text recognition on the answer content in the image information using an image recognition method, and then generates the audio information corresponding to the answer from the recognized text.
In this implementation, when the hearing test question is not found in the hearing test resource library, the execution subject collects the image information of the answer by pushing prompt information. This makes the ways of acquiring audio information more diverse, requires no participation by other people, and simplifies the operation of acquiring the audio information.
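The two acquisition paths of FIG. 4 (library lookup, then fallback to the answer page) can be combined in one sketch. All parameters are hypothetical callables standing in for the library search, the prompt, the camera, text recognition, and speech generation; none of these interfaces is defined by the disclosure, and returning `None` when no answer image is captured is an assumption of this sketch.

```python
def acquire_audio(question_text, find_in_library, prompt_user,
                  capture_answer_image, recognize_text, synthesize_speech):
    """Return audio for a question, falling back to the answer page (steps 410-440)."""
    audio = find_in_library(question_text)          # step 410: search the resource library
    if audio is not None:
        return audio
    # Step 420: the question is not in the library, so ask for the answer page.
    prompt_user("Please turn to the answer page so that I can collect the hearing test audio.")
    answer_image = capture_answer_image()           # step 430: shoot the answer page
    if answer_image is None:
        return None                                 # the user did not show an answer page
    answer_text = recognize_text(answer_image)      # step 440: text recognition
    return synthesize_speech(answer_text)           # step 440: generate audio from the text
```

The prompt is pushed only on the fallback path, so a question that is already in the library is served without any user interaction.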
With further reference to fig. 5, as an implementation of the methods shown in the above figures, the present disclosure provides one embodiment of an audio information playing apparatus. This device embodiment corresponds to the method embodiment shown in fig. 2.
As shown in FIG. 5, the audio information playing apparatus 500 of the present embodiment may include: a first acquisition unit 510 configured to acquire image information of a test question; a second acquisition unit 520 configured to acquire, in response to detecting that the image information contains a hearing test question, audio information corresponding to the hearing test question; and a playing unit 530 configured to play the audio information corresponding to the hearing test question according to a preset playing mode.
In some optional implementations of this embodiment, the second acquisition unit 520 is further configured to: in response to finding the hearing test question in a preset hearing test resource library, acquire the audio information corresponding to the hearing test question from the preset hearing test resource library.
In some optional implementations of this embodiment, the second acquisition unit 520 is further configured to: in response to not finding the hearing test question in the preset hearing test resource library, push to the user prompt information for acquiring image information of the answer to the hearing test question; in response to determining that the user has performed the corresponding operation according to the prompt information, acquire the image information of the answer to the hearing test question; and recognize the image information of the answer, generate audio information of the recognized answer, and play the audio information of the answer to the user.
In some optional implementations of this embodiment, the playing unit 530 is further configured to: acquire the type of the hearing test question and determine the preset playing mode corresponding to that type; and play the audio information corresponding to the hearing test question according to the determined preset playing mode.
In some optional implementations of this embodiment, the types of hearing test question include a conversation type, and the preset playing mode corresponding to the conversation type includes alternately outputting the audio of the conversation content of different conversation participants, using a different timbre for each participant.
The apparatus provided by the above embodiment of the disclosure acquires image information of a test question, acquires audio information corresponding to a hearing test question in response to detecting that the image information contains the hearing test question, and finally plays the audio information according to a preset playing mode. Whether a hearing test question is present can thus be judged from the current test question, and the corresponding audio information can be acquired automatically and played for the user. This avoids the situation in which a user who is a child cannot acquire the audio information independently, so a parent no longer needs to obtain it for the child, which improves the child's learning efficiency and saves the parent's operations and time.
Referring now to FIG. 6, a schematic diagram of an electronic device 600 suitable for implementing embodiments of the present disclosure is shown. The electronic device shown in FIG. 6 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present disclosure.
As shown in FIG. 6, the electronic device 600 may include a processing device (e.g., a central processing unit, a graphics processor, etc.) 601 that may perform various appropriate actions and processes in accordance with a program stored in a read-only memory (ROM) 602 or a program loaded from a storage device 608 into a random access memory (RAM) 603. The RAM 603 also stores various programs and data necessary for the operation of the electronic device 600. The processing device 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.
Generally, the following devices may be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 607 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; storage devices 608 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 609. The communication device 609 may allow the electronic device 600 to communicate with other devices wirelessly or by wire to exchange data. While FIG. 6 illustrates an electronic device 600 having various devices, it is to be understood that not all of the illustrated devices are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided. Each block shown in FIG. 6 may represent one device or multiple devices, as desired.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication device 609, installed from the storage device 608, or installed from the ROM 602. The computer program, when executed by the processing device 601, performs the above-described functions defined in the methods of the embodiments of the present disclosure.
It should be noted that the computer readable medium of the embodiments of the present disclosure may be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In embodiments of the disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In embodiments of the present disclosure, however, a computer readable signal medium may comprise a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. 
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
The computer readable medium may be embodied in the electronic device, or may exist separately without being assembled into the electronic device. The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: acquire image information of a test question; in response to detecting that the image information contains a hearing test question, acquire audio information corresponding to the hearing test question; and play the audio information corresponding to the hearing test question according to a preset playing mode.
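The three operations carried by such a program can be sketched end to end in Python as follows. This is only an illustrative outline under stated assumptions: the keyword-based detection rule and the marker strings are invented for the example (this passage does not specify how a hearing test question is detected), and text recognition, audio retrieval, and playback are supplied by the caller as hypothetical callables.

```python
from typing import Callable, Optional

# Assumed markers suggesting a listening question; not taken from the disclosure.
LISTENING_MARKERS = ("listen", "听力")


def contains_hearing_test_question(recognized_text: str) -> bool:
    """Very rough detection: look for a listening marker in the recognized text."""
    lowered = recognized_text.lower()
    return any(marker in lowered for marker in LISTENING_MARKERS)


def handle_image(
    image: bytes,
    recognize_text: Callable[[bytes], str],
    fetch_audio: Callable[[str], Optional[bytes]],
    play: Callable[[bytes], None],
) -> bool:
    """Run the acquire, detect, obtain, and play steps; return True if audio was played."""
    text = recognize_text(image)                  # image information of the test question
    if not contains_hearing_test_question(text):  # detection step
        return False
    audio = fetch_audio(text)                     # audio corresponding to the question
    if audio is None:
        return False
    play(audio)                                   # playback in the preset mode
    return True
```

A caller would pass its own recognizer, audio source, and player; with stub callables the function can be exercised without any camera or speaker.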
Computer program code for carrying out operations of embodiments of the present disclosure may be written in any combination of one or more programming languages, including object oriented programming languages such as Java, Smalltalk, and C++, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present disclosure may be implemented by software or hardware. The described units may also be provided in a processor, which may be described, for example, as: a processor including a first acquisition unit, a second acquisition unit, and a playing unit. In some cases the names of these units do not limit the units themselves; for example, the first acquisition unit may also be described as "a unit that acquires image information of a test question".
The foregoing description is only a description of the preferred embodiments of the disclosure and of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention in the embodiments of the present disclosure is not limited to the specific combination of the above-mentioned features, but also encompasses other embodiments formed by any combination of the above-mentioned features or their equivalents without departing from the inventive concept defined above. For example, it encompasses technical solutions formed by replacing the above features with (but not limited to) technical features having similar functions disclosed in the embodiments of the present disclosure.

Claims (12)

1. An audio information playing method, comprising:
acquiring image information of a test question;
in response to detecting that the image information contains a hearing test question, acquiring audio information corresponding to the hearing test question; and
playing the audio information corresponding to the hearing test question according to a preset playing mode.
2. The method of claim 1, wherein the acquiring audio information corresponding to the hearing test question comprises:
in response to the hearing test question being found in a preset hearing test resource library, acquiring the audio information corresponding to the hearing test question from the preset hearing test resource library.
3. The method of claim 2, wherein the acquiring audio information corresponding to the hearing test question further comprises:
in response to the hearing test question not being found in the preset hearing test resource library, pushing to a user prompt information for acquiring image information of an answer to the hearing test question;
in response to the user performing a corresponding operation according to the prompt information, acquiring the image information of the answer to the hearing test question; and
recognizing the image information of the answer to the hearing test question and generating audio information of the recognized answer.
4. The method according to any one of claims 1 to 3, wherein the playing the audio information corresponding to the hearing test question according to a preset playing mode comprises:
acquiring the type of the hearing test question, and determining the preset playing mode corresponding to the type of the hearing test question; and
playing the audio information corresponding to the hearing test question according to the preset playing mode.
5. The method of claim 4, wherein the type of the hearing test question comprises a conversation type; and the preset playing mode corresponding to the type of the hearing test question comprises alternately outputting the audio of the conversation content of different conversation participants, using a different timbre for each participant.
6. An audio information playing apparatus, comprising:
a first acquisition unit configured to acquire image information of a test question;
a second acquisition unit configured to, in response to detecting that the image information contains a hearing test question, acquire audio information corresponding to the hearing test question; and
a playing unit configured to play the audio information corresponding to the hearing test question according to a preset playing mode.
7. The apparatus of claim 6, wherein the second acquisition unit is further configured to:
in response to the hearing test question being found in a preset hearing test resource library, acquire the audio information corresponding to the hearing test question from the preset hearing test resource library.
8. The apparatus of claim 7, wherein the second acquisition unit is further configured to:
in response to the hearing test question not being found in the preset hearing test resource library, push to a user prompt information for acquiring image information of an answer to the hearing test question;
in response to the user performing a corresponding operation according to the prompt information, acquire the image information of the answer to the hearing test question; and
recognize the image information of the answer to the hearing test question and generate audio information of the recognized answer.
9. The apparatus of any of claims 6-8, wherein the playing unit is further configured to:
acquire the type of the hearing test question, and determine the preset playing mode corresponding to the type; and
play the audio information corresponding to the hearing test question according to the preset playing mode.
10. The apparatus of claim 9, wherein the type of the hearing test question comprises a conversation type; and the preset playing mode corresponding to the type of the hearing test question comprises alternately outputting the audio of the conversation content of different conversation participants, using a different timbre for each participant.
11. An electronic device, comprising:
one or more processors;
a storage device storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of any of claims 1-5.
12. A computer-readable medium storing a computer program which, when executed by a processor, carries out the method according to any one of claims 1-5.
CN202010147898.0A 2020-03-05 2020-03-05 Audio information playing method and device Pending CN112307249A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010147898.0A CN112307249A (en) 2020-03-05 2020-03-05 Audio information playing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010147898.0A CN112307249A (en) 2020-03-05 2020-03-05 Audio information playing method and device

Publications (1)

Publication Number Publication Date
CN112307249A true CN112307249A (en) 2021-02-02

Family

ID=74336435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010147898.0A Pending CN112307249A (en) 2020-03-05 2020-03-05 Audio information playing method and device

Country Status (1)

Country Link
CN (1) CN112307249A (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009206986A (en) * 2008-02-28 2009-09-10 Canon Inc Image output device and image output method
CN104615689A (en) * 2015-01-22 2015-05-13 百度在线网络技术(北京)有限公司 Searching method and device
CN106710327A (en) * 2015-07-31 2017-05-24 曾晓敏 English-based point reading book system
WO2017146344A1 (en) * 2016-02-25 2017-08-31 (주)뤼이드 Method, apparatus, and computer program for providing personalized educational content
CN107885482A (en) * 2017-11-07 2018-04-06 广东欧珀移动通信有限公司 Audio frequency playing method, device, storage medium and electronic equipment
CN109377795A (en) * 2018-09-27 2019-02-22 广东小天才科技有限公司 A kind of the study exchange method and smart machine of smart machine
CN109871128A (en) * 2019-03-13 2019-06-11 广东小天才科技有限公司 A kind of topic type recognition methods and device
CN109885721A (en) * 2019-02-18 2019-06-14 深圳市沃特沃德股份有限公司 Play method, apparatus, computer equipment and the storage medium of audio-frequency information
CN110297938A (en) * 2019-06-20 2019-10-01 北京奇艺世纪科技有限公司 A kind of audio frequency playing method, device and terminal
CN110428674A (en) * 2019-08-15 2019-11-08 湖北纽云教育科技发展有限公司 A kind of application method of listening study device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
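Yang Xiaoyin: "Implementation of an online examination system with diversified question types" (多样化题型在线考试系统的实现), Digital Communication World (数字通信世界), no. 11 *
Pan Lixia; Xu Wenbin; Ren Yingchun; Li Shibao: "Construction of a mobile learning platform for foreign-language listening based on speech processing" (基于语音处理的外语听力移动学习平台构建), Research and Exploration in Laboratory (实验室研究与探索), no. 11 *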

Similar Documents

Publication Publication Date Title
US11158102B2 (en) Method and apparatus for processing information
WO2020065840A1 (en) Computer system, speech recognition method, and program
US11023716B2 (en) Method and device for generating stickers
CN110536166B (en) Interactive triggering method, device and equipment of live application program and storage medium
CN106840209B (en) Method and apparatus for testing navigation applications
CN112364144A (en) Interaction method, device, equipment and computer readable medium
CN112102836B (en) Voice control screen display method and device, electronic equipment and medium
CN109995543B (en) Method and apparatus for adding group members
CN112309387A (en) Method and apparatus for processing information
CN110109597B (en) Singing list switching method, device, system, terminal and storage medium
CN112309389A (en) Information interaction method and device
CN110196900A (en) Exchange method and device for terminal
CN108766429B (en) Voice interaction method and device
CN113299285A (en) Device control method, device, electronic device and computer-readable storage medium
CN112307249A (en) Audio information playing method and device
CN112287171A (en) Information processing method and device and electronic equipment
CN114613350A (en) Test method, test device, electronic equipment and storage medium
JP2024507734A (en) Speech similarity determination method and device, program product
CN113312928A (en) Text translation method and device, electronic equipment and storage medium
CN108881978B (en) Resource playing method and device for intelligent equipment
CN113837986A (en) Method, apparatus, electronic device, and medium for recognizing tongue picture
CN113835995B (en) Method and device for generating test cases
CN113808615B (en) Audio category positioning method, device, electronic equipment and storage medium
US11792494B1 (en) Processing method and apparatus, electronic device and medium
CN110188712B (en) Method and apparatus for processing image

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination