WO2017157067A1 - Method and apparatus for page turning of an electronic book - Google Patents

Method and apparatus for page turning of an electronic book

Info

Publication number
WO2017157067A1
WO2017157067A1 PCT/CN2016/110696 CN2016110696W WO2017157067A1 WO 2017157067 A1 WO2017157067 A1 WO 2017157067A1 CN 2016110696 W CN2016110696 W CN 2016110696W WO 2017157067 A1 WO2017157067 A1 WO 2017157067A1
Authority
WO
WIPO (PCT)
Prior art keywords
page turning
tone
text information
sound
information
Application number
PCT/CN2016/110696
Other languages
English (en)
French (fr)
Inventor
李祎哲
Original Assignee
广州阿里巴巴文学信息技术有限公司
Application filed by 广州阿里巴巴文学信息技术有限公司
Publication of WO2017157067A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/725 Cordless telephones
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00 Digital computers in general; Data processing equipment in general
    • G06F15/02 Digital computers in general; Data processing equipment in general manually operated with input through keyboard and computation using a built-in program, e.g. pocket calculators
    • G06F15/025 Digital computers in general; Data processing equipment in general manually operated with input through keyboard and computation using a built-in program, e.g. pocket calculators adapted to a specific application
    • G06F15/0291 Digital computers in general; Data processing equipment in general manually operated with input through keyboard and computation using a built-in program, e.g. pocket calculators adapted to a specific application for reading, e.g. e-books
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Definitions

  • the present invention relates to the field of electronic reading, and in particular to a method and apparatus for page turning of an electronic book.
  • existing page turning modes fall into two main categories: manual page turning by the user and automatic page turning by the software. Automatic page turning, as the more convenient mode, is widely used.
  • among the automatic page turning modes, triggering a page turning operation by voice, using the page turning tone built into the software, is more advanced than turning pages at a speed set in the software.
  • however, when the user turns pages with the software's built-in page turning tone, the user must imitate that tone, which makes pronunciation inconvenient. Therefore, how to let the user turn pages more flexibly and conveniently while reading has become an urgent problem in existing electronic reading.
  • the present invention provides a method and apparatus for page turning of an e-book, the main purpose of which is to solve the problem that the existing e-book cannot provide a flexible and convenient voice page turning service for the user.
  • a method for page turning of an electronic book comprising:
  • collecting the set page turning tone through a microphone
  • the present invention provides a page turning device for an electronic book, comprising:
  • a collecting unit configured to collect a set page turning tone by using a microphone
  • a saving unit configured to save the collected page turning tone
  • the collecting unit is further configured to monitor the microphone to obtain sound information collected by the microphone;
  • a confirming unit configured to confirm that the sound information matches the page turning prompt sound, triggering a page turning operation corresponding to the page turning prompt sound.
  • a page turning device for an electronic book comprising a memory and a processor, the memory storing instructions that control the processor to perform the method according to the first aspect of the invention.
  • a computer readable storage medium storing program code for performing the method according to the first aspect of the invention.
  • an embodiment of the present invention provides a page turning method and apparatus for an e-book, which collect the page turning tone set by the user through a microphone and save the collected page turning tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page turning operation corresponding to the page turning tone is triggered only when the sound information is confirmed to match the page turning tone.
  • with existing software, when the user performs a page turning operation through the software's built-in page turning tone, the user must imitate that tone, which causes considerable inconvenience to the user's pronunciation.
  • by comparison, the present invention lets the user customize the page turning tone, so that a page can be turned simply by uttering it. The user does not have to deliberately imitate the page turning tone that comes with the software.
  • FIG. 1 is a flowchart of a page turning method of an e-book according to an embodiment of the present invention
  • FIG. 2 is a block diagram showing the composition of a page turning device of an electronic book according to an embodiment of the present invention
  • FIG. 3 is a block diagram showing the composition of a page turning device of another electronic book according to an embodiment of the present invention.
  • FIG. 4 is a block diagram showing the composition of an electronic device according to an embodiment of the present invention.
  • the existing page turning mode of the e-book is divided into manual page turning and software automatic page turning.
  • in the software automatic page turning mode, it is common for the user to perform a page turning operation through the software's built-in page turning tone, but in this mode the user must imitate that tone, which causes considerable inconvenience to the user's pronunciation. Therefore, the existing page turning modes of e-books cannot provide the user with a flexible and convenient voice page turning service.
  • an embodiment of the present invention provides a method for page turning of an e-book, which can turn pages according to a user-defined page turning prompt tone, and provides a more flexible and convenient voice page turning service for the user.
  • the method includes:
  • collecting the set page turning tone through the microphone.
  • the software's built-in page turning tone forces the user to imitate it when triggering a page turning operation, which causes many inconveniences: for example, the user's pronunciation may be inaccurate and fail to match the page turning tone, or the user may be unable to imitate the built-in tone accurately. Therefore, the page turning method of the electronic book provided by the present invention lets the user set the page turning tone.
  • step 101 needs to be performed to collect the set page turning tone through the microphone.
  • after the page turning tone set by the user is collected in step 101, the page turning tone must be recognized, and the recognized page turning tone saved in a specific format.
  • step 103 is performed to monitor the microphone to obtain sound information collected by the microphone.
  • the sound information includes both sound produced by the user and ambient sound; to confirm whether the sound information matches the page turning tone set by the user, the sound information must first be recognized.
  • the sound information is confirmed to match the page turning tone only when its recognition result is the same as the recognition result of the page turning tone set by the user.
  • the method for page turning of an e-book collects the page turning tone set by the user through the microphone and saves the collected page turning tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page turning operation corresponding to the page turning tone is triggered only when the sound information is confirmed to match the page turning tone.
  • with existing software, when the user performs a page turning operation through the software's built-in page turning tone, the user must imitate that tone, which causes considerable inconvenience to the user's pronunciation.
  • by comparison, the present invention lets the user customize the page turning tone, so that a page can be turned simply by uttering it. The user does not have to deliberately imitate the page turning tone that comes with the software.
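  • the overall flow described above (collect and save the set tone, monitor the microphone, confirm a match, trigger the page turn) can be sketched as follows. This is only a minimal illustration in Python: `recognize` is a hypothetical stand-in for whatever speech recognizer the client uses, the microphone is modelled as an iterable of audio chunks, and matching is reduced to equality of recognition results.

```python
from typing import Callable, Iterable


def page_turning_session(
    set_tone_audio: bytes,
    microphone: Iterable[bytes],
    recognize: Callable[[bytes], str],
    turn_page: Callable[[], None],
) -> int:
    """Run one reading session and return how many page turns were triggered.

    All parameter names are illustrative assumptions, not part of the source.
    """
    # Collect the tone set by the user and save its recognition result.
    saved_result = recognize(set_tone_audio)

    triggered = 0
    # Monitor the microphone: every chunk of collected sound is recognized.
    for sound in microphone:
        # Trigger the page turn only when the recognition results are the same.
        if recognize(sound) == saved_result:
            turn_page()
            triggered += 1
    return triggered
```

  • in this sketch ambient sounds simply fail the comparison and are ignored, so only sounds matching the saved tone turn the page.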
  • so that the user does not have to deliberately imitate the software's built-in page turning tone, the embodiment of the present invention lets the user customize the page turning tone. Specifically, the user sets the page turning tone before the microphone is monitored, as follows: first, the page turning tone, which is customized by the user and is usually the user's voice, is collected through the microphone; then, once the page turning tone set by the user has been obtained, it is saved.
  • the specific saving method is: converting the page-turning tone into a first text information list.
  • the first text information list is composed of different first text information whose pronunciation has a similarity relationship with the page turning tone.
  • taking the Android system as an example, the page turning tone is converted into the first text information list as follows: after voice input is activated, android.speech.RecognizerIntent recognizes the page turning tone and converts it into text information, and the onActivityResult() method receives that text information to form the first text information list.
  • the first text information list includes a series of different items of first text information, which have in common that their pronunciation is similar to that of the page turning tone. For example, if the page turning tone is pronounced "fan" (first tone), the recognized first text information includes the Chinese homophones rendered here as: rice, turn, complex, annoying, return, sail, reverse, van, and the like.
  • based on an information selection operation, one item of first text information may be determined from the first text information list as the first text information corresponding to the page turning tone. This approach is used because the user's pronunciation may contain some error, so recognizing the set page turning tone yields a series of possible texts; these are displayed as a list, and the user selects the text that he or she actually meant, so that the page turning tone is set accurately. After the first text information corresponding to the page turning tone has been determined from the list, it may be saved for later use when matching collected sound information against the set page turning tone.
  • the language type of the page turning tone can be set based on the language selection operation.
  • the language type can be passed as RecognizerIntent.EXTRA_LANGUAGE_MODE through the .putExtra() method; it must be a language that the client's speech recognition system can recognize, for example English or Chinese.
  • the language type may be preset at the factory or set by the user. For example, when the set language type is English and the user sets the page turning tone in Chinese, android.speech.RecognizerIntent may fail to complete the setting of the page turning tone because it cannot recognize the Chinese voice information.
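  • the setting procedure described above (choose a language type, convert the recorded tone into a candidate list, let the user pick one candidate, save it) might look roughly like the sketch below. It is written in Python rather than against the Android API; `recognize` and `choose` are hypothetical callbacks standing in for the platform recognizer and the on-screen selection list.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical recognizer signature: (audio, language) -> candidate texts whose
# pronunciation is similar to the audio (the "first text information list").
Recognizer = Callable[[bytes, str], List[str]]


@dataclass
class PageTurnSetting:
    language: str                     # language type selected for the tone
    saved_text: Optional[str] = None  # first text information that was saved


def set_page_turning_tone(
    audio: bytes,
    language: str,
    recognize: Recognizer,
    choose: Callable[[List[str]], int],
) -> PageTurnSetting:
    """Convert the tone to candidates, let the user pick one, and save it."""
    candidates = recognize(audio, language)
    if not candidates:
        # E.g. the tone was spoken in a language other than the set one.
        raise ValueError("the tone could not be recognized in the set language")
    index = choose(candidates)  # the information selection operation
    return PageTurnSetting(language=language, saved_text=candidates[index])
```

  • raising an error when no candidate is returned mirrors the case in which the recognizer cannot handle speech in a language other than the one that was set; how a real client reports this is not specified in the source.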
  • the embodiment of the present invention performs the corresponding page turning operation by recognizing the user's pronunciation. Therefore, after the electronic reading application is started, the sound information collected by the microphone is obtained by monitoring the microphone, in order to determine whether the sound information matches the page turning tone.
  • the client obtains the sound information collected by the microphone, that is, the client receives the external voice information.
  • the Android client is taken as an example for description.
  • the process of recognizing the collected sound information is similar to the process of recognizing the page turning tone set by the user.
  • android.speech is the core Android package for voice input, and android.speech.RecognizerIntent is one of its main classes. The activity it launches receives voice input and recognizes the voice content as text.
  • the client can obtain the sound information collected by the microphone through android.speech.RecognizerIntent; this sound information may include sound produced by the user and may also contain sound generated by the environment.
  • the language type of the acquired sound information may also be set; it may be the language type initialized when the client leaves the factory, or a language type customized by the user while using the client.
  • a client using the Android system passes the language type as RecognizerIntent.EXTRA_LANGUAGE_MODE in the .putExtra() method; the language type may be English or Chinese.
  • when the set language type is Chinese, android.speech.RecognizerIntent recognizes only Chinese sound information and converts it into text.
  • the second text information list is composed of different items of second text information whose pronunciation is similar to the sound information, arranged in descending order of similarity.
  • the sound information is converted into a second text information list; for example, in the Android system, after voice input is activated, android.speech.RecognizerIntent recognizes the sound information and converts it into text information, using the startActivityForResult() method.
  • the second text information list includes a series of different items of second text information, which have in common that their pronunciation is similar to that of the acquired sound information. For example, if the acquired sound information is pronounced "fan", the second text information obtained by recognition and conversion includes: rice, turn, complex, annoying, return, sail, reverse, van, and the like.
  • the second text information is arranged in the second text information list in descending order of pronunciation similarity to the acquired sound information. Taking the second text information listed above as an example, "turn" and "sail", pronounced in the first tone, are more similar to the sound than "annoying" and "complex" in the second tone, "return" and "reverse" in the third tone, and "rice" and "van" in the fourth tone, so the order in the second text information list is: turn, sail, annoying, complex, return, reverse, rice, van.
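  • the source does not say how pronunciation similarity is computed. Purely as an illustration of "descending order of similarity", the sketch below assumes that similarity can be approximated by the distance between Mandarin tone numbers; the `Candidate` type and the tone values attached to the English glosses are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    text: str  # English gloss of a recognized candidate
    tone: int  # Mandarin tone number 1-4 (illustrative data)


def rank_by_similarity(candidates: List[Candidate], spoken_tone: int) -> List[str]:
    """Order candidates from most to least similar to the spoken sound.

    Assumption: a smaller tone distance means a higher similarity. sorted()
    is stable, so equally similar candidates keep their input order.
    """
    ranked = sorted(candidates, key=lambda c: abs(c.tone - spoken_tone))
    return [c.text for c in ranked]


candidates = [
    Candidate("rice", 4), Candidate("turn", 1), Candidate("annoying", 2),
    Candidate("complex", 2), Candidate("return", 3), Candidate("sail", 1),
    Candidate("reverse", 3), Candidate("van", 4),
]
second_text_list = rank_by_similarity(candidates, spoken_tone=1)
# second_text_list: turn, sail, annoying, complex, return, reverse, rice, van
```

  • a real recognizer would normally return its own confidence scores; the tone distance is used here only because it reproduces the ordering given in the example.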
  • the target text information set is generally composed of the N top-ranked items of second text information in the second text information list, that is, the TOP N second text information, where N is a positive integer.
  • N may be 1, or it may be greater than 1 in order to avoid deviations between the sound information and the page turning tone caused by changes in the user's pronunciation or by the same pronunciation yielding several recognition results.
  • for example, the TOP 3 items of second text information in the second text information list may be selected as the target text information set, which then includes: turn, sail, annoying. When determining whether the acquired sound information matches the page turning tone set by the user, the target text information set (turn, sail, annoying) is searched for the first text information corresponding to the page turning tone.
  • when the target text information set includes the first text information corresponding to the page turning tone, it may be determined that the sound information matches the page turning tone; when the first text information corresponding to the page turning tone is not included in the target text information set, it may be determined that the sound information does not match the page turning tone.
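  • the TOP N matching rule can be illustrated with a few lines of Python. The function name and the default N = 3 are assumptions chosen to mirror the example; the list is expected to be already sorted in descending order of similarity.

```python
from typing import List


def matches_page_turning_tone(second_text_list: List[str], saved_text: str, n: int = 3) -> bool:
    """Return True if the saved first text information is among the TOP N
    entries of the (already ranked) second text information list."""
    if n < 1:
        raise ValueError("N must be a positive integer")
    target_set = second_text_list[:n]  # the target text information set
    return saved_text in target_set


ranked = ["turn", "sail", "annoying", "complex", "return", "reverse", "rice", "van"]
is_match = matches_page_turning_tone(ranked, "turn")     # inside the TOP 3
is_not_match = matches_page_turning_tone(ranked, "rice")  # outside the TOP 3
```

  • a larger N tolerates more variation in the user's pronunciation at the cost of more accidental matches; N = 1 requires the saved text to be the best recognition result.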
  • the sound information collected by the microphone usually includes both user sound information and environmental sound information, and the user's voice information is used to match the page turning sound, and the environmental sound information is useless sound information. Therefore, when a plurality of kinds of sound information are collected through the microphone, if each of the collected sound information is identified according to the above embodiment, the recognition process takes a long time, resulting in a delay in the page turning operation.
  • the embodiment of the present invention provides an implementation manner in which, before determining whether the acquired sound information matches the page turning tone set by the user, the sound information corresponding to ambient sound is filtered out of the acquired sound information.
  • the embodiment of the present invention provides two implementation manners for filtering sound information corresponding to an environmental sound from the acquired sound information. These two implementations include:
  • the sound information whose volume is less than the preset volume threshold is removed from the acquired sound information.
  • the volumes picked up by the microphone vary. The volume of distant ambient sound is necessarily much lower than the volume of the user's voice near the microphone. Therefore, sound information whose volume is less than the preset volume threshold can be removed from the acquired sound information.
  • the preset volume threshold may be the average volume of the user sound information recognized by the microphone. Acquired sound information whose volume is less than the preset volume threshold can be regarded as useless ambient sound and does not need to be matched against the page turning tone afterwards, which greatly reduces the time needed to recognize the page turning tone in the acquired sound information and improves recognition efficiency.
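  • a minimal sketch of the volume-based filter follows. The `SoundClip` type and its `volume` field are hypothetical; the source does not say how volume is measured, so any consistent loudness value (for example an RMS amplitude) is assumed.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List, Sequence


@dataclass
class SoundClip:
    label: str     # illustrative description of the clip
    volume: float  # hypothetical loudness measure; units are an assumption


def volume_threshold(user_volumes: Sequence[float]) -> float:
    """Preset threshold: average volume of previously recognized user sounds."""
    return mean(user_volumes)


def drop_quiet_clips(clips: List[SoundClip], threshold: float) -> List[SoundClip]:
    """Remove clips whose volume is less than the threshold (treated as ambient)."""
    return [clip for clip in clips if clip.volume >= threshold]


threshold = volume_threshold([60.0, 70.0])
kept = drop_quiet_clips(
    [SoundClip("user", 68.0), SoundClip("distant tv", 40.0), SoundClip("desk fan", 65.0)],
    threshold,
)
```

  • only the clips that survive this filter would then be passed on to recognition and matching.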
  • the matching success rate corresponding to the acquired sound information may be searched in a preset sound information database.
  • the sound information library records the success rate with which each previously recognized kind of sound information matched the user's voice information. If the matching success rate of the acquired sound information is less than the preset success rate threshold, the acquired sound information may be regarded as sound information corresponding to ambient sound and can be rejected.
  • the acquired sound information may be added to the sound information library, and recording of its matching success rate then begins.
  • the matching success rate statistics may be computed in the background when the electronic book (reading application) is opened or closed. In this way, useless ambient sound information can be eliminated before determining whether the acquired sound information matches the page turning tone set by the user, which greatly reduces the time needed to recognize the page turning tone in the acquired sound information and improves recognition efficiency.
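  • the success-rate filter could be sketched as below. Several details are assumptions because the source leaves them open: sounds are identified by a hypothetical string key (for example an audio fingerprint), a sound not yet in the library is added and allowed through, and a sound with no recorded attempts is not rejected.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class MatchStats:
    attempts: int = 0
    successes: int = 0

    @property
    def success_rate(self) -> float:
        return self.successes / self.attempts if self.attempts else 0.0


@dataclass
class SoundLibrary:
    threshold: float  # preset success rate threshold
    stats: Dict[str, MatchStats] = field(default_factory=dict)

    def is_ambient(self, key: str) -> bool:
        """Return True if the sound should be rejected as ambient sound."""
        entry = self.stats.get(key)
        if entry is None:
            # Assumption: an unseen sound is added to the library and let through.
            self.stats[key] = MatchStats()
            return False
        if entry.attempts == 0:
            return False  # no history yet, so nothing to judge by
        return entry.success_rate < self.threshold

    def record(self, key: str, matched: bool) -> None:
        """Update the statistics after a matching attempt."""
        entry = self.stats.setdefault(key, MatchStats())
        entry.attempts += 1
        entry.successes += int(matched)
```

  • calling `record` after every matching attempt keeps the statistics current; a client could instead batch these updates in the background, as the text above suggests.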
  • a prompt such as "the currently input sound information is invalid, please re-enter the sound information" may be outputted so that the user re-enters the sound information according to the prompt.
  • the embodiment of the present invention further provides an implementation manner in which, when the page turning tone uttered by the user includes a number M, the page turning operation corresponding to the page turning tone turns M pages consecutively, where M is a positive integer.
  • the page turning tone in each of the above embodiments includes: a forward page turning tone (turn to the previous page) and a backward page turning tone (turn to the next page).
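  • turning M pages in a given direction can be sketched as follows. How the number M appears in the recognized text is not specified in the source; this sketch assumes it appears as digits, defaults to one page when no number is present, and clamps the result to the bounds of the book. Function and parameter names are illustrative.

```python
import re


def pages_to_turn(recognized_text: str) -> int:
    """Extract the number M from the recognized tone; default to 1 page."""
    match = re.search(r"\d+", recognized_text)
    if match is None:
        return 1
    return max(1, int(match.group()))  # M must be a positive integer


def turn_pages(current: int, total: int, direction: str, count: int) -> int:
    """Return the new zero-based page index after turning `count` pages."""
    if direction not in ("previous", "next"):
        raise ValueError("direction must be 'previous' or 'next'")
    step = count if direction == "next" else -count
    return max(0, min(total - 1, current + step))


new_page = turn_pages(current=10, total=100, direction="next", count=pages_to_turn("turn 3"))
```

  • here `direction` would be chosen according to whether the forward or the backward page turning tone was matched.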
  • the user's sound information mentioned in the embodiments of the present invention may be the user's speech, or may include sounds that the user makes by clapping hands, slapping palms, and the like; such sounds may be used to set the page turning tone in the embodiments of the invention.
  • an embodiment of the present invention provides a page turning device for an electronic book.
  • the device includes: a collecting unit 21, a saving unit 22, and a confirming unit 23, wherein:
  • the collecting unit 21 is configured to collect the set page turning tone through the microphone
  • a saving unit 22 configured to save the collected page turning tone
  • the collecting unit 21 is further configured to monitor the microphone to obtain sound information collected by the microphone;
  • the confirming unit 23 is configured to confirm that the sound information matches the page turning prompt sound, and trigger a page turning operation corresponding to the page turning prompt sound.
  • the saving unit 22 includes:
  • the first conversion module 221 is configured to convert the page turning tone into a first text information list, where the first text information list is composed of different first text information having a similarity relationship between the pronunciation and the page turning tone;
  • the first determining module 222 is configured to determine, according to the information selection operation, a first text information from the first text information list as the first text information corresponding to the page turning prompt tone;
  • the saving module 223 is configured to save the determined first text information.
  • the confirmation unit 23 includes:
  • the second conversion module 231 is configured to convert the sound information into a second text information list, where the second text information list is composed of different second text information whose pronunciation is similar to the sound information, arranged in descending order of similarity;
  • the determining module 232 is configured to determine whether the saved first text information is included in the target text information set of the second text information list, the target text information set being composed of the TOP N second text information of the second text information list, where N is a positive integer;
  • the second determining module 233 is configured to determine that the sound information matches the page turning tone when the determination result is that the saved first text information is included in the target text information set.
  • the apparatus further includes: a filtering unit 24, configured to filter the sound information corresponding to ambient sound from the acquired sound information before it is confirmed that the sound information matches the page turning tone.
  • the filtering unit 24 includes:
  • the first filtering module 241 is configured to remove, from the obtained sound information, sound information whose volume is less than a preset volume threshold.
  • the filtering unit 24 includes a second filtering module 242 for:
  • looking up the matching success rate of the acquired sound information in the preset sound information library and, if the matching success rate is less than the preset success rate threshold, rejecting the acquired sound information.
  • the collecting unit 21 is configured to set the language type of the page turning tone based on the language selection operation before collecting the set page turning tone through the microphone.
  • when the page turning tone collected by the collecting unit 21 includes a number M, the page turning operation corresponding to the page turning tone turns M pages consecutively, where M is a positive integer.
  • the page turning tone collected by the collecting unit 21 includes: a forward page turning tone and a backward page turning tone.
  • the page turning device of the e-book provided by the embodiment of the invention collects the page turning tone set by the user through the microphone and saves the collected page turning tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page turning operation corresponding to the page turning tone is triggered only when the sound information is confirmed to match the page turning tone.
  • with existing software, when the user performs a page turning operation through the software's built-in page turning tone, the user must imitate that tone, which causes considerable inconvenience to the user's pronunciation.
  • by comparison, the present invention lets the user customize the page turning tone, so that a page can be turned simply by uttering it. The user does not have to deliberately imitate the page turning tone that comes with the software.
  • an embodiment of the present invention further provides a hardware structure of a page turning device of an electronic book
  • FIG. 4 is a block schematic diagram of a hardware structure.
  • the electronic device includes a memory 401 and a processor 402; the memory 401 stores instructions that control the processor 402 to perform the page turning method of the electronic book according to the present invention.
  • the memory 401 can include high speed random access memory and can also include non-volatile memory such as one or more magnetic storage devices, flash memory, or other non-volatile solid state memory.
  • the processor 402 can include, but is not limited to, a processing device such as a microprocessor MCU, a digital signal processor DSP, or a programmable logic device FPGA.
  • the electronic device may further include an input device 404, a communication device 406, an interface device 403, a display device 405, and the like.
  • the communication device 406 can, for example, perform wired or wireless communication.
  • the interface device 403 includes, for example, a USB interface, a network port, and the like.
  • the input device 404 may include, for example, a touch screen, a button, or the like to input various information.
  • the display device 405 is, for example, a liquid crystal display, a touch display, or the like to display the contents of the electronic book.
  • the electronic device 400 of the present invention may involve only some of these devices, such as the processor 402, the memory 401, the display device 405, and the like.
  • Embodiments of the present invention also provide a computer readable storage medium.
  • the foregoing storage medium may be used to save program code executed by the page turning method provided by the present invention.
  • the computer readable storage medium can be a tangible device that can hold and store the instructions used by the instruction execution device.
  • the computer readable storage medium can be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
  • Non-exhaustive examples of computer readable storage media include: portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disk read only memory (CD-ROM), digital versatile disks (DVD), memory sticks, floppy disks, mechanically encoded devices such as punch cards or raised structures in a groove with instructions stored thereon, and any suitable combination of the above.
  • a computer readable storage medium as used herein is not to be interpreted as a transient signal itself, such as a radio wave or other freely propagating electromagnetic wave, an electromagnetic wave propagating through a waveguide or other transmission medium (e.g., a light pulse through a fiber optic cable), or an electrical signal transmitted through a wire.
  • the computer readable program instructions described herein can be downloaded from a computer readable storage medium to the respective computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers, and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in each computing/processing device.
  • Computer program instructions for performing the page turning method of the present invention may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine related instructions, microcode, firmware instructions, state setting data, or source code or object code written in any combination of one or more programming languages, including object oriented programming languages such as Smalltalk, C++, etc., as well as conventional procedural programming languages such as the "C" language or similar programming languages.
  • the computer readable program instructions can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or wide area network (WAN), or can be connected to an external computer (e.g., through the Internet using an Internet service provider).
  • an electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by utilizing state information of the computer readable program instructions.
  • Computer readable program instructions are executed to implement various aspects of the present invention.
  • the computer readable program instructions can be provided to a processor of a general purpose computer, a special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, when executed by the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in one or more of the blocks of the flowcharts and/or block diagrams.
  • the computer readable program instructions can also be stored in a computer readable storage medium and cause the computer, programmable data processing device, and/or other device to operate in a particular manner, such that the computer readable medium storing the instructions includes an article of manufacture that includes instructions for implementing various aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • Computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other device, such that a series of operational steps are performed on the computer, other programmable data processing apparatus, or other device to produce a computer-implemented process, so that the instructions executed on the computer, other programmable data processing apparatus, or other device implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • the descriptions of the various embodiments each have their own emphasis, and details not covered in one embodiment can be found in the related descriptions of other embodiments.
  • modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • the modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components.
  • all features disclosed in this specification (including the accompanying claims, the abstract and the drawings), and all processes or units of any method or device so disclosed, may be combined in any combination.
  • Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
  • the various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof.
  • a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the title of the invention (e.g., a device for determining the level of links within a website) in accordance with embodiments of the present invention.
  • the invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
  • Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

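The present invention discloses a page-turning method and device for an electronic book, relates to the field of electronic reading, and solves the problem that existing electronic books cannot provide users with a flexible and convenient voice page-turning service. The method of the present invention includes: collecting, through a microphone, a page-turning prompt tone that is being set; saving the collected page-turning prompt tone; monitoring the microphone to obtain sound information collected by the microphone; and, upon confirming that the sound information matches the page-turning prompt tone, triggering the page-turning operation corresponding to the page-turning prompt tone. The present invention is mainly used in reading software of all kinds, such as software for reading news and novels.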

Description

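Page-Turning Method and Device for an Electronic Book
Technical Field
The present invention relates to the field of electronic reading, and in particular to a page-turning method and device for an electronic book.
Background
With the development of mobile terminals, the way people read has gradually changed from reading on paper to reading electronically. Although electronic reading lets people read on the mobile terminals they carry with them, it still requires them to turn pages. As reading software has proliferated, page-turning modes have also become more varied.
The many existing page-turning modes fall mainly into two categories: manual page turning by the user and automatic page turning by the software. Automatic page turning is the more convenient of the two and is widely used. Within automatic page turning, a mode in which the user triggers the page turn by uttering the page-turning prompt tone built into the software is more advanced than a mode in which the page-turning speed is set through the software. However, to turn pages with the built-in prompt tone, the user must imitate that tone, which makes pronunciation inconvenient for the user. How to let users turn pages by voice more flexibly and conveniently while reading has therefore become a pressing problem in electronic reading.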
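Summary of the Invention
In view of this, the present invention proposes a page-turning method and device for an electronic book, whose main purpose is to solve the problem that existing electronic books cannot provide users with a flexible and convenient voice page-turning service.
According to a first aspect of the present invention, a page-turning method for an electronic book is provided, including:
collecting, through a microphone, a page-turning prompt tone that is being set;
saving the collected page-turning prompt tone;
monitoring the microphone to obtain sound information collected by the microphone; and
upon confirming that the sound information matches the page-turning prompt tone, triggering the page-turning operation corresponding to the page-turning prompt tone.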
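According to a second aspect of the present invention, a page-turning device for an electronic book is provided, including:
a collection unit, used to collect, through a microphone, a page-turning prompt tone that is being set;
a saving unit, used to save the collected page-turning prompt tone;
the collection unit being further used to monitor the microphone and obtain sound information collected by the microphone; and
a confirmation unit, used to confirm that the sound information matches the page-turning prompt tone and then trigger the page-turning operation corresponding to the page-turning prompt tone.
According to a third aspect of the present invention, a page-turning device for an electronic book is provided, including a memory and a processor, the memory being used to store instructions, and the instructions being used to control the processor to operate so as to perform the method according to the first aspect of the present invention.
According to a fourth aspect of the present invention, a computer readable storage medium is provided, which stores program code for performing the method according to the first aspect of the present invention.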
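By means of the above technical solution, the page-turning method and device for an electronic book provided by the embodiments of the present invention can collect, through a microphone, the page-turning prompt tone set by the user and save the collected page-turning prompt tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page-turning operation corresponding to the page-turning prompt tone is triggered only when the sound information is confirmed to match the page-turning prompt tone. In the prior art, by contrast, a user who turns pages with the software's built-in page-turning prompt tone must imitate that tone, which makes pronunciation inconvenient for the user. Compared with existing page-turning modes, which inconvenience the user's reading, the present invention therefore lets the user define a custom page-turning prompt tone, so that pages can be turned easily by voice without deliberately imitating the software's built-in prompt tone.
The above is only an overview of the technical solution of the present invention. So that the technical means of the present invention can be understood more clearly and implemented in accordance with the specification, and so that the above and other objects, features and advantages of the present invention become more apparent, specific embodiments of the present invention are set out below.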
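Brief Description of the Drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art on reading the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be regarded as limiting the present invention. Throughout the drawings, the same reference symbols denote the same components. In the drawings:
FIG. 1 shows a flowchart of a page-turning method for an electronic book provided by an embodiment of the present invention;
FIG. 2 shows a block diagram of a page-turning device for an electronic book provided by an embodiment of the present invention;
FIG. 3 shows a block diagram of another page-turning device for an electronic book provided by an embodiment of the present invention;
FIG. 4 shows a block diagram of an electronic device provided by an embodiment of the present invention.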
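Detailed Description
Exemplary embodiments of the present disclosure are described in more detail below with reference to the drawings. Although the drawings show exemplary embodiments of the present disclosure, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the present disclosure can be understood more thoroughly and its scope can be conveyed in full to those skilled in the art.
With the development of mobile terminals, more and more users read electronic books. Electronic reading also requires page turning, and the existing page-turning modes of electronic books are divided into manual page turning by the user and automatic page turning by the software. In the automatic mode, the more common approach is for the user to turn pages using the page-turning prompt tone built into the software; in this mode, however, the user must imitate the built-in prompt tone, which makes pronunciation inconvenient for the user. The existing page-turning modes of electronic books therefore cannot provide users with a flexible and convenient voice page-turning service.
To solve the above problem, an embodiment of the present invention provides a page-turning method for an electronic book that turns pages according to a page-turning prompt tone defined by the user, giving the user a more flexible and convenient voice page-turning service. As shown in FIG. 1, the method includes: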
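101. Collect, through a microphone, the page-turning prompt tone that is being set.
In the existing voice page-turning mode, the software's built-in page-turning prompt tone forces the user to imitate it in order to trigger a page turn, which inconveniences the user when speaking: for example, inaccurate pronunciation may fail to match the prompt tone, or the user may be unable to imitate the software's prompt tone accurately. The page-turning method provided by the present invention therefore lets users set the page-turning prompt tone themselves. When the user sets a custom page-turning prompt tone, step 101 is performed to collect the prompt tone being set through the microphone.
102. Save the collected page-turning prompt tone.
After the page-turning prompt tone set by the user has been collected in step 101, it needs to be recognized, and the recognized page-turning prompt tone is saved in a specific format.
103. Monitor the microphone to obtain the sound information collected by the microphone.
The present invention works by setting a personalized page-turning prompt tone and turning the page when the client determines, through speech recognition, that the user's utterance matches that prompt tone. Step 103 must therefore be performed in the subsequent implementation to monitor the microphone and obtain the sound information it collects.
104. Upon confirming that the sound information matches the page-turning prompt tone, trigger the page-turning operation corresponding to the page-turning prompt tone.
After the sound information collected by the microphone has been obtained in step 103, it must be confirmed whether the sound information matches the user-defined page-turning prompt tone. The sound information includes both sound produced by the user and ambient sound. To confirm whether the sound information matches the user-defined prompt tone, the sound information must be recognized; only when its recognition result is the same as the recognition result corresponding to the user-defined prompt tone can the sound information be confirmed to match the prompt tone. When the match is confirmed, the user has given a page-turning prompt, and the page-turning operation corresponding to the prompt tone must be triggered.
The page-turning method for an electronic book provided by the embodiment of the present invention can collect, through a microphone, the page-turning prompt tone set by the user and save the collected page-turning prompt tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page-turning operation corresponding to the page-turning prompt tone is triggered only when the sound information is confirmed to match the page-turning prompt tone. In the prior art, by contrast, a user who turns pages with the software's built-in page-turning prompt tone must imitate that tone, which makes pronunciation inconvenient for the user. Compared with existing page-turning modes, which inconvenience the user's reading, the present invention therefore lets the user define a custom page-turning prompt tone, so that pages can be turned easily by voice without deliberately imitating the software's built-in prompt tone.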
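To aid understanding of the method shown in FIG. 1, and as a refinement and extension of the above embodiment, the steps in FIG. 1 are described in detail below.
Because the embodiment of the present invention aims to spare the user from deliberately imitating the software's built-in page-turning prompt tone when turning pages by voice, the prompt tone can be defined by the user. Specifically, the user sets the page-turning prompt tone before the microphone is monitored, as follows. First, the prompt tone is collected through the microphone; it is defined by the user and is usually speech uttered by the user. Then, once the user-defined prompt tone has been obtained, it must be saved. The specific saving method is to convert the prompt tone into a first text information list, which consists of different pieces of first text information whose pronunciation has a similarity relationship with the prompt tone. Taking the Android system as an example, the conversion described here is performed as follows: after the voice-input activity is started, android.speech.RecognizerIntent recognizes the prompt tone and converts it into text information, and the text information is received through the onActivityResult() method to form the first text information list. The first text information list contains a series of different pieces of first text information, which have in common that their pronunciation is similar to that of the prompt tone. For example, if the prompt tone is pronounced fan (first tone), the first text information obtained by recognition and conversion includes 饭, 翻, 繁, 烦, 返, 帆, 反, 范 and so on. After the first text information list has been obtained in this way, one piece of first text information can be determined from the list on the basis of an information selection operation and taken as the first text information corresponding to the prompt tone. This approach is used because the user's pronunciation carries some error, so recognizing the user-defined prompt tone yields a series of possible texts; only by presenting these possible texts as a list from which the user taps the text actually intended can the prompt tone be set accurately. Once the first text information corresponding to the prompt tone has been determined in the first text information list, it can be saved for later use in matching the collected sound information against the prompt tone that was set.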
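The selection-and-save step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: no speech recognizer is called, the candidate list is hard-coded from the fan (first tone) example, and the names `enroll_prompt`, `choose`, `store` and `action` are hypothetical.

```python
from typing import Callable, Dict, Sequence


def enroll_prompt(
    candidates: Sequence[str],
    choose: Callable[[Sequence[str]], int],
    store: Dict[str, str],
    action: str = "next",
) -> str:
    """Let the user pick one candidate text and save it as the prompt for `action`.

    `candidates` stands in for the first text information list returned by a
    speech recognizer; `choose` stands in for the user's tap on one list entry.
    """
    if not candidates:
        raise ValueError("the recognizer returned no candidates")
    index = choose(candidates)
    if not 0 <= index < len(candidates):
        raise IndexError("selection is outside the candidate list")
    store[action] = candidates[index]  # the saved first text information
    return store[action]


# Hypothetical recognizer output for the utterance "fan" (first tone).
first_text_list = ["饭", "翻", "繁", "烦", "返", "帆", "反", "范"]
saved: Dict[str, str] = {}
chosen = enroll_prompt(first_text_list, lambda items: items.index("翻"), saved)
```

Keeping the choice in a dictionary keyed by action is only one possible "specific format" for saving; the source does not say how the text is stored.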
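It should be noted that, before the page-turning prompt tone is collected through the microphone, the language of the prompt tone can be set on the basis of a language selection operation. Again taking Android as an example, the language type can be passed as RecognizerIntent.EXTRA_LANGUAGE_MODE through the .putExtra() method. The language type is a language that the client's speech recognition system can recognize; it may be English or Chinese. It may be set at the factory, or the user may set it. For example, if the language type is set to English and the user sets the prompt tone in Chinese, android.speech.RecognizerIntent cannot recognize the Chinese sound information, and so the prompt tone cannot be set.
Because the embodiment of the present invention turns pages by recognizing the user's utterance, after the electronic reading application is started the microphone must be monitored to obtain the sound information it collects, so as to judge whether the sound information matches the page-turning prompt tone. The client obtaining the sound information collected by the microphone means the client receiving sound from the outside. Because clients generally have voice input and recognition functions, an Android client is used as the example in this embodiment. The process of recognizing the collected sound information is similar to the process described above for recognizing the prompt tone set by the user. For example, android.speech is the core package for voice input on Android, and android.speech.RecognizerIntent is one of its main classes; this activity receives voice input, recognizes the speech content and converts it into text. On a client running Android, therefore, once the electronic reading application has been started, the client can obtain the sound information collected by the microphone through android.speech.RecognizerIntent; that sound information may include both sound produced by the user and sound produced by the environment. It should be noted that, before the sound information collected by the microphone is obtained and recognized, the language type of the sound information can also be set. The language type may be the one initialized at the factory or one defined by the user while using the client. Specifically, taking an Android client as an example, the RecognizerIntent.EXTRA_LANGUAGE_MODE language type must also be passed in the .putExtra() method; it may be English or Chinese. For example, when the language type is set to Chinese, android.speech.RecognizerIntent recognizes only Chinese sound information and converts it into text.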
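After the sound information collected by the microphone has been obtained in the above manner, it must be confirmed whether the sound information matches the page-turning prompt tone set by the user; the page-turning operation corresponding to the prompt tone is triggered only when the match is confirmed. The confirmation can proceed as follows:
(1) Convert the sound information into a second text information list. The second text information list consists of different pieces of second text information whose pronunciation has a similarity relationship with the sound information, arranged in descending order of similarity.
Taking Android as an example, the conversion described here can be performed as follows: after the voice-input activity is started, android.speech.RecognizerIntent recognizes the sound information and converts it into text information, and the text information is received through the startActivityForResult() method to form the second text information list. The second text information list contains a series of different pieces of second text information, which have in common that their pronunciation is similar to that of the obtained sound information. For example, if the obtained sound information is pronounced fan (first tone), the second text information obtained by recognition and conversion includes 饭, 翻, 繁, 烦, 返, 帆, 反, 范 and so on. These pieces of second text information are arranged in the second text information list in descending order of the similarity of their pronunciation to the obtained sound information. In the example above, 翻 and 帆, pronounced in the first tone, are more similar than 烦 and 繁 (second tone), 返 and 反 (third tone), and 饭 and 范 (fourth tone), so the order in the second text information list is: 翻, 帆, 烦, 繁, 返, 反, 饭, 范.
(2) Judge whether the target text information set of the second text information list contains the first text information corresponding to the page-turning prompt tone. The target text information set consists of the TOP N (that is, the N highest-ranked) pieces of second text information in the second text information list, where N is a positive integer.
After the sound information has been converted into the second text information list, it must be judged whether the target text information set of that list contains the first text information corresponding to the page-turning prompt tone. The target text information set usually consists of the N highest-ranked pieces of second text information in the list, that is, the TOP N pieces, where N is a positive integer. N may be 1 or greater than 1; a value greater than 1 avoids mismatches between the sound information and the prompt tone caused by changes in the user's pronunciation or by one pronunciation being recognized as several results. For example, with the obtained sound information pronounced fan (first tone) as above, the TOP 3 pieces of second text information in the list can be taken as the target text information set, which then contains 翻, 帆 and 烦. To judge whether the obtained sound information matches the prompt tone set by the user, the target text information set containing 翻, 帆 and 烦 is searched for the first text information corresponding to the prompt tone.
(3) If the target text information set contains the first text information corresponding to the page-turning prompt tone, determine that the sound information matches the page-turning prompt tone.
Taking the target text information set containing 翻, 帆 and 烦 as an example: when the set contains the first text information corresponding to the prompt tone, the sound information can be determined to match the prompt tone; when the set does not contain it, the sound information can be determined not to match the prompt tone.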
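Steps (1) to (3) reduce to a membership test on the first N entries of a ranked list, which can be sketched as below. The sketch assumes the recognizer has already produced the ranked second text information list; `matches_prompt` and `top_n` are hypothetical names, and the default of 3 simply mirrors the TOP 3 example in the text.

```python
from typing import Sequence


def matches_prompt(second_text_list: Sequence[str], saved_prompt: str, top_n: int = 3) -> bool:
    """Return True if the saved first text information is among the TOP N candidates.

    `second_text_list` must already be sorted from most to least similar.
    """
    if top_n < 1:
        raise ValueError("top_n must be a positive integer")
    target_set = second_text_list[:top_n]  # the target text information set
    return saved_prompt in target_set


# Ranked candidates from the fan (first tone) example in the text.
ranked = ["翻", "帆", "烦", "繁", "返", "反", "饭", "范"]
```

With `top_n=1` only the best candidate can trigger a page turn; a larger value tolerates pronunciation drift at the cost of more accidental matches.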
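The above embodiment can decide whether to trigger the page-turning operation corresponding to the page-turning prompt tone by recognizing whether the sound information collected by the microphone matches the prompt tone set by the user. However, the sound information collected by the microphone usually includes both user sound and ambient sound; only the user sound is needed for matching the prompt tone, and the ambient sound is useless. When the microphone collects many kinds of sound information, recognizing every one of them according to the above embodiment makes recognition take longer and delays the page turn. To avoid recognizing too much useless sound information, the embodiment of the present invention provides an implementation in which, before judging whether the obtained sound information matches the prompt tone set by the user, the sound information corresponding to ambient sound is filtered out of the obtained sound information.
Specifically, the embodiment of the present invention provides two ways of filtering the sound information corresponding to ambient sound out of the obtained sound information:
(1) Remove, from the obtained sound information, any sound information whose volume is below a preset volume threshold.
When a user reads an electronic book, the source of the user's voice is usually closer to the client's microphone than the source of ambient sound, so the volume of distant ambient sound as picked up by the microphone will be much lower than the volume of user sound close to the microphone. Sound information whose volume is below the preset volume threshold can therefore be removed from the obtained sound information on the basis of volume. The preset volume threshold may be the average volume of the user sound recognized by the microphone. Obtained sound information whose volume is below the threshold can be regarded as useless ambient sound and need not be matched against the page-turning prompt tone, which greatly shortens the time needed to recognize the prompt tone in the obtained sound information and improves recognition efficiency.
(2) Look up, in a preset sound information library, the match success rate corresponding to the obtained sound information; if the match success rate is below a preset success-rate threshold, remove the obtained sound information, that is, discard it as sound information corresponding to ambient sound. After the sound information collected by the microphone has been obtained, its match success rate can be looked up in the preset sound information library. The library records the success rates with which previously recognized sounds matched user sound information. If the match success rate of the obtained sound information is below the preset success-rate threshold, the obtained sound information can be regarded as ambient sound and removed. If the library has no record of the obtained sound information, that sound information can be added to the library and tracking of its match success rate can begin. To limit the effect that computing match success rates in the library has on the processing of subsequent sound information, the computation can be performed in the background when the electronic book (reading application) is opened or closed. In this way, useless ambient sound can be removed before judging whether the obtained sound information matches the prompt tone set by the user, which greatly shortens the time needed to recognize the prompt tone in the obtained sound information and improves recognition efficiency.
Further, a prompt such as "The sound information just entered is invalid; please enter the sound information again" can be output so that the user re-enters the sound information in response.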
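The two filters above can be sketched together as follows. Several things here are assumptions made for illustration and are not stated in the source: volume is measured as the root-mean-square of normalized samples, each sound is identified in the library by a string id (how sounds are fingerprinted is left open), and a sound with no history is let through rather than discarded. All function and class names are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of one audio segment (0.0 for an empty segment)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def drop_quiet(segments: Sequence[Sequence[float]], volume_threshold: float) -> List[Sequence[float]]:
    """Filter (1): discard segments whose level is below the preset threshold."""
    return [seg for seg in segments if rms(seg) >= volume_threshold]


@dataclass
class MatchRecord:
    attempts: int = 0
    successes: int = 0

    @property
    def success_rate(self) -> float:
        return self.successes / self.attempts if self.attempts else 0.0


class SoundLibrary:
    """Filter (2): per-sound match success rates, keyed by a hypothetical sound id."""

    def __init__(self, rate_threshold: float) -> None:
        self.rate_threshold = rate_threshold
        self.records: Dict[str, MatchRecord] = {}

    def should_discard(self, sound_id: str) -> bool:
        record = self.records.get(sound_id)
        if record is None:
            # Not in the library yet: add it and start tracking its rate.
            self.records[sound_id] = MatchRecord()
            return False
        if record.attempts == 0:
            return False  # assumption: no history yet, so let it through
        return record.success_rate < self.rate_threshold

    def record_result(self, sound_id: str, matched: bool) -> None:
        """Update the statistics; the text suggests doing this in the background."""
        record = self.records.setdefault(sound_id, MatchRecord())
        record.attempts += 1
        if matched:
            record.successes += 1


loud, quiet = [0.5, -0.5, 0.5, -0.5], [0.01, -0.01]
kept = drop_quiet([loud, quiet], volume_threshold=0.1)

library = SoundLibrary(rate_threshold=0.5)
first_seen = library.should_discard("door-slam")
for _ in range(3):
    library.record_result("door-slam", matched=False)
for outcome in (True, True, False):
    library.record_result("user-voice", matched=outcome)
```

Both filters run before recognition, so a discarded segment never reaches the comparatively expensive speech-to-text step.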
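In actual reading, the user sometimes needs to turn several pages at once; if the prompt tone had to be recognized for every page, page turning would waste a lot of time. The embodiment of the present invention therefore also provides an implementation in which, when the page-turning prompt tone uttered by the user contains a number M, the corresponding page-turning operation is to turn M pages consecutively, where M is a positive integer. In addition, the page-turning prompt tones in each of the above embodiments include a previous-page prompt tone (turn back to the previous page) and a next-page prompt tone (turn on to the next page).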
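One way to turn a recognized utterance into a direction and a page count is sketched below. It assumes, beyond what the source states, that the recognizer returns the number as ASCII digits, that an utterance without a number means one page, and that the saved prompts are kept in a dictionary keyed by direction; `page_turn_command` and `apply_turn` are hypothetical names.

```python
import re
from typing import Dict, Optional, Tuple


def page_turn_command(recognized: str, prompts: Dict[str, str]) -> Optional[Tuple[str, int]]:
    """Map recognized text to (direction, page count), or None if no prompt is present."""
    for direction, prompt in prompts.items():
        if prompt in recognized:
            number = re.search(r"[0-9]+", recognized)
            count = int(number.group()) if number else 1  # no number: one page
            if count < 1:
                return None  # M must be a positive integer
            return direction, count
    return None


def apply_turn(current_page: int, total_pages: int, direction: str, count: int) -> int:
    """Return the new zero-based page index, clamped to the book's bounds."""
    step = count if direction == "next" else -count
    return max(0, min(total_pages - 1, current_page + step))


# Hypothetical saved prompts: one for each direction.
prompts = {"next": "翻", "previous": "返"}
```

For example, the recognized text "翻3" yields `("next", 3)`, and applying it on page 10 of a 100-page book gives page 13.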
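In addition, the user's sound information mentioned in the embodiments of the present invention may be the user's speech, and may also include sounds the user makes by clapping, slapping hands together and the like; any of these sounds can be used to set the page-turning prompt tone in the embodiments of the present invention.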
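Further, as an implementation of the method shown in FIG. 1, an embodiment of the present invention provides a page-turning device for an electronic book. As shown in FIG. 2, the device includes a collection unit 21, a saving unit 22 and a confirmation unit 23, wherein:
the collection unit 21 is used to collect, through a microphone, the page-turning prompt tone that is being set;
the saving unit 22 is used to save the collected page-turning prompt tone;
the collection unit 21 is further used to monitor the microphone and obtain the sound information collected by the microphone; and
the confirmation unit 23 is used to confirm that the sound information matches the page-turning prompt tone and then trigger the page-turning operation corresponding to the page-turning prompt tone.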
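Further, as shown in FIG. 3, the saving unit 22 includes:
a first conversion module 221, used to convert the page-turning prompt tone into a first text information list, the first text information list consisting of different pieces of first text information whose pronunciation has a similarity relationship with the page-turning prompt tone;
a first determination module 222, used to determine, on the basis of an information selection operation, one piece of first text information from the first text information list as the first text information corresponding to the page-turning prompt tone; and
a saving module 223, used to save the determined first text information.
Further, as shown in FIG. 3, the confirmation unit 23 includes:
a second conversion module 231, used to convert the sound information into a second text information list, the second text information list consisting of different pieces of second text information whose pronunciation has a similarity relationship with the sound information, arranged in descending order of similarity;
a judgment module 232, used to judge whether the target text information set of the second text information list contains the saved first text information, the target text information set consisting of the TOP N pieces of second text information in the second text information list, N being a positive integer; and
a second determination module 233, used to determine that the sound information matches the page-turning prompt tone when the judgment result is that the target text information set contains the saved first text information.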
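Further, as shown in FIG. 3, the device further includes a filtering unit 24, used to filter the sound information corresponding to ambient sound out of the obtained sound information before it is confirmed that the sound information matches the page-turning prompt tone.
Further, as shown in FIG. 3, the filtering unit 24 includes:
a first filtering module 241, used to remove, from the obtained sound information, any sound information whose volume is below a preset volume threshold.
Further, as shown in FIG. 3, the filtering unit 24 includes a second filtering module 242, used to:
look up, in a preset sound information library, the match success rate corresponding to the obtained sound information; and
remove the obtained sound information when the match success rate is below a preset success-rate threshold.
Further, the collection unit 21 is used to set the language of the page-turning prompt tone on the basis of a language selection operation before the prompt tone being set is collected through the microphone.
Further, when the page-turning prompt tone collected by the collection unit 21 contains a number M, the corresponding page-turning operation is to turn M pages consecutively, M being a positive integer.
Further, the page-turning prompt tones collected by the collection unit 21 include a previous-page prompt tone and a next-page prompt tone.
The page-turning device for an electronic book provided by the embodiment of the present invention can collect, through a microphone, the page-turning prompt tone set by the user and save the collected page-turning prompt tone; then, after the electronic reading application is started, the microphone is monitored to obtain the sound information it collects, and the page-turning operation corresponding to the page-turning prompt tone is triggered only when the sound information is confirmed to match the page-turning prompt tone. In the prior art, by contrast, a user who turns pages with the software's built-in page-turning prompt tone must imitate that tone, which makes pronunciation inconvenient for the user. Compared with existing page-turning modes, which inconvenience the user's reading, the present invention therefore lets the user define a custom page-turning prompt tone, so that pages can be turned easily by voice without deliberately imitating the software's built-in prompt tone.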
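In addition, an embodiment of the present invention also provides a hardware structure for the page-turning device for an electronic book; FIG. 4 is a block diagram of one such hardware structure.
As shown in FIG. 4, the electronic device includes a memory 401 and a processor 402. The memory 401 is used to store instructions, and the instructions are used to control the processor 402 to operate so as to perform the page-turning method for an electronic book according to the present invention.
The memory 401 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
The processor 402 may include, but is not limited to, a processing device such as a microprocessor (MCU), a digital signal processor (DSP), or a programmable logic device (FPGA).
The electronic device may further include an input device 404, a communication device 406, an interface device 403, a display device 405, and the like.
The communication device 406 is capable of, for example, wired or wireless communication.
The interface device 403 includes, for example, a USB interface, a network port, and the like.
The input device 404 may include, for example, a touch screen, buttons and the like for entering various information.
The display device 405 is, for example, a liquid crystal display, a touch display or the like for displaying the content of the electronic book.
Although several devices are shown in FIG. 4, the electronic device 400 of the present invention may involve only some of them, for example the processor 402, the memory 401, the display device 405, and the like.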
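An embodiment of the present invention also provides a computer readable storage medium. Optionally, in this embodiment, the storage medium may be used to store the program code executed by the page-turning method provided by the present invention.
The computer readable storage medium may be a tangible device that can hold and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of computer readable storage media include: a portable computer disk, a hard disk, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disk read only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as a punch card or a raised structure in a groove with instructions stored thereon, and any suitable combination of the foregoing. A computer readable storage medium as used herein is not to be interpreted as a transient signal per se, such as a radio wave or other freely propagating electromagnetic wave, an electromagnetic wave propagating through a waveguide or other transmission medium (for example, a light pulse through a fiber optic cable), or an electrical signal transmitted through a wire.
The computer readable program instructions described herein can be downloaded from a computer readable storage medium to the respective computing/processing devices, or downloaded to an external computer or external storage device via a network such as the Internet, a local area network, a wide area network and/or a wireless network. The network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards them for storage in a computer readable storage medium within the respective computing/processing device.
The computer program instructions for performing the page-turning method of the present invention may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk and C++ and conventional procedural programming languages such as the "C" language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, an electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA) or a programmable logic array (PLA), is customized by utilizing state information of the computer readable program instructions; the electronic circuit can execute the computer readable program instructions so as to implement various aspects of the present invention.
These computer readable program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer or other programmable data processing apparatus to produce a machine, such that the instructions, when executed by the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. These computer readable program instructions may also be stored in a computer readable storage medium; the instructions cause a computer, programmable data processing apparatus and/or other device to operate in a particular manner, so that the computer readable medium storing the instructions comprises an article of manufacture that includes instructions implementing aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device, causing a series of operational steps to be performed on the computer, other programmable data processing apparatus or other device to produce a computer-implemented process, so that the instructions executed on the computer, other programmable data processing apparatus or other device implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. In the above embodiments, the description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, reference may be made to the related descriptions of other embodiments.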
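It will be understood that related features of the above method and device may be referred to one another. In addition, "first", "second" and the like in the above embodiments are used to distinguish the embodiments and do not indicate that any embodiment is better or worse than another.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, devices and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
The algorithms and displays provided herein are not inherently related to any particular computer, virtual system or other device. Various general-purpose systems may also be used with the teachings herein. From the above description, the structure required to construct such systems is apparent. Furthermore, the present invention is not directed to any particular programming language. It should be understood that the content of the invention described herein can be implemented in a variety of programming languages, and that the above description of a specific language is given to disclose the best mode of the invention.
Numerous specific details are set forth in the specification provided here. It will be understood, however, that embodiments of the present invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail so as not to obscure the understanding of this specification.
Similarly, it should be understood that, in order to streamline the present disclosure and aid understanding of one or more of the various inventive aspects, in the above description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together into a single embodiment, figure, or description thereof. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single foregoing disclosed embodiment. The claims following the detailed description are therefore hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art will understand that the modules in the devices of an embodiment can be adaptively changed and placed in one or more devices different from that embodiment. The modules or units or components of an embodiment may be combined into one module or unit or component, and may furthermore be divided into a plurality of sub-modules or sub-units or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings), and all processes or units of any method or device so disclosed, may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by an alternative feature serving the same, an equivalent or a similar purpose.
Furthermore, those skilled in the art will understand that, although some embodiments described herein include certain features that are included in other embodiments and not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
The various component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will understand that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the title of the invention (for example, a device for determining the level of links within a website) according to embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the methods described herein. Such a program implementing the present invention may be stored on a computer readable medium or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several devices, several of these devices can be embodied by one and the same item of hardware. The use of the words first, second, third and so on does not indicate any order. These words can be interpreted as names.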

Claims (20)

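  1. A page-turning method for an electronic book, characterized in that the method comprises:
    collecting, through a microphone, a page-turning prompt tone that is being set;
    saving the collected page-turning prompt tone;
    monitoring the microphone to obtain sound information collected by the microphone; and
    upon confirming that the sound information matches the page-turning prompt tone, triggering the page-turning operation corresponding to the page-turning prompt tone.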
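  2. The method according to claim 1, characterized in that saving the collected page-turning prompt tone comprises:
    converting the page-turning prompt tone into a first text information list, the first text information list consisting of different pieces of first text information whose pronunciation has a similarity relationship with the page-turning prompt tone;
    determining, on the basis of an information selection operation, one piece of first text information from the first text information list as the first text information corresponding to the page-turning prompt tone; and
    saving the determined first text information.
  3. The method according to claim 2, characterized in that confirming that the sound information matches the page-turning prompt tone comprises:
    converting the sound information into a second text information list, the second text information list consisting of different pieces of second text information whose pronunciation has a similarity relationship with the sound information, the second text information being arranged in descending order of similarity;
    judging whether a target text information set of the second text information list contains the saved first text information, the target text information set consisting of the TOP N pieces of second text information in the second text information list, N being a positive integer; and
    if the judgment result is that the target text information set contains the saved first text information, determining that the sound information matches the page-turning prompt tone.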
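  4. The method according to any one of claims 1 to 3, characterized in that, before confirming that the sound information matches the page-turning prompt tone, the method further comprises:
    filtering the sound information corresponding to ambient sound out of the obtained sound information.
  5. The method according to claim 4, characterized in that filtering the sound information corresponding to ambient sound out of the obtained sound information comprises:
    removing, from the obtained sound information, any sound information whose volume is below a preset volume threshold.
  6. The method according to claim 4, characterized in that filtering the sound information corresponding to ambient sound out of the obtained sound information comprises:
    looking up, in a preset sound information library, the match success rate corresponding to the obtained sound information; and
    if the match success rate is below a preset success-rate threshold, removing the obtained sound information.
  7. The method according to any one of claims 1 to 6, characterized in that, before the page-turning prompt tone being set is collected through the microphone, the method further comprises:
    setting the language of the page-turning prompt tone on the basis of a language selection operation.
  8. The method according to any one of claims 1 to 7, characterized in that, when the page-turning prompt tone contains a number M, the page-turning operation corresponding to the page-turning prompt tone is to turn M pages consecutively, M being a positive integer.
  9. The method according to any one of claims 1 to 8, characterized in that the page-turning prompt tone includes a previous-page prompt tone and a next-page prompt tone.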
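  10. A page-turning device for an electronic book, characterized in that the device comprises:
    a collection unit, used to collect, through a microphone, a page-turning prompt tone that is being set;
    a saving unit, used to save the collected page-turning prompt tone;
    the collection unit being further used to monitor the microphone and obtain sound information collected by the microphone; and
    a confirmation unit, used to confirm that the sound information matches the page-turning prompt tone and then trigger the page-turning operation corresponding to the page-turning prompt tone.
  11. The device according to claim 10, characterized in that the saving unit comprises:
    a first conversion module, used to convert the page-turning prompt tone into a first text information list, the first text information list consisting of different pieces of first text information whose pronunciation has a similarity relationship with the page-turning prompt tone;
    a first determination module, used to determine, on the basis of an information selection operation, one piece of first text information from the first text information list as the first text information corresponding to the page-turning prompt tone; and
    a saving module, used to save the determined first text information.
  12. The device according to claim 11, characterized in that the confirmation unit comprises:
    a second conversion module, used to convert the sound information into a second text information list, the second text information list consisting of different pieces of second text information whose pronunciation has a similarity relationship with the sound information, the second text information being arranged in descending order of similarity;
    a judgment module, used to judge whether a target text information set of the second text information list contains the saved first text information, the target text information set consisting of the TOP N pieces of second text information in the second text information list, N being a positive integer; and
    a second determination module, used to determine that the sound information matches the page-turning prompt tone when the judgment result is that the target text information set contains the saved first text information.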
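  13. The device according to any one of claims 10 to 12, characterized in that the device further comprises a filtering unit, used to filter the sound information corresponding to ambient sound out of the obtained sound information before it is confirmed that the sound information matches the page-turning prompt tone.
  14. The device according to claim 13, characterized in that the filtering unit comprises:
    a first filtering module, used to remove, from the obtained sound information, any sound information whose volume is below a preset volume threshold.
  15. The device according to claim 13, characterized in that the filtering unit comprises a second filtering module, used to:
    look up, in a preset sound information library, the match success rate corresponding to the obtained sound information; and
    remove the obtained sound information when the match success rate is below a preset success-rate threshold.
  16. The device according to any one of claims 10 to 15, characterized in that the collection unit is further used to set the language of the page-turning prompt tone on the basis of a language selection operation before the page-turning prompt tone being set is collected through the microphone.
  17. The device according to any one of claims 10 to 16, characterized in that, when the page-turning prompt tone collected by the collection unit contains a number M, the page-turning operation corresponding to the page-turning prompt tone is to turn M pages consecutively, M being a positive integer.
  18. The device according to any one of claims 10 to 17, characterized in that the page-turning prompt tone collected by the collection unit includes a previous-page prompt tone and a next-page prompt tone.
  19. A page-turning device for an electronic book, comprising a memory and a processor, characterized in that the memory is used to store instructions, and the instructions are used to control the processor to operate so as to perform the method according to any one of claims 1 to 9.
  20. A computer readable storage medium, characterized in that it stores program code for performing the method according to any one of claims 1 to 9.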
PCT/CN2016/110696 2016-03-16 2016-12-19 Page-turning method and device for an electronic book WO2017157067A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610151571.4 2016-03-16
CN201610151571.4A CN107205076A (zh) 2016-03-16 2016-03-16 Page-turning method and device for an electronic book

Publications (1)

Publication Number Publication Date
WO2017157067A1 true WO2017157067A1 (zh) 2017-09-21

Family

ID=59851760

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/110696 WO2017157067A1 (zh) 2016-03-16 2016-12-19 Page-turning method and device for an electronic book

Country Status (2)

Country Link
CN (1) CN107205076A (zh)
WO (1) WO2017157067A1 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
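CN109213720A (zh) * 2018-08-16 2019-01-15 咪咕数字传媒有限公司 Electronic book page-turning method, device and storage medium
CN114115784A (zh) * 2021-11-30 2022-03-01 云知声智能科技股份有限公司 Smart-microphone-based control method and apparatus, electronic device and storage medium
CN114330277A (zh) * 2021-12-31 2022-04-12 北京字节跳动网络技术有限公司 Reading typesetting method, apparatus, device and storage medium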

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111506744B (zh) * 2020-04-07 2024-03-19 广东小天才科技有限公司 Point-reading method and terminal device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013062324A1 (en) * 2011-10-25 2013-05-02 Samsung Electronics Co., Ltd. Method for applying supplementary attribute information to e-book content and mobile device adapted thereto
CN103366740A (zh) * 2012-03-27 2013-10-23 联想(北京)有限公司 Voice command recognition method and device
US20140019133A1 (en) * 2012-07-12 2014-01-16 International Business Machines Corporation Data processing method, presentation method, and corresponding apparatuses
CN103543930A (zh) * 2012-07-13 2014-01-29 腾讯科技(深圳)有限公司 Electronic book operation control method and device
CN103605468A (zh) * 2013-11-14 2014-02-26 武汉虹翼信息有限公司 Electronic book control device and control interaction method thereof
CN103853703A (zh) * 2014-02-19 2014-06-11 联想(北京)有限公司 Information processing method and electronic device

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
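CN109213720A (zh) * 2018-08-16 2019-01-15 咪咕数字传媒有限公司 Electronic book page-turning method, device and storage medium
CN114115784A (zh) * 2021-11-30 2022-03-01 云知声智能科技股份有限公司 Smart-microphone-based control method and apparatus, electronic device and storage medium
CN114330277A (zh) * 2021-12-31 2022-04-12 北京字节跳动网络技术有限公司 Reading typesetting method, apparatus, device and storage medium
CN114330277B (zh) * 2021-12-31 2023-08-22 抖音视界有限公司 Reading typesetting method, apparatus, device and storage medium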

Also Published As

Publication number Publication date
CN107205076A (zh) 2017-09-26

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16894226

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16894226

Country of ref document: EP

Kind code of ref document: A1
