WO2022188022A1 - Hearing-based perception system and method for using same - Google Patents

Hearing-based perception system and method for using same Download PDF

Info

Publication number
WO2022188022A1
WO2022188022A1 PCT/CN2021/079689 CN2021079689W WO2022188022A1 WO 2022188022 A1 WO2022188022 A1 WO 2022188022A1 CN 2021079689 W CN2021079689 W CN 2021079689W WO 2022188022 A1 WO2022188022 A1 WO 2022188022A1
Authority
WO
WIPO (PCT)
Prior art keywords
auditory
information
user
instruction
perception system
Prior art date
Application number
PCT/CN2021/079689
Other languages
French (fr)
Chinese (zh)
Inventor
曹庆恒
Original Assignee
曹庆恒
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 曹庆恒 filed Critical 曹庆恒
Priority to PCT/CN2021/079689 priority Critical patent/WO2022188022A1/en
Priority to CN202180000425.0A priority patent/CN113196390B/en
Publication of WO2022188022A1 publication Critical patent/WO2022188022A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01DMEASURING NOT SPECIALLY ADAPTED FOR A SPECIFIC VARIABLE; ARRANGEMENTS FOR MEASURING TWO OR MORE VARIABLES NOT COVERED IN A SINGLE OTHER SUBCLASS; TARIFF METERING APPARATUS; MEASURING OR TESTING NOT OTHERWISE PROVIDED FOR
    • G01D21/00Measuring or testing not otherwise provided for
    • G01D21/02Measuring two or more variables by means not covered by a single other subclass
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Definitions

  • the present invention relates to the technical field of information and communication, in particular to an auditory-based perception system and a method for using the same.
  • Hearing is the most important way for humans to perceive external information besides vision.
  • the human auditory system uses the information in it to perceive the sound source, information, space, location and environment.
  • the use of hearing to perceive external information has become an important way for them to obtain information.
  • the main purpose of the present invention is to provide a perception system based on hearing and a method of using the same, which can help people better use hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people in low light conditions. Improve the efficiency of walking, finding objects, using computers, and smart devices/smart systems in an environment with
  • the present invention provides an auditory-based perception system, the system includes: a user interaction module, an information acquisition module and an analysis and processing module,
  • the user interaction module is used for receiving an instruction and feeding back the feedback information to the user as an auditory perception signal
  • the information acquisition module is used to acquire information, and the information is used for analysis and processing by the analysis processing module in combination with the instruction;
  • the analysis and processing module is configured to perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information.
  • the conversion of feedback information into auditory perception signals is completed by the user interaction module or the analysis processing module.
  • the auditory perception signal represents information through at least one of the frequency, rhythm, melody, interval, orientation, distance, size, height, length, and timbre of the sound.
  • the auditory perception signal includes a speech signal.
  • the user interaction module includes an instruction acquisition module and an auditory perception signal output module
  • the auditory perception signal output module includes at least one of an earphone, a bone conduction earphone, a speaker, a hearing aid, and a brain-computer interface. one.
  • the instruction acquisition module includes a voice recognition device, a sound recognition device, a gesture recognition device, a body motion recognition device, an expression recognition device, a body signal recognition device, a smart wearable device, a smart tablet, At least one of the mobile phone, mouse, keyboard, smart handle, smart cane, smart ring, and smart bracelet.
  • the information acquisition module includes an image sensor, a radar device, a radio frequency identification device, a positioning device, an audio acquisition device, an infrared device, an ultraviolet device, a laser scanner, a metal detector, a temperature sensor device, light sensing device, touch sensing device, air pressure sensor, water pressure sensor, olfactory recognition device, magnetic field detection device, wind detection device, humidity detection device, electric power detection device, speed detection device, altitude detection device, chemical analysis device, radiation detection at least one of the devices.
  • the system can also feed back information to the user through non-auditory signals, or feed back information to the user through a combination of auditory signals and non-auditory signals.
  • the auditory-based perception system further includes a data transmission module, and the data transmission module converts the instruction accepted by the user interaction module and the information acquired by the information acquisition module or analyzed.
  • the instructions and/or information processed by the processing module are sent to the network/system/server/smart device, and the server/network/system/smart device performs analysis and calculation according to the instruction and the information, executes the instruction and/or converts the The result is transmitted to the data transmission module.
  • the auditory-based perception system can also obtain information for analysis and processing in combination with instructions from the network/system/server/smart device through the data transmission module or the information acquisition module.
  • the auditory-based perception system as described above, which is used for assisting walking, assisting movement, sports training, navigation, assisting driving, assisting parking, positioning, location guidance, finding targets, reflecting pictures, reflecting objects, Detection, reconnaissance, exploration, design, maintenance, equipment use, device use, learning, teaching, shopping, office, social, games, entertainment, film and television, computer, health testing, disease diagnosis, surgical treatment, virtual concerts, virtual reality technology at least one of.
  • the above-mentioned auditory-based perception system as an auditory-based operating system, can be used alone or in combination with other systems to operate computers, artificial intelligence, smart devices, and virtual reality devices.
  • the present invention also provides a method for using an auditory-based perception system, the method comprising:
  • the feedback information is converted into an auditory perception signal and fed back to the user.
  • An auditory-based perception system and its using method of the present invention include: a user interaction module, an information acquisition module and an analysis and processing module.
  • the method includes: receiving a user instruction; acquiring information for analyzing and processing the instruction; performing analysis and calculation according to the instruction and the information, executing the instruction and/or obtaining feedback information; if there is feedback information , which converts the feedback information into auditory perception signals and feeds them back to the user.
  • the auditory-based perception system and its using method of the present invention can help people better use their hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people when there is insufficient light or poor light.
  • FIG. 1 is a schematic diagram of an auditory-based perception system according to the first embodiment of the present invention.
  • FIG. 2 is a method flowchart of a method for using an auditory-based perception system according to a second embodiment of the present invention.
  • FIG. 1 is a schematic diagram of an auditory-based perception system according to the first embodiment of the present invention.
  • the auditory-based perception system of the present invention includes: a user interaction module 10 , an information acquisition module 20 and an analysis processing module 30 .
  • the user interaction module 10 is used to receive the instruction and feedback the feedback information to the user as an auditory perception signal; the information acquisition module 20 is used to acquire information, and the information is used for the analysis processing module 30 to analyze and process the instruction; the analysis processing module 30 It is used for analyzing and calculating according to the instruction and the information, executing the instruction and/or obtaining feedback information.
  • the working flow of the auditory-based perception system of the present invention is:
  • the user interaction module 10 receives user instructions.
  • the user interaction module 10 includes an instruction acquisition module and an auditory perception signal output module.
  • the instruction obtaining module is used to obtain the instruction issued by the user.
  • Users can convert relevant information through voice, gestures, movements, expressions, body signals such as body temperature, heartbeat, blood pressure, breathing, etc. as instructions, or operate tablet computers, mobile phones, mice, keyboards, smart handles, smart canes, smart canes, etc.
  • the instruction acquisition module may include a voice recognition device, a voice recognition device, a gesture recognition device, a body motion recognition device, an expression recognition device, a body signal recognition device, smart wearable devices, One of a smart tablet, a mobile phone, a mouse, a keyboard, a smart handle, a smart walking stick, a smart finger ring, and a smart wristband, and may also include other devices suitable for receiving user instructions. Commands can also be issued at timed, periodically, or when certain conditions are triggered, depending on the settings.
  • the auditory perception signal output module is used to feed back information to the user through the auditory perception signal.
  • the auditory perception signal output module may include at least one of earphones, bone conduction earphones, speakers, hearing aids, and brain-computer interfaces, or other suitable devices. or a combination of related devices.
  • the auditory-based perception system of the present invention can also feed back information to the user through non-auditory signals, for example, it can feed back information to the user through a blind tablet, an intelligent blind handle, an intelligent blind mouse, etc., or through a combination of auditory signals and non-auditory signals. Feedback information to users.
  • the auditory perception signal refers to representing information through at least one of the characteristics of sound frequency, rhythm, melody, interval, orientation, distance, size, height, length, and timbre, etc., so that the user can use the signal to perceptual information.
  • spatial orientation information can be conveyed to the user through sound. Since the transmission characteristics of sound waves transmitted by a sound source to a specific orientation can be expressed as a function data set, this function data set representing the sound wave transmission characteristics can be used to process audio signals, so that the audio signals reflect the sound waves transmitted by the sound source to The transmission characteristics of this azimuth.
  • the sound shows the transmission characteristics of the sound source transmitting the sound wave to the azimuth, so that the user can feel the virtual sound source spatial azimuth.
  • the specific orientation may include the direction, location, height, and the like of the sound.
  • the information acquisition module 20 may include an image sensor, a radar device, a radio frequency identification device, a positioning device, an audio acquisition device, an infrared device, an ultraviolet device, a laser scanner, a metal detector, and a temperature sensing device.
  • light sensing device touch sensing device, air pressure sensor, water pressure sensor, olfactory recognition device, magnetic field detection device, wind detection device, humidity detection device, power detection device, speed detection device, altitude detection device, chemical analysis device, radiation detection device At least one of them may also be used by other suitable devices to obtain relevant information.
  • the analysis and processing module 30 performs analysis and calculation according to the instruction and the information, executes the instruction and/or obtains feedback information.
  • a three-dimensional space model is established for the blind walking instruction and the obtained information such as the user's current position, destination position, and obstacles.
  • the space model may also include information related to the time dimension.
  • the model may be a mapping entity A model of a scene/object; it can also be a model that maps a virtual scene/object, such as an operating system, an operation interface, a game, a virtual system, etc.; it can also be a combination of entity and virtual.
  • the feedback information is converted into an auditory perception signal, which is then fed back to the user by the user interaction module 10 .
  • Converting the feedback information into auditory perception signals is completed by the user interaction module 10 or the analysis processing module 30, and may also be completed by other devices or devices. For example, the route of the blind person walking and the guidance and reminders given according to the actual situation are converted into auditory perception signals and fed back to the user.
  • an auditory perception signal that the sound source is at the target position can be sent out, so that the user can perceive the location information through the auditory perception signal, so as to walk to the location.
  • the auditory perception signal is adjusted as the distance between the user and the target position changes, so that the user can walk to the position. It can be that every time the user moves the position, according to the change of the user's current position and the target position, the sound source sends out an auditory perception signal at the target position that is transmitted to the user's current position, so that the user can continuously perceive the target position during the movement process. Finally successfully reached the target position. If there are obstacles in the route, you can use other sound signals with different frequencies, rhythm, melody, interval, bearing, distance, size, height, length and timbre to send out the auditory perception of information such as obstacle location, distance, height, and danger. signal, so that the user can perceive the position, distance, height, danger and other information of the obstacle through the auditory perception signal, so as to bypass the obstacle.
  • the meaning of sound signals and their combinations of different frequencies, rhythms, melody, intervals, orientations, distances, sizes, heights, lengths and timbres used to represent destinations and obstacles, targets, objects, content, and their combinations can be preset, And it is a signal that the user has been trained to distinguish and understand the meaning of, and it can also be an existing sound signal that can contain information, such as a speech signal of an existing language or other regular sound signals.
  • an auditory-based perception system can set and/or train the definitions of sound signals of different frequencies, rhythms, melody, intervals, orientations, distances, sizes, pitches, lengths, and timbres, such as object definition, target definition, orientation Definition, definition of distance, definition of color, definition of temperature, definition of high and low, definition of warning/danger signal, definition of operation signal, definition of operation result, etc.
  • the auditory-based perception system of the present invention can also feed back information to the user by combining the auditory perception signal with other signals, for example, the auditory perception signal can be combined with the Braille signal to feed back information.
  • the auditory-based perception system of the present invention can be used for various purposes, such as walking, finding objects, games, computers, virtual concerts, virtual reality technology, etc.
  • amblyopia, myopia, hyperopia, presbyopia, eye fatigue, etc. or when it is inconvenient to watch carefully, such as driving, you can easily use the computer based on the auditory perception system to paint, play, compose, write, work, and study.
  • Etc. enhance the importance of hearing in these fields, enhance the effect of people's use of auditory perception information, so that humans, especially the blind, can live more conveniently.
  • Step 1 The user issues an instruction to find the object.
  • the user may issue an instruction through voice, and the system acquires the user's voice instruction through the microphone worn on the user and recognizes the instruction. Users can also issue commands through gestures, actions, or other means.
  • the user's instruction can be obtained through a related device such as a camera.
  • Step 2 After the system obtains the user instruction, it obtains the relevant information through the information obtaining module 20 . First, find the item you are looking for, and determine the location of the item; second, determine the current location of the user; and then obtain the surrounding environment information.
  • the image data in the space can be obtained from different angles through multiple image sensors for spatial modeling and position calculation.
  • the image sensor can be set in a suitable position in the room, or can be worn on the user's body, for example: the image can be The sensor, the microphone for obtaining instructions, and the earphone for feeding back the auditory perception signal to the user are integrated into a head-worn portable device, which is worn on the user's head.
  • Step 3 Perform analysis and calculation according to the instruction and the information to obtain feedback information.
  • a spatial model is established through the acquired information, mainly through multi-angle spatial image data, as well as the position information and related size of the objects in the space. It is also possible to obtain the established spatial model through the network or other means, or to modify the existing spatial model to obtain a new spatial model that conforms to the actual situation. After that, use the space model and the location information of users and items to plan a suitable fetching path.
  • the acoustic model is established according to the spatial model, and the correlation function of the acoustic wave transmission is obtained, which is used to calculate the auditory perception signal.
  • the beam tracking algorithm can be used to establish the acoustic model, calculate the intersection of the relevant beam and the space, and obtain the correlation function of the sound wave transmission.
  • the audio signal processed by this correlation function is converted into sound through the playback device, the sound is The transmission characteristics of the sound wave transmitted by the sound source to the azimuth are displayed, so that the user can feel the virtual sound source spatial azimuth.
  • the feedback information is converted into auditory perception signals and fed back to the user.
  • a virtual sound source is produced at the location of the item, and after processing the correlation function of sound wave transmission, the auditory perception signal is obtained, which is output to the user through the earphone. If the planned fetching path is not a straight path, the path can be divided into multiple straight segments, and then, a sound source is virtualized at the end point of the first straight segment, and the auditory perception signal is calculated and fed back to the user. After the end point of each straight line segment, enter the second straight line segment.
  • a sound source is virtualized at the end point of the second straight line segment, and the auditory perception signal is calculated and fed back to the user to guide the user to the point, and so on until reaching the The location of the item being sought.
  • the corresponding auditory perception signal can be set and fed back to the user, so that the user can be continuously corrected and reminded.
  • the auditory-based perception system of the present invention can also be used for virtual concerts.
  • the scene of the virtual concert venue the acoustic model is established.
  • each instrument, part, etc. is virtualized into different sound sources, and these sound sources can be located in different positions.
  • the correlation functions of the sound wave transmission from different sound sources to the user's location are calculated separately.
  • the music generated by each virtual sound source is processed by the correlation function of sound wave transmission and then superimposed to obtain the final auditory perception signal of the concert and output to the user.
  • the correlation function for calculating acoustic wave transmission may be a set of head related transfer function data (Head Related Transfer Function, HRTF), a set of interaural time difference data (Interaural Time Difference, ITD), and a set of interaural intensity difference data (IID) Any appropriate set of data that can characterize the transmission characteristics of sound waves emitted by a sound source to a certain azimuth.
  • ITD refers to the time difference between the sound signal reaching both ears due to the distance difference between the sound source and the left and right ears.
  • IID refers to the difference in intensity of the acoustic signal when it reaches both ears due to the difference in the distance between the sound source and the left and right ears.
  • Both ITD and IID are functions of sound source location and sound wave frequency.
  • HRTF is the acoustic transfer function from the sound source to both ears in the free field, which is used to describe the characteristic changes that occur when the sound wave emitted by the sound source in the free sound field is incident at a certain point in the ear canal at a certain angle.
  • HRTF is a function of the location of the sound source, the frequency of the sound wave, and the shape and properties of the body surface.
  • the unit impulse response from the sound source to the anthropometric point is called the Head Related Impulse Response (HRIR).
  • HRTF is the Fourier transform of HRIR.
  • the audio signals can respectively represent the sound waves transmitted by the sound source to multiple specific azimuths. Azimuth transmission characteristics.
  • a virtual auditory environment can be constructed. On this basis, if the user's real physical orientation is projected as a specific orientation in the virtual auditory environment, the difference between the user's different real physical orientations and the different specific orientations in the virtual auditory environment will occur. By establishing a corresponding relationship between them, users can hear sound effects that are consistent with their own physical orientations according to their own physical orientations.
  • It can be set to allow the user to move the position.
  • the correlation function of the sound wave transmission of different sound sources is recalculated.
  • the user can really enjoy the concert on the spot as if he were there. Users can also issue instructions to adjust the sounding position, volume, etc. of certain instruments and parts, and enjoy their own concerts at will.
  • the auditory-based perception system as described above, which is used for assisting walking, assisting movement, sports training, navigation, assisting driving, assisting parking, positioning, location guidance, finding targets, reflecting pictures, reflecting objects, Detection, reconnaissance, exploration, design, maintenance, equipment use, device use, learning, teaching, shopping, office, social, games, entertainment, film and television, computer, health testing, disease diagnosis, surgical treatment, virtual concerts, virtual reality technology at least one of. For example: in the process of walking, remind the user of the route, the change of road conditions and the specific location of the obstacle through the position, distance, signal type, etc.
  • the auditory signal in sports training, through the auditory signal to remind the athlete whether the angle and distance of the action meet the training requirements Or guide the athlete’s actions; in assisted driving, the driver/pilot’s route or route and the location and distance of related objects can be reminded through auditory signals; position guidance can enable users to accurately grasp the target position through auditory signals, such as when inserting a key into the keyhole , the auditory signal can reflect the relative orientation and distance between the keyhole and the key, and it can be quickly aligned even if it is invisible/invisible; the target can be found by radar or infrared equipment.
  • the auditory signal feeds back the relative orientation and distance of the target/person to the user; when it is used to reflect the picture, the picture to be reflected can be obtained through the image sensor, and then the image recognition software parses the picture content into images such as points, lines, graphics, colors, etc.
  • the system converts the relevant information elements and their position information into auditory signals, so that the user can receive the relevant information of the screen according to the auditory signal, or the system can feedback the screen information of the position through the auditory information according to the position specified by the user. For the user, with the movement of the user-specified position in different positions on the screen, the system can transmit the screen information to the user through auditory information.
  • the screen here can be a real screen or a virtual screen stored in the system; When designing, it can increase the information feedback of the designer in terms of space, and have a more three-dimensional and intuitive feeling about the scheme involved.
  • the above application can be achieved by the auditory-based perception system of the present invention alone or in combination with other systems and devices.
  • the auditory-based perception system of the present invention the above-mentioned auditory-based perception system, as an auditory-based operating system, the auditory-based perception system can be used alone or in combination with other systems for operating computers, artificial intelligence, and smart devices. , virtual reality device or other suitable device.
  • Existing computer systems usually use a video operation interface. For blind people or people with poor eyesight, or when ordinary people use it at a distance, the information fed back by the computer system cannot be well transmitted to users.
  • the existing visual-based interface is limited in the amount of information and the form of information, and many times, it cannot fully, vividly and accurately accept instructions and feedback information.
  • the feedback information is converted into auditory perception signals and fed back to the user, and the user's instructions can be received in multiple dimensions, so that the blind or poor-sighted people, or ordinary people can easily obtain the information from a long distance.
  • the information fed back by the computer system can also increase the way of existing computer interaction and enhance the effect of interaction, reduce the difficulty of relevant personnel using the computer system and control equipment, and improve the effect of computer use.
  • the computer system based on the auditory perception system can realize the operation of position, route, quantity, size, temperature, time, degree, shape, state, and object, as well as the existing computer system, and can also include object recognition/discrimination/expansion, object Movement of virtual locations, object modification/deletion, generation, alteration, etc. It is also possible to use the auditory-based perception system of the present invention to control the device to complete the operation of the target object, for example: realize the operation of manipulators, robots, smart furniture, unmanned vehicles, drones through the auditory-based perception system of the present invention. , electronic paper books, etc.
  • a computer system combined with an auditory-based perception system can increase the spatial dimension of information and other dimensions of information that can be carried by hearing on the basis of existing computer applications, greatly improving the application efficiency and use experience of computers.
  • the auditory-based perception system of the present invention may further include a data transmission module, the data transmission module transmits the instructions received by the user interaction module and the information acquired by the information acquisition module or the instructions processed by the analysis processing module and/or Or the information is sent to the server/network/system/smart device, the server/network/system/smart device performs analysis and calculation according to the instruction and the information, executes the instruction and/or transmits the result to the data transmission module.
  • Specific networks/systems/smart devices include: Internet, Internet of Things, satellite networks, local area networks, smart office systems, smart home systems, smart phones, smart TVs, smart cars, smart roads, smart cities, drones, smart robots, smart Kitchen, smart clothing, etc.
  • the data transmission module the data is transmitted to the server/network/system/smart device for analysis and calculation, which can increase the data processing capability of the auditory-based perception system of the present invention, and can also expand the application range of the auditory-based perception system of the present invention.
  • the calculation amount of the analysis processing module can be reduced, the hardware requirement for the analysis processing module can be reduced, and the cost and weight of the auditory-based perception system of the present invention can be reduced.
  • the auditory-based perception system of the present invention can also obtain information from the Internet, the Internet of Things, or other information systems, servers, and smart devices through a data transmission module or an information acquisition module, for analysis and calculation in combination with instructions.
  • Specific sources of information can include: Internet, Internet of Things, satellite networks, local area networks, smart office systems, smart home systems, smart phones, smart speakers, smart cars, smart roads, smart cities, drones, smart robots, smart kitchens, smart Clothing, smart glasses, etc.
  • the model can be established by establishing a complete model by a single system, or by establishing a part of the model by multiple systems and devices based on unified signal/information standards, and then by one or more of the systems.
  • servers can be integrated into a complete set of models.
  • the information needed to build the model can be obtained through smart appliances, smart furniture, smart houses (homes, wards, hospitals, schools, factories), smart roads, smart phones, smart speakers, smart cars, smart city systems, image sensors, A positioning device and an audio acquisition device are used to obtain it.
  • FIG. 2 is a method flowchart of a method for using an auditory-based perception system according to a second embodiment of the present invention.
  • the use method of the auditory-based perception system of the present invention includes:
  • S3 perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information
  • the method of using an auditory-based perception system of the present invention corresponds to the technical features of an auditory-based perception system of the present invention. Reference can be made to the foregoing description of the auditory-based perception system, which will not be repeated here.
  • an auditory-based perception system and a method for using the same of the present invention include: a user interaction module, an information acquisition module, and an analysis and processing module.
  • the method includes: receiving a user instruction; acquiring information for analyzing and processing the instruction; performing analysis and calculation according to the instruction and the information, executing the instruction and/or obtaining feedback information; if there is feedback information , which converts the feedback information into auditory perception signals and feeds them back to the user.
  • the auditory-based perception system and its using method of the present invention can help people better use their hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people when there is insufficient light or poor light.

Abstract

A hearing-based perception system and method for using same. The system comprises: a user interaction module (10), an information acquisition module (20), and an analysis and processing module (30). The method comprises: receiving a user instruction (S1); acquiring information for analysis and processing in combination with the instruction (S2); performing analysis and calculation according to the instruction and the information, executing the instruction and/or obtaining feedback information (S3); and if there is feedback information, converting the feedback information into an auditory perception signal and feeding same back to a user (S4). The hearing-based perception system and the method for using same can help people better use hearing to perceive information from the outside world, enhances the perception effect, and can increase the efficiency of walking, looking for an object, using a computer and smart device/smart system, etc. when a user is in conditions such as low light, poor light, too strong light, amblyopia, myopia, hyperopia, presbyopia, and eye fatigue, or when it is inconvenient for the user to look carefully, such as during driving.

Description

一种基于听觉的感知系统及其使用方法An auditory-based perception system and method of using the same 技术领域technical field
本发明涉及信息通讯技术领域,特别是涉及一种基于听觉的感知系统及其使用方法。The present invention relates to the technical field of information and communication, in particular to an auditory-based perception system and a method for using the same.
背景技术Background technique
听觉是人类除视觉外最重要的感知外界信息的途径。人的听觉系统听到声波后,利用其中的信息产生对声源、信息、空间、定位及环境进行感知。尤其对于盲人来说,由于无法利用视觉来感知外界信息,因此利用听觉来感知外界信息成为他们重要的获取信息的途径。Hearing is the most important way for humans to perceive external information besides vision. After hearing the sound waves, the human auditory system uses the information in it to perceive the sound source, information, space, location and environment. Especially for blind people, because they cannot use vision to perceive external information, the use of hearing to perceive external information has become an important way for them to obtain information.
因此,迫切需要一种基于听觉的感知系统,能够帮助人们更好的利用听觉来感知外界的信息,增强感知的效果和效率。Therefore, there is an urgent need for a perception system based on hearing, which can help people better use hearing to perceive external information and enhance the effect and efficiency of perception.
发明内容SUMMARY OF THE INVENTION
本发明的主要目的是:提供一种基于听觉的感知系统及其使用方法,能够帮助人们更好的利用听觉来感知外界的信息,增强感知的效果,并可以帮助盲人或者帮助正常人在光线不足的环境中提升行走、寻物、使用计算机以及智能设备/智能系统等的效率。The main purpose of the present invention is to provide a perception system based on hearing and a method of using the same, which can help people better use hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people in low light conditions. Improve the efficiency of walking, finding objects, using computers, and smart devices/smart systems in an environment with
为实现上述目的,本发明提供了一种基于听觉的感知系统,所述系统包括:用户交互模块、信息获取模块和分析处理模块,In order to achieve the above object, the present invention provides an auditory-based perception system, the system includes: a user interaction module, an information acquisition module and an analysis and processing module,
所述用户交互模块用于接收指令并将反馈的信息以听觉感知信号反馈给用户;The user interaction module is used for receiving an instruction and feeding back the feedback information to the user as an auditory perception signal;
所述信息获取模块用于获取信息,所述信息用于供分析处理模块结合指令进行分析处理;The information acquisition module is used to acquire information, and the information is used for analysis and processing by the analysis processing module in combination with the instruction;
所述分析处理模块用于根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息。The analysis and processing module is configured to perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information.
如上所述的基于听觉的感知系统,将反馈的信息转换为听觉感知信号是由用户交互模块或分析处理模块完成。In the above-mentioned auditory-based perception system, the conversion of feedback information into auditory perception signals is completed by the user interaction module or the analysis processing module.
如上所述的基于听觉的感知系统,所述听觉感知信号是通过声音的频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色中的至少一项来表示信息。In the above-mentioned auditory-based perception system, the auditory perception signal represents information through at least one of the frequency, rhythm, melody, interval, orientation, distance, size, height, length, and timbre of the sound.
如上所述的基于听觉的感知系统,所述听觉感知信号包括语音信号。In the auditory-based perception system as described above, the auditory perception signal includes a speech signal.
如上所述的基于听觉的感知系统,所述用户交互模块包括指令获取模块和听觉感知信号输出模块,所述听觉感知信号输出模块包括耳机、骨传导耳机、扬声器、助听器、脑机接口中的至少一项。In the above-mentioned hearing-based perception system, the user interaction module includes an instruction acquisition module and an auditory perception signal output module, and the auditory perception signal output module includes at least one of an earphone, a bone conduction earphone, a speaker, a hearing aid, and a brain-computer interface. one.
如上所述的基于听觉的感知系统,所述指令获取模块包括语音识别装置、声音识别装置、手势识别装置、肢体动作识别装置、表情识别装置、身体信号识别装置、智能可穿戴设备、智能平板、手机、鼠标、键盘、智能手柄、智能手杖、智能指环、智能手环中的至少一项。In the above-mentioned hearing-based perception system, the instruction acquisition module includes a voice recognition device, a sound recognition device, a gesture recognition device, a body motion recognition device, an expression recognition device, a body signal recognition device, a smart wearable device, a smart tablet, At least one of the mobile phone, mouse, keyboard, smart handle, smart cane, smart ring, and smart bracelet.
如上所述的基于听觉的感知系统,所述信息获取模块包括图像传感器、雷达装置、无线射频识别装置、定位装置、音频获取装置、红外装置、紫外装置、激光扫描器、金属探测器、温感装置、光感装置、触感装置、气压传感器、水压传感器、嗅觉识别装置、磁场探测装置、风力探测装置、湿度探测装置、电力探测装置、速度探测装置、高度探测装置、化学分析装置、放射线探测装置中的至少一个。In the above-mentioned hearing-based perception system, the information acquisition module includes an image sensor, a radar device, a radio frequency identification device, a positioning device, an audio acquisition device, an infrared device, an ultraviolet device, a laser scanner, a metal detector, a temperature sensor device, light sensing device, touch sensing device, air pressure sensor, water pressure sensor, olfactory recognition device, magnetic field detection device, wind detection device, humidity detection device, electric power detection device, speed detection device, altitude detection device, chemical analysis device, radiation detection at least one of the devices.
如上所述的基于听觉的感知系统,所述系统还能通过非听觉信号向用户反馈信息,或通过听觉信号及非听觉信号联合向用户反馈信息。With the above-mentioned auditory-based perception system, the system can also feed back information to the user through non-auditory signals, or feed back information to the user through a combination of auditory signals and non-auditory signals.
如上所述的基于听觉的感知系统,所述基于听觉的感知系统还包括数据传输模块,所述数据传输模块将所述用户交互模块接受的指令和所述信息获取模块获取的信息或者是经分析处理模块处理的指令和/或信息发送至网络/系统/服务器/智能设备,所述服务器/网络/系统/智能设备根据所述指令和所述信息进行分析计算,执行所述指令和/或将结果传输至所述数据传输模块。The above-mentioned auditory-based perception system, the auditory-based perception system further includes a data transmission module, and the data transmission module converts the instruction accepted by the user interaction module and the information acquired by the information acquisition module or analyzed. The instructions and/or information processed by the processing module are sent to the network/system/server/smart device, and the server/network/system/smart device performs analysis and calculation according to the instruction and the information, executes the instruction and/or converts the The result is transmitted to the data transmission module.
如上所述的基于听觉的感知系统,所述基于听觉的感知系统还能通过所述数据传输模块或者信息获取模块从网络/系统/服务器/智能设备获取用于结合指令进行分析处理的信息。In the above-mentioned auditory-based perception system, the auditory-based perception system can also obtain information for analysis and processing in combination with instructions from the network/system/server/smart device through the data transmission module or the information acquisition module.
如上所述的基于听觉的感知系统,所述基于听觉的感知系统用于辅助行走、辅助运动、运动训练、导航、辅助驾驶、辅助停车、定位、位置引导、发现目标、反映画面、反映物体、探测、侦查、勘探、设计、维修、设备使用、装置使用、学习、教学、购物、办公、社交、游戏、娱乐、影视、计算机、健康测试、疾病诊断、手术治疗、虚拟音乐会、虚拟现实技术中的至少一项。The auditory-based perception system as described above, which is used for assisting walking, assisting movement, sports training, navigation, assisting driving, assisting parking, positioning, location guidance, finding targets, reflecting pictures, reflecting objects, Detection, reconnaissance, exploration, design, maintenance, equipment use, device use, learning, teaching, shopping, office, social, games, entertainment, film and television, computer, health testing, disease diagnosis, surgical treatment, virtual concerts, virtual reality technology at least one of.
如上所述的基于听觉的感知系统,所述基于听觉的感知系统作为基于听觉的操作系统可单独或与其他系统结合,用于操作计算机、人工智能、智能设备、虚拟现实设备。The above-mentioned auditory-based perception system, as an auditory-based operating system, can be used alone or in combination with other systems to operate computers, artificial intelligence, smart devices, and virtual reality devices.
本发明还提供一种基于听觉的感知系统的使用方法,所述方法包括:The present invention also provides a method for using an auditory-based perception system, the method comprising:
接收用户指令;receive user instructions;
获取用于结合指令进行分析处理的信息;Obtain information for analysis and processing combined with instructions;
根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息;Perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information;
如果有反馈的信息,将反馈的信息转化为听觉感知信号反馈给用户。If there is feedback information, the feedback information is converted into an auditory perception signal and fed back to the user.
本发明的一种基于听觉的感知系统及其使用方法,所述系统包括:用户交互模块、信息获取模块和分析处理模块。所述方法包括:接收用户指令;获取用于结合指令进行分析处理的信息;根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息;如果有反馈的信息,将反馈的信息转化为听觉感知信号反馈给用户。通过本发明的一种基于听觉的感知系统及其使用方法,能够帮助人们更好的利用听觉来感知外界的信息,增强感知的效果,并可以帮助盲人或者帮助正常人在光线不足、光线不佳、光线太强、弱视、近视、远视、老花、眼疲劳等环境中,或者是不方便仔细观看的时候如驾驶时,提升行走、寻物、使用计算机以及智能设备/智能系统等的效率。An auditory-based perception system and its using method of the present invention include: a user interaction module, an information acquisition module and an analysis and processing module. The method includes: receiving a user instruction; acquiring information for analyzing and processing the instruction; performing analysis and calculation according to the instruction and the information, executing the instruction and/or obtaining feedback information; if there is feedback information , which converts the feedback information into auditory perception signals and feeds them back to the user. The auditory-based perception system and its using method of the present invention can help people better use their hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people when there is insufficient light or poor light. , too strong light, amblyopia, myopia, hyperopia, presbyopia, eye fatigue and other environments, or when it is inconvenient to watch carefully, such as when driving, to improve the efficiency of walking, finding objects, using computers, and smart devices/intelligent systems.
附图说明Description of drawings
图1为本发明第一实施例一种基于听觉的感知系统的示意图。FIG. 1 is a schematic diagram of an auditory-based perception system according to the first embodiment of the present invention.
图2为本发明第二实施例一种基于听觉的感知系统的使用方法的方法流程图。FIG. 2 is a method flowchart of a method for using an auditory-based perception system according to a second embodiment of the present invention.
具体实施方式Detailed ways
为进一步阐述本发明达成预定目的所采取的技术手段及功效,以下结合附图及实施例,对本发明的具体实施方式,详细说明如下。In order to further illustrate the technical means and effects adopted by the present invention to achieve the predetermined purpose, the specific embodiments of the present invention are described in detail below in conjunction with the accompanying drawings and embodiments.
本发明第一实施例参阅图1。图1为本发明第一实施例一种基于听觉的感知系统的示意图。如图所示,本发明的基于听觉的感知系统包括:用户交互模块10、信息获取模块20和分析处理模块30。Refer to FIG. 1 for the first embodiment of the present invention. FIG. 1 is a schematic diagram of an auditory-based perception system according to the first embodiment of the present invention. As shown in the figure, the auditory-based perception system of the present invention includes: a user interaction module 10 , an information acquisition module 20 and an analysis processing module 30 .
用户交互模块10用于接收指令并将反馈的信息以听觉感知信号反馈给用户;信息获取模块20用于获取信息,所述信息用于供分析处理模块30对指令进行分析处理;分析处理模块30用于根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息。The user interaction module 10 is used to receive the instruction and feedback the feedback information to the user as an auditory perception signal; the information acquisition module 20 is used to acquire information, and the information is used for the analysis processing module 30 to analyze and process the instruction; the analysis processing module 30 It is used for analyzing and calculating according to the instruction and the information, executing the instruction and/or obtaining feedback information.
本发明的基于听觉的感知系统的工作的流程是:The working flow of the auditory-based perception system of the present invention is:
首先,用户交互模块10接收用户指令。在本发明中,用户交互模块10包括指令获取模块和听觉感知信号输出模块。指令获取模块用于获取用户发出的指令。用户可以通过语音、手势、动作、表情、身体信号如体温、心跳、血压、呼吸等进行相关信息转换后作为指令,或通过操作平板电脑、手机、鼠标、键盘、智能手柄、智能手杖、智能可穿戴设备等,或其他适合的方式发出指令,因此,指令获取模块可以包括语音识别装置、声音识别装置、手势识别装置、肢体动作识别装置、表情识别装置、身体信号识别装置、智能可穿戴设备、智能平板、手机、鼠标、键盘、智能手柄、智能手杖、智能指环、智能手环中的一项,还可以包括其他适合接收用户指令的装置。指令还可以 根据设置,定时、定期、触发特定条件时发出。听觉感知信号输出模块用于将信息通过听觉感知信号反馈给用户,听觉感知信号输出模块可以包括耳机、骨传导耳机、扬声器、助听器、脑机接口中的至少一项,也可以是其他适合的装置或者相关装置的组合。本发明的基于听觉的感知系统也可以通过非听觉信号向用户反馈信息,例如:可以通过盲人平板、智能盲人手柄、智能盲人鼠标等向用户反馈信息,也可以是通过听觉信号及非听觉信号联合向用户反馈信息。First, the user interaction module 10 receives user instructions. In the present invention, the user interaction module 10 includes an instruction acquisition module and an auditory perception signal output module. The instruction obtaining module is used to obtain the instruction issued by the user. Users can convert relevant information through voice, gestures, movements, expressions, body signals such as body temperature, heartbeat, blood pressure, breathing, etc. as instructions, or operate tablet computers, mobile phones, mice, keyboards, smart handles, smart canes, smart canes, etc. Wearable devices, etc., or other suitable ways to issue instructions, therefore, the instruction acquisition module may include a voice recognition device, a voice recognition device, a gesture recognition device, a body motion recognition device, an expression recognition device, a body signal recognition device, smart wearable devices, One of a smart tablet, a mobile phone, a mouse, a keyboard, a smart handle, a smart walking stick, a smart finger ring, and a smart wristband, and may also include other devices suitable for receiving user instructions. Commands can also be issued at timed, periodically, or when certain conditions are triggered, depending on the settings. The auditory perception signal output module is used to feed back information to the user through the auditory perception signal. The auditory perception signal output module may include at least one of earphones, bone conduction earphones, speakers, hearing aids, and brain-computer interfaces, or other suitable devices. or a combination of related devices. The auditory-based perception system of the present invention can also feed back information to the user through non-auditory signals, for example, it can feed back information to the user through a blind tablet, an intelligent blind handle, an intelligent blind mouse, etc., or through a combination of auditory signals and non-auditory signals. Feedback information to users.
在本发明中,听觉感知信号是指通过声音的频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色等特征中的至少一项来表示信息,使用户能够通过该信号来感知信息。例如:可以通过声音向用户传递空间方位信息。由于声源传输声波至某一具体方位的传输特性可以被表达为函数数据集合,这种表征声波传输特性的函数数据集合可被用于处理音频信号,使音频信号体现所述声源传输声波至该方位的传输特性。当这种经过处理的音频信号经由播放设备转化为声音时,该声音即表现出所述声源传输声波至该方位的传输特性,使用户能够感受到虚拟的声源空间方位。具体方位可以包括声音的方向、位置、高低等等。In the present invention, the auditory perception signal refers to representing information through at least one of the characteristics of sound frequency, rhythm, melody, interval, orientation, distance, size, height, length, and timbre, etc., so that the user can use the signal to perceptual information. For example, spatial orientation information can be conveyed to the user through sound. Since the transmission characteristics of sound waves transmitted by a sound source to a specific orientation can be expressed as a function data set, this function data set representing the sound wave transmission characteristics can be used to process audio signals, so that the audio signals reflect the sound waves transmitted by the sound source to The transmission characteristics of this azimuth. When the processed audio signal is converted into sound through the playback device, the sound shows the transmission characteristics of the sound source transmitting the sound wave to the azimuth, so that the user can feel the virtual sound source spatial azimuth. The specific orientation may include the direction, location, height, and the like of the sound.
接下来,需要获取用于结合指令进行分析处理的信息。对用户指令进行分析处理可能会用到一些信息,例如:用户指令是盲人行走至某处地点,那么,首先需要获取用户当前位置和目的地位置,还需要获取路途中的障碍物信息,有了充足的信息之后,才能针对用户指令 分析处理得出适当的反馈。本发明的基于听觉的感知系统,信息获取模块20可以包括图像传感器、雷达装置、无线射频识别装置、定位装置、音频获取装置、红外装置、紫外装置、激光扫描器、金属探测器、温感装置、光感装置、触感装置、气压传感器、水压传感器、嗅觉识别装置、磁场探测装置、风力探测装置、湿度探测装置、电力探测装置、速度探测装置、高度探测装置、化学分析装置、放射线探测装置中的至少一个,也可以是其他适合的装置用来获取相关信息。Next, it is necessary to obtain information for analyzing and processing in conjunction with the instruction. Some information may be used in the analysis and processing of user instructions. For example, if the user's instruction is for a blind person to walk to a certain place, then, first of all, it is necessary to obtain the user's current position and destination position, and also need to obtain the obstacle information on the road. With Only after sufficient information is available can appropriate feedback be obtained for the analysis and processing of user instructions. In the hearing-based perception system of the present invention, the information acquisition module 20 may include an image sensor, a radar device, a radio frequency identification device, a positioning device, an audio acquisition device, an infrared device, an ultraviolet device, a laser scanner, a metal detector, and a temperature sensing device. , light sensing device, touch sensing device, air pressure sensor, water pressure sensor, olfactory recognition device, magnetic field detection device, wind detection device, humidity detection device, power detection device, speed detection device, altitude detection device, chemical analysis device, radiation detection device At least one of them may also be used by other suitable devices to obtain relevant information.
之后,分析处理模块30再根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息。例如:针对盲人行走的指令和获取的用户当前位置、目的地位置、障碍物等信息,建立三维空间模型,空间模型还可以包括与时间维度相关的信息,在本发明中,模型可以是映射实体场景/对象的模型;也可以是映射虚拟场景/对象的模型,如操作系统、操作界面、游戏、虚拟系统等;也可以是实体与虚拟的结合。建立模型后,将用户指令的相关对象代入模型中,规划出适合盲人的出行路线,并在盲人出行的过程中,根据实际情况给出提示或指引用户按照规划的路线行走。After that, the analysis and processing module 30 performs analysis and calculation according to the instruction and the information, executes the instruction and/or obtains feedback information. For example: a three-dimensional space model is established for the blind walking instruction and the obtained information such as the user's current position, destination position, and obstacles. The space model may also include information related to the time dimension. In the present invention, the model may be a mapping entity A model of a scene/object; it can also be a model that maps a virtual scene/object, such as an operating system, an operation interface, a game, a virtual system, etc.; it can also be a combination of entity and virtual. After the model is established, the relevant objects instructed by the user are substituted into the model to plan a travel route suitable for the blind, and in the process of traveling for the blind, a prompt is given according to the actual situation or the user is guided to follow the planned route.
最后,如果有反馈的信息,将反馈的信息转化为听觉感知信号,再由用户交互模块10反馈给用户。将反馈的信息转换为听觉感知信号是由用户交互模块10或分析处理模块30完成,也可以由其他设备或装置完成。例如:将盲人行走的路线及根据实际情况给出的指引与提醒转化成听觉感知信号反馈给用户。可以在需要指引用户走到某个 位置时,发出声源在目标位置的听觉感知信号,使用户能够通过听觉感知信号感知到该位置信息,从而走到该位置。在用户实际行走过程中,随着用户与目标位置之间的距离变化调整听觉感知信号,使用户能走到该位置。可以是在用户每次移动位置时,重新根据用户当前的位置与目标位置的变化,发出声源在目标位置发出传输至用户当前位置的听觉感知信号,使用户在移动过程中不断感知目标位置,最终成功到达目标位置。如果路线中存在障碍物,可以用其他频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色不同的声音信号发出表示障碍物位置、距离、高矮、危险程度等信息的听觉感知信号,使用户能够通过听觉感知信号感知到该障碍物的位置、距离、高矮、危险程度等信息,从而绕过该障碍物。Finally, if there is feedback information, the feedback information is converted into an auditory perception signal, which is then fed back to the user by the user interaction module 10 . Converting the feedback information into auditory perception signals is completed by the user interaction module 10 or the analysis processing module 30, and may also be completed by other devices or devices. For example, the route of the blind person walking and the guidance and reminders given according to the actual situation are converted into auditory perception signals and fed back to the user. When the user needs to be guided to go to a certain position, an auditory perception signal that the sound source is at the target position can be sent out, so that the user can perceive the location information through the auditory perception signal, so as to walk to the location. During the actual walking process of the user, the auditory perception signal is adjusted as the distance between the user and the target position changes, so that the user can walk to the position. It can be that every time the user moves the position, according to the change of the user's current position and the target position, the sound source sends out an auditory perception signal at the target position that is transmitted to the user's current position, so that the user can continuously perceive the target position during the movement process. Finally successfully reached the target position. If there are obstacles in the route, you can use other sound signals with different frequencies, rhythm, melody, interval, bearing, distance, size, height, length and timbre to send out the auditory perception of information such as obstacle location, distance, height, and danger. signal, so that the user can perceive the position, distance, height, danger and other information of the obstacle through the auditory perception signal, so as to bypass the obstacle.
用于表示目的地和障碍物、目标、对象、内容的不同频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色的声音信号及其组合的含义可以是预先设定的,并且是用户经过训练可以分辨并理解其中含义的信号,也可以是现有的可以包含信息的声音信号,如现有语言的语音信号或其他规则的声音信号。例如:基于听觉的感知系统可以设置和/或训练不同频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色的声音信号的定义,如对象的定义、目标的定义、方位的定义、距离的定义、颜色的定义、温度的定义、高低的定义、警告/危险信号的定义、操作信号的定义、操作结果的定义等。经过设置或训练后,用户听到设置为有代表意义的声音信号后,可以非常快的意识到该声音信号或声音信号的组合所表示的意义。本发明 的基于听觉的感知系统,也可以通过听觉感知信号与其他信号结合将信息反馈给用户,例如:可以通过听觉感知信号与盲文信号结合反馈信息。The meaning of sound signals and their combinations of different frequencies, rhythms, melody, intervals, orientations, distances, sizes, heights, lengths and timbres used to represent destinations and obstacles, targets, objects, content, and their combinations can be preset, And it is a signal that the user has been trained to distinguish and understand the meaning of, and it can also be an existing sound signal that can contain information, such as a speech signal of an existing language or other regular sound signals. For example: an auditory-based perception system can set and/or train the definitions of sound signals of different frequencies, rhythms, melody, intervals, orientations, distances, sizes, pitches, lengths, and timbres, such as object definition, target definition, orientation Definition, definition of distance, definition of color, definition of temperature, definition of high and low, definition of warning/danger signal, definition of operation signal, definition of operation result, etc. After setting or training, after hearing the sound signal set as a representative meaning, the user can quickly realize the meaning represented by the sound signal or a combination of sound signals. The auditory-based perception system of the present invention can also feed back information to the user by combining the auditory perception signal with other signals, for example, the auditory perception signal can be combined with the Braille signal to feed back information.
本发明的基于听觉的感知系统,可以用于行走、寻物、游戏、计算机、虚拟音乐会、虚拟现实技术等多种用途,盲人或正常人在光线不好、光线不佳、光线太强、弱视、近视、远视、老花、眼疲劳等情况下,或者是不方便仔细观看的时候如驾驶时,可以方便的利用基于听觉的感知系统的计算机进行绘画、演奏、作曲、书写、工作、学习等,提升听觉在这些领域的重要性,增强人们使用听觉感知信息的效果,使人类尤其是盲人可以更方便的生活。The auditory-based perception system of the present invention can be used for various purposes, such as walking, finding objects, games, computers, virtual concerts, virtual reality technology, etc. In the case of amblyopia, myopia, hyperopia, presbyopia, eye fatigue, etc., or when it is inconvenient to watch carefully, such as driving, you can easily use the computer based on the auditory perception system to paint, play, compose, write, work, and study. Etc., enhance the importance of hearing in these fields, enhance the effect of people's use of auditory perception information, so that humans, especially the blind, can live more conveniently.
下面以本发明的基于听觉的感知系统帮助盲人寻物为例具体说明。The following will take the hearing-based perception system of the present invention to help blind people find objects as an example for specific description.
第一步:用户发出寻物的指令。用户可以是通过语音来发出指令,系统通过佩戴在用户身上的麦克风获取了用户的语音指令并识别了该指令。用户也可以通过手势、动作或其他方式发出指令。此时,可以通过相关设备如摄像头等获取用户的指令。Step 1: The user issues an instruction to find the object. The user may issue an instruction through voice, and the system acquires the user's voice instruction through the microphone worn on the user and recognizes the instruction. Users can also issue commands through gestures, actions, or other means. At this time, the user's instruction can be obtained through a related device such as a camera.
第二步:系统得到了用户指令后,通过信息获取模块20获取相关信息。首先,先找到要寻找的物品,确定该物品的位置;其次,确定用户当前所处的位置上;然后获取周围的环境信息。Step 2: After the system obtains the user instruction, it obtains the relevant information through the information obtaining module 20 . First, find the item you are looking for, and determine the location of the item; second, determine the current location of the user; and then obtain the surrounding environment information.
可以是通过多个图像传感器从不同角度获取空间内的影像数据用于空间建模及位置的计算,图像传感器可以设置在房间内适合的位置,也可以随身佩戴在用户身上,例如:可以将图像传感器、用于获取指令的麦克风、用于反馈听觉感知信号给用户的耳机集成为一个头戴式可随身携带装备,佩戴在用户头部。The image data in the space can be obtained from different angles through multiple image sensors for spatial modeling and position calculation. The image sensor can be set in a suitable position in the room, or can be worn on the user's body, for example: the image can be The sensor, the microphone for obtaining instructions, and the earphone for feeding back the auditory perception signal to the user are integrated into a head-worn portable device, which is worn on the user's head.
也可以是通过定位装置来获取空间内位置信息,或者通过物联网获取各物品的位置、大小等信息。It is also possible to obtain position information in space through a positioning device, or obtain information such as the position and size of each item through the Internet of Things.
第三步:根据所述指令和所述信息进行分析计算,得出反馈的信息。首先,通过获取的信息建立空间模型,主要是通过多角度的空间影像数据,以及空间内的物品的位置信息和相关的大小尺寸等等。也可以是通过网络或其他方式获取已经建立的空间模型,或者是在已有的空间模型上进行修改,得到新的符合实际的空间模型。之后,再利用空间模型以及用户和物品的位置信息,规划出适合的取物路径。Step 3: Perform analysis and calculation according to the instruction and the information to obtain feedback information. First, a spatial model is established through the acquired information, mainly through multi-angle spatial image data, as well as the position information and related size of the objects in the space. It is also possible to obtain the established spatial model through the network or other means, or to modify the existing spatial model to obtain a new spatial model that conforms to the actual situation. After that, use the space model and the location information of users and items to plan a suitable fetching path.
还可以根据存放物品的空间/设备/家具/装置,规划拿到物品的方法,所述寻物取物还可以是分步实现。例如:1,走到储物房间门口;2,开门进入储物房间;3,走到储物柜前;4,打开第二格柜门;5,拿到要找的物品。It is also possible to plan a method for getting the item according to the space/equipment/furniture/device where the item is stored, and the finding and retrieving the item can also be implemented in steps. For example: 1, go to the door of the storage room; 2, open the door to enter the storage room; 3, go to the locker; 4, open the second cabinet door; 5, get the item you are looking for.
然后,再根据空间模型建立声学模型,得到声波传输的相关函数,用于计算听觉感知信号。建立声学模型可以采用波束跟踪算法来建立声学模型,计算相关波束与空间的相交性,得到声波传输的相关函数, 当经过这种相关函数处理的音频信号经由播放设备转化为声音时,该声音即表现出所述声源传输声波至该方位的传输特性,使用户能够感受到虚拟的声源空间方位。Then, the acoustic model is established according to the spatial model, and the correlation function of the acoustic wave transmission is obtained, which is used to calculate the auditory perception signal. To establish the acoustic model, the beam tracking algorithm can be used to establish the acoustic model, calculate the intersection of the relevant beam and the space, and obtain the correlation function of the sound wave transmission. When the audio signal processed by this correlation function is converted into sound through the playback device, the sound is The transmission characteristics of the sound wave transmitted by the sound source to the azimuth are displayed, so that the user can feel the virtual sound source spatial azimuth.
第四步,将反馈的信息转化为听觉感知信号反馈给用户。将物品所在位置虚拟一个声源发声,经过声波传输的相关函数处理后,得出听觉感知信号,经由耳机输出给用户。如果规划的取物路径不是直线路径,可以将该路径分成多个直线段,然后,在第一个直线段的终点虚拟一个声源,计算出听觉感知信号反馈给用户,当用户走到第一个直线段的终点后,进入第二个直线段,此时在第二个直线段的终点虚拟一个声源,计算出听觉感知信号反馈给用户引导用户走到该点,以此类推,直到到达寻找的物品的位置。还可以根据用户移动的路线和规划路线,方向、高低的偏差、以及时间限制/紧迫程度、设置相应听觉感知信号反馈给用户,以使用户能够被不断校正、提醒。In the fourth step, the feedback information is converted into auditory perception signals and fed back to the user. A virtual sound source is produced at the location of the item, and after processing the correlation function of sound wave transmission, the auditory perception signal is obtained, which is output to the user through the earphone. If the planned fetching path is not a straight path, the path can be divided into multiple straight segments, and then, a sound source is virtualized at the end point of the first straight segment, and the auditory perception signal is calculated and fed back to the user. After the end point of each straight line segment, enter the second straight line segment. At this time, a sound source is virtualized at the end point of the second straight line segment, and the auditory perception signal is calculated and fed back to the user to guide the user to the point, and so on until reaching the The location of the item being sought. According to the user's moving route and planned route, direction, height deviation, and time limit/urgency, the corresponding auditory perception signal can be set and fed back to the user, so that the user can be continuously corrected and reminded.
本发明的基于听觉的感知系统,还可以用于虚拟音乐会。首先,虚拟音乐会场所的场景,建立声学模型。再将各乐器、声部等分别虚拟成不同的声源,这些声源可以位于不同的位置。分别计算不同声源至用户位置的声波传输的相关函数。将各虚拟声源所产生的音乐,经过声波传输的相关函数处理后进行叠加,得出最终的音乐会的听觉感知信号输出给用户。The auditory-based perception system of the present invention can also be used for virtual concerts. First, the scene of the virtual concert venue, the acoustic model is established. Then, each instrument, part, etc. is virtualized into different sound sources, and these sound sources can be located in different positions. The correlation functions of the sound wave transmission from different sound sources to the user's location are calculated separately. The music generated by each virtual sound source is processed by the correlation function of sound wave transmission and then superimposed to obtain the final auditory perception signal of the concert and output to the user.
在本发明中,计算声波传输的相关函数可以是头部相关传输函数数据(Head Related Transfer Function,HRTF)集合,耳间时间差数 据(Interaural Time Difference,ITD)集合,耳间强度差数据(IID)集合等任何适当的能够表征声源发出的声波传输至某一方位的传输特性的数据集合。ITD是指由于声源离左、右耳的距离差异,使得声信号到达双耳时的时间差。IID是指由于声源离左、右耳的距离差异,使得声信号到达双耳时的强度差。ITD和IID均是声源位置和声波频率的函数。当声源定位数据集合为ITD和IID数据集合时,用户可以分辨声源位于其左侧还是右侧。HRTF是自由场情况下从声源到双耳的声学传输函数,其用来描述在自由声场中的声源发出的声波,以一定角度入射到耳道内某点时所发生的特征变化。HRTF是声源位置、声波频率以及人体表面形状和性质的函数。从声源到人体测量点的单位脉冲响应称为头部相关脉冲响应(Head Related Impulse Response,HRIR).HRTF是HRIR的傅立叶变换。当声源定位数据集合为HRTF数据集合时,用户可以分辨声源位于其前方、后方、上方、下方、左侧还是右側。In the present invention, the correlation function for calculating acoustic wave transmission may be a set of head related transfer function data (Head Related Transfer Function, HRTF), a set of interaural time difference data (Interaural Time Difference, ITD), and a set of interaural intensity difference data (IID) Any appropriate set of data that can characterize the transmission characteristics of sound waves emitted by a sound source to a certain azimuth. ITD refers to the time difference between the sound signal reaching both ears due to the distance difference between the sound source and the left and right ears. IID refers to the difference in intensity of the acoustic signal when it reaches both ears due to the difference in the distance between the sound source and the left and right ears. Both ITD and IID are functions of sound source location and sound wave frequency. When the sound source localization data set is the ITD and IID data sets, the user can distinguish whether the sound source is located on its left or right. HRTF is the acoustic transfer function from the sound source to both ears in the free field, which is used to describe the characteristic changes that occur when the sound wave emitted by the sound source in the free sound field is incident at a certain point in the ear canal at a certain angle. HRTF is a function of the location of the sound source, the frequency of the sound wave, and the shape and properties of the body surface. The unit impulse response from the sound source to the anthropometric point is called the Head Related Impulse Response (HRIR). HRTF is the Fourier transform of HRIR. When the sound source localization data set is the HRTF data set, the user can distinguish whether the sound source is located in front, behind, above, below, left or right.
如果用多个函数数据集合分别表征声源传输声波至多个具体方位的传输特性,并用该多个函数数据集合分别处理音频信号,就可以使音频信号分别表现出所述声源传输声波至多个具体方位的传输特性。借助这种方案可以构建虚拟的听觉环境,在此基础上,如果将用户的现实物理方位投射为虚拟听觉环境中的具体方位,在用户的不同现实物理方位和虚拟听觉环境中的不同具体方位之间建立对应关系,就可以使用户按照自身物理方位的不同,听到与自身物理方位相符合的声音效果。If multiple function data sets are used to represent the transmission characteristics of the sound waves transmitted by the sound source to multiple specific azimuths respectively, and the multiple function data sets are used to process the audio signals respectively, the audio signals can respectively represent the sound waves transmitted by the sound source to multiple specific azimuths. Azimuth transmission characteristics. With this solution, a virtual auditory environment can be constructed. On this basis, if the user's real physical orientation is projected as a specific orientation in the virtual auditory environment, the difference between the user's different real physical orientations and the different specific orientations in the virtual auditory environment will occur. By establishing a corresponding relationship between them, users can hear sound effects that are consistent with their own physical orientations according to their own physical orientations.
可以设定允许用户移动位置,当用户的位置发生变化或用户耳朵的方向发生变化时,重新计算不同声源的声波传输的相关函数,用户可以宛若身临其境,真正在现场欣赏音乐会。用户还可以发出指令,调整某些乐器、声部的发声位置,音量大小等,随心所欲的欣赏属于自己的音乐会。It can be set to allow the user to move the position. When the user's position changes or the direction of the user's ear changes, the correlation function of the sound wave transmission of different sound sources is recalculated. The user can really enjoy the concert on the spot as if he were there. Users can also issue instructions to adjust the sounding position, volume, etc. of certain instruments and parts, and enjoy their own concerts at will.
如上所述的基于听觉的感知系统,所述基于听觉的感知系统用于辅助行走、辅助运动、运动训练、导航、辅助驾驶、辅助停车、定位、位置引导、发现目标、反映画面、反映物体、探测、侦查、勘探、设计、维修、设备使用、装置使用、学习、教学、购物、办公、社交、游戏、娱乐、影视、计算机、健康测试、疾病诊断、手术治疗、虚拟音乐会、虚拟现实技术中的至少一项。例如:在行走过程中通过听觉信号的方位、远近、信号类别等,提醒用户路线、路况的变化和障碍物具体位置;在运动训练中,通过听觉信号提醒运动员动作的角度、距离是否符合训练要求或指引运动员动作;在辅助驾驶中,可以通过听觉信号提醒司机/飞行员路线或者航线以及相关物体的位置和远近;位置引导可以通过听觉信号使用户准确的掌握目标位置,如将钥匙插入钥匙孔时,听觉信号可以反映钥匙孔与钥匙的相对方位以及距离,即便在看不见/看不清的情况下也能快速对准;发现目标可以是通过雷达或者红外等设备发现目标物/人后,通过听觉信号将目标物/人的相对方位、距离反馈给用户;用于反映画面时,可以通过图像传感器获取需要反映的画面,然后图像识别软件将画面内容解析为点、线条、图形、颜色等图像的不同信息要素,系统将相关信息要素和其 位置信息转化为听觉信号,使用户能够根据听觉信号接收到画面的相关信息,或者系统可以根据用户指定位置,将该位置的画面信息通过听觉信息反馈给用户,随着用户指定位置在画面上不同位置的移动,系统可以通过听觉信息将画面信息传递给用户,这里的画面可以是实真实的画面,也可以是存储在系统中的虚拟画面;用于设计时,可以增加设计师在空间方面的信息反馈,对涉及方案有更立体直观的感受。实现上述应用可以是本发明的基于听觉的感知系统单独或与其他系统、设备相结合后共同完成。The auditory-based perception system as described above, which is used for assisting walking, assisting movement, sports training, navigation, assisting driving, assisting parking, positioning, location guidance, finding targets, reflecting pictures, reflecting objects, Detection, reconnaissance, exploration, design, maintenance, equipment use, device use, learning, teaching, shopping, office, social, games, entertainment, film and television, computer, health testing, disease diagnosis, surgical treatment, virtual concerts, virtual reality technology at least one of. For example: in the process of walking, remind the user of the route, the change of road conditions and the specific location of the obstacle through the position, distance, signal type, etc. of the auditory signal; in sports training, through the auditory signal to remind the athlete whether the angle and distance of the action meet the training requirements Or guide the athlete’s actions; in assisted driving, the driver/pilot’s route or route and the location and distance of related objects can be reminded through auditory signals; position guidance can enable users to accurately grasp the target position through auditory signals, such as when inserting a key into the keyhole , the auditory signal can reflect the relative orientation and distance between the keyhole and the key, and it can be quickly aligned even if it is invisible/invisible; the target can be found by radar or infrared equipment. The auditory signal feeds back the relative orientation and distance of the target/person to the user; when it is used to reflect the picture, the picture to be reflected can be obtained through the image sensor, and then the image recognition software parses the picture content into images such as points, lines, graphics, colors, etc. The system converts the relevant information elements and their position information into auditory signals, so that the user can receive the relevant information of the screen according to the auditory signal, or the system can feedback the screen information of the position through the auditory information according to the position specified by the user. For the user, with the movement of the user-specified position in different positions on the screen, the system can transmit the screen information to the user through auditory information. The screen here can be a real screen or a virtual screen stored in the system; When designing, it can increase the information feedback of the designer in terms of space, and have a more three-dimensional and intuitive feeling about the scheme involved. The above application can be achieved by the auditory-based perception system of the present invention alone or in combination with other systems and devices.
本发明的基于听觉的感知系统,如上所述的基于听觉的感知系统,所述基于听觉的感知系统作为基于听觉的操作系统可单独或与其他系统结合,用于操作计算机、人工智能、智能设备、虚拟现实设备或其他适合的设备。现有的计算机系统通常使用视频操作界面,对于盲人或视力不好的人,或者普通人在远距离使用时,不能很好的将计算机系统反馈的信息传递给用户。另外现有基于视觉的界面承载的信息量和信息形式的限制,很多时候不能充分、形象、准确的接受指令和反馈信息。采用本发明的基于听觉的感知系统,将反馈的信息转化为听觉感知信号反馈给用户,并且可以多维度接受用户指令,可以实现盲人或视力不好的人,或者普通人在远距离方便的获取计算机系统反馈的信息,也可以增加现有计算机交互的方式和增强交互的效果,降低相关人员使用计算机系统及操控设备的难度,提高计算机使用效果。The auditory-based perception system of the present invention, the above-mentioned auditory-based perception system, as an auditory-based operating system, the auditory-based perception system can be used alone or in combination with other systems for operating computers, artificial intelligence, and smart devices. , virtual reality device or other suitable device. Existing computer systems usually use a video operation interface. For blind people or people with poor eyesight, or when ordinary people use it at a distance, the information fed back by the computer system cannot be well transmitted to users. In addition, the existing visual-based interface is limited in the amount of information and the form of information, and many times, it cannot fully, vividly and accurately accept instructions and feedback information. By adopting the auditory-based perception system of the present invention, the feedback information is converted into auditory perception signals and fed back to the user, and the user's instructions can be received in multiple dimensions, so that the blind or poor-sighted people, or ordinary people can easily obtain the information from a long distance. The information fed back by the computer system can also increase the way of existing computer interaction and enhance the effect of interaction, reduce the difficulty of relevant personnel using the computer system and control equipment, and improve the effect of computer use.
基于听觉的感知系统的计算机系统可以和现有计算机系统一样,实现位置、路线、数量、大小、温度、时间、程度、形状、状态、对象的操作,还可以包括对象识别/区分/展开、对象虚拟位置的移动、对象修改/删除、生成、变更等等。还可以通过利用本发明的基于听觉的感知系统控制设备完成对目标实物的操作,例如:通过本发明的基于听觉的感知系统实现操作机械臂、机器人、智能家具、无人驾驶车辆、无人机、电纸书等等。结合了基于听觉的感知系统的计算机系统能够在现有计算机应用的基础上,增加信息的空间维度以及听觉可以承载的其他信息维度,大大提高计算机的应用效率和使用体验。The computer system based on the auditory perception system can realize the operation of position, route, quantity, size, temperature, time, degree, shape, state, and object, as well as the existing computer system, and can also include object recognition/discrimination/expansion, object Movement of virtual locations, object modification/deletion, generation, alteration, etc. It is also possible to use the auditory-based perception system of the present invention to control the device to complete the operation of the target object, for example: realize the operation of manipulators, robots, smart furniture, unmanned vehicles, drones through the auditory-based perception system of the present invention. , electronic paper books, etc. A computer system combined with an auditory-based perception system can increase the spatial dimension of information and other dimensions of information that can be carried by hearing on the basis of existing computer applications, greatly improving the application efficiency and use experience of computers.
本发明的基于听觉的感知系统,还可以包括数据传输模块,所述数据传输模块将所述用户交互模块接受的指令和所述信息获取模块获取的信息或者是经分析处理模块处理的指令和/或信息发送至服务器/网络/系统/智能设备,所述服务器/网络/系统/智能设备根据所述指令和所述信息进行分析计算,执行所述指令和/或将结果传输至所述数据传输模块。具体网络/系统/智能设备包括:互联网、物联网、卫星网络、局域网、智能办公系统、智能家居系统、智能手机、智能电视、智能汽车、智慧道路、智慧城市、无人机、智能机器人、智能厨房、智能服装等。通过数据传输模块将数据传输至服务器/网络/系统/智能设备进行分析计算,可以增加本发明的基于听觉的感知系统的数据处理能力,也可以扩大本发明的基于听觉的感知系统的应用范围,同时可以降低分析处理模块的计算量,降低对分析处理模块的硬件需求,降低本发明的基于听觉的感知系统的成本和重量。The auditory-based perception system of the present invention may further include a data transmission module, the data transmission module transmits the instructions received by the user interaction module and the information acquired by the information acquisition module or the instructions processed by the analysis processing module and/or Or the information is sent to the server/network/system/smart device, the server/network/system/smart device performs analysis and calculation according to the instruction and the information, executes the instruction and/or transmits the result to the data transmission module. Specific networks/systems/smart devices include: Internet, Internet of Things, satellite networks, local area networks, smart office systems, smart home systems, smart phones, smart TVs, smart cars, smart roads, smart cities, drones, smart robots, smart Kitchen, smart clothing, etc. Through the data transmission module, the data is transmitted to the server/network/system/smart device for analysis and calculation, which can increase the data processing capability of the auditory-based perception system of the present invention, and can also expand the application range of the auditory-based perception system of the present invention. At the same time, the calculation amount of the analysis processing module can be reduced, the hardware requirement for the analysis processing module can be reduced, and the cost and weight of the auditory-based perception system of the present invention can be reduced.
本发明的基于听觉的感知系统也可以通过数据传输模块或者信息获取模块从互联网、物联网,或其他信息系统、服务器、智能设备中获取信息,用于结合指令分析计算。具体信息来源可以包括:互联网、物联网、卫星网络、局域网、智能办公系统、智能家居系统、智能手机、智能音箱、智能汽车、智慧道路、智慧城市、无人机、智能机器人、智能厨房、智能服装、智能眼镜等。通过结合上述网络、系统、服务器、智能设备获取信息,可以获取更丰富更全面的信息,并且降低对信息获取模块20的硬件需求,降低本发明的基于听觉的感知系统的成本和重量。The auditory-based perception system of the present invention can also obtain information from the Internet, the Internet of Things, or other information systems, servers, and smart devices through a data transmission module or an information acquisition module, for analysis and calculation in combination with instructions. Specific sources of information can include: Internet, Internet of Things, satellite networks, local area networks, smart office systems, smart home systems, smart phones, smart speakers, smart cars, smart roads, smart cities, drones, smart robots, smart kitchens, smart Clothing, smart glasses, etc. By acquiring information in combination with the above network, system, server, and smart device, richer and more comprehensive information can be acquired, and the hardware requirements for the information acquisition module 20 can be reduced, thereby reducing the cost and weight of the auditory-based perception system of the present invention.
本发明的基于听觉的感知系统,建立模型的方式可以是由单个系统建立完整的模型,或者由多个系统、设备基于统一的信号/信息标准分别建立一部分模型,再由其中一个或多个系统或服务器进行整合成为一套完整的模型。建立模型所需信息的获取方式可以是通过智能电器、智能家具、智能房屋(家庭、病房、医院、学校、工厂)、智能道路、智能手机、智能音箱、智能汽车、智能城市系统、图像传感器、定位装置、音频获取装置来获取。In the auditory-based perception system of the present invention, the model can be established by establishing a complete model by a single system, or by establishing a part of the model by multiple systems and devices based on unified signal/information standards, and then by one or more of the systems. Or servers can be integrated into a complete set of models. The information needed to build the model can be obtained through smart appliances, smart furniture, smart houses (homes, wards, hospitals, schools, factories), smart roads, smart phones, smart speakers, smart cars, smart city systems, image sensors, A positioning device and an audio acquisition device are used to obtain it.
本发明第二实施例参阅图2。图2为本发明第二实施例一种基于听觉的感知系统的使用方法的方法流程图。如图所示,本发明的基于听觉的感知系统的使用方法包括:Refer to FIG. 2 for the second embodiment of the present invention. FIG. 2 is a method flowchart of a method for using an auditory-based perception system according to a second embodiment of the present invention. As shown in the figure, the use method of the auditory-based perception system of the present invention includes:
S1:接收用户指令;S1: Receive user instructions;
S2:获取用于结合指令进行分析处理的信息;S2: Obtain information for analyzing and processing in combination with the instruction;
S3:根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息;S3: perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information;
S4:如果有反馈的信息,将反馈的信息转化为听觉感知信号反馈给用户。S4: If there is feedback information, convert the feedback information into an auditory perception signal and feed it back to the user.
本发明的一种基于听觉的感知系统的使用方法与本发明的一种基于听觉的感知系统的技术特征一一对应,可以参照前述一种基于听觉的感知系统的说明,在此不再赘述。The method of using an auditory-based perception system of the present invention corresponds to the technical features of an auditory-based perception system of the present invention. Reference can be made to the foregoing description of the auditory-based perception system, which will not be repeated here.
综上所述,本发明的一种基于听觉的感知系统及其使用方法,所述系统包括:用户交互模块、信息获取模块和分析处理模块。所述方法包括:接收用户指令;获取用于结合指令进行分析处理的信息;根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息;如果有反馈的信息,将反馈的信息转化为听觉感知信号反馈给用户。通过本发明的一种基于听觉的感知系统及其使用方法,能够帮助人们更好的利用听觉来感知外界的信息,增强感知的效果,并可以帮助盲人或者帮助正常人在光线不足、光线不佳、光线太强、弱视、近视、远视、老花、眼疲劳等环境中,或者是不方便仔细观看的时候如驾驶时,提升行走、寻物、使用计算机以及智能设备/智能系统等的效率。To sum up, an auditory-based perception system and a method for using the same of the present invention include: a user interaction module, an information acquisition module, and an analysis and processing module. The method includes: receiving a user instruction; acquiring information for analyzing and processing the instruction; performing analysis and calculation according to the instruction and the information, executing the instruction and/or obtaining feedback information; if there is feedback information , which converts the feedback information into auditory perception signals and feeds them back to the user. The auditory-based perception system and its using method of the present invention can help people better use their hearing to perceive external information, enhance the effect of perception, and can help blind people or normal people when there is insufficient light or poor light. , too strong light, amblyopia, myopia, hyperopia, presbyopia, eye fatigue and other environments, or when it is inconvenient to watch carefully, such as when driving, to improve the efficiency of walking, finding objects, using computers, and smart devices/intelligent systems.
以上所述的具体实施方式,对本发明的目的、技术方案和有益效果进行了进一步详细说明,所应理解的是,以上所述仅为本发明的具体实施方式而已,并不用于限定本发明的保护范围,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The specific embodiments described above further describe the objectives, technical solutions and beneficial effects of the present invention in detail. It should be understood that the above descriptions are only specific embodiments of the present invention, and are not intended to limit the scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included within the protection scope of the present invention.

Claims (13)

  1. 一种基于听觉的感知系统,其特征在于,所述系统包括:用户交互模块、信息获取模块和分析处理模块,An auditory-based perception system, characterized in that the system comprises: a user interaction module, an information acquisition module and an analysis and processing module,
    所述用户交互模块用于接收指令并将反馈的信息以听觉感知信号反馈给用户;The user interaction module is used for receiving an instruction and feeding back the feedback information to the user as an auditory perception signal;
    所述信息获取模块用于获取信息,所述信息用于供分析处理模块结合指令进行分析处理;The information acquisition module is used to acquire information, and the information is used for analysis and processing by the analysis processing module in combination with the instruction;
    所述分析处理模块用于根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息。The analysis and processing module is configured to perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information.
  2. 根据权利要求1所述的基于听觉的感知系统,其特征在于:将反馈的信息转换为听觉感知信号是由用户交互模块或分析处理模块完成。The auditory-based perception system according to claim 1, wherein the conversion of feedback information into auditory perception signals is completed by a user interaction module or an analysis and processing module.
  3. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述听觉感知信号是通过声音的频率、节奏、旋律、间隔、方位、距离、大小、高低、长短和音色中的至少一项来表示信息。The auditory-based perception system according to claim 1, wherein the auditory perception signal is at least one of frequency, rhythm, melody, interval, orientation, distance, size, height, length and timbre of the sound to represent information.
  4. 根据权利要求3所述的基于听觉的感知系统,其特征在于:所述听觉感知信号包括语音信号。The auditory-based perception system of claim 3, wherein the auditory perception signal comprises a speech signal.
  5. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述用户交互模块包括指令获取模块和听觉感知信号输出模块,所述听觉感知信号输出模块是耳机、扬声器、助听器、脑机接口中的至少一项。The auditory-based perception system according to claim 1, wherein the user interaction module comprises an instruction acquisition module and an auditory perception signal output module, and the auditory perception signal output module is an earphone, a speaker, a hearing aid, a brain-computer interface at least one of.
  6. 根据权利要求3所述的基于听觉的感知系统,其特征在于:所述指令获取模块包括语音识别装置、声音识别装置、手势识别装置、肢体动作识别装置、表情识别装置、身体信号识别装置、智能可穿戴设备、智能平板、手机、鼠标、键盘、智能手柄、智能手杖、智能指环、智能手环中的至少一项。The auditory-based perception system according to claim 3, wherein the instruction acquisition module comprises a voice recognition device, a voice recognition device, a gesture recognition device, a body motion recognition device, an expression recognition device, a body signal recognition device, an intelligent At least one of wearable devices, smart tablets, mobile phones, mice, keyboards, smart handles, smart canes, smart rings, and smart wristbands.
  7. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述信息获取模块包括图像传感器、雷达装置、无线射频识别装置、定位装置、音频获取装置、红外装置、紫外装置、激光扫描器、金属探测器、温感装置、光感装置、触感装置、气压传感器、水压传感器、嗅觉识别装置、磁场探测装置、风力探测装置、湿度探测装置、电力探测装置、速度探测装置、高度探测装置、化学分析装置、放射线探测装置中的至少一个。The auditory-based perception system according to claim 1, wherein the information acquisition module comprises an image sensor, a radar device, a radio frequency identification device, a positioning device, an audio acquisition device, an infrared device, an ultraviolet device, and a laser scanner , metal detector, temperature sensing device, light sensing device, touch sensing device, air pressure sensor, water pressure sensor, olfactory recognition device, magnetic field detection device, wind detection device, humidity detection device, power detection device, speed detection device, altitude detection device , at least one of a chemical analysis device and a radiation detection device.
  8. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述基于听觉的感知系统还能通过非听觉信号向用户反馈信息,或通过听觉信号及非听觉信号联合向用户反馈信息。The auditory-based perception system according to claim 1, wherein the auditory-based perception system can also feed back information to the user through non-auditory signals, or feedback information to the user through a combination of auditory signals and non-auditory signals.
  9. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述基于听觉的感知系统还包括数据传输模块,所述数据传输模块将所述用户交互模块接受的指令和所述信息获取模块获取的信息或者是经分析处理模块处理的指令和/或信息发送至网络/系统/服务器/智能设备,所述服务器/网络/系统/智能设备根据所述指令和所述信息进行分析计算,执行所述指令和/或将结果传输至所述数据传输模块。The auditory-based perception system according to claim 1, characterized in that: the auditory-based perception system further comprises a data transmission module, and the data transmission module transfers the instruction accepted by the user interaction module to the information acquisition module. The acquired information or the instructions and/or information processed by the analysis processing module are sent to the network/system/server/smart device, and the server/network/system/smart device performs analysis and calculation according to the instruction and the information, and executes the The instructions and/or results are transmitted to the data transmission module.
  10. 根据权利要求9所述的基于听觉的感知系统,其特征在于:所述基于听觉的感知系统还能通过所述数据传输模块或者信息获取模块从网络/系统/服务器/智能设备获取用于结合指令进行分析处理的信息。The auditory-based perception system according to claim 9, characterized in that: the auditory-based perception system can also obtain instructions for combining from a network/system/server/smart device through the data transmission module or the information acquisition module Information for analytical processing.
  11. 根据权利要求1-10中任一权利要求所述的基于听觉的感知系统,其特征在于:所述基于听觉的感知系统用于辅助行走、辅助运动、运动训练、导航、辅助驾驶、辅助停车、定位、位置引导、发现目标、反映画面、反映物体、探测、侦查、勘探、设计、维修、设备使用、装置使用、学习、教学、购物、办公、社交、游戏、娱乐、影视、计算机、健康测试、疾病诊断、手术治疗、虚拟音乐会、虚拟现实技术中的至少一项。The auditory-based perception system according to any one of claims 1-10, wherein the auditory-based perception system is used for assisting walking, assisting movement, sports training, navigation, assisting driving, assisting parking, Positioning, location guidance, finding targets, reflecting images, reflecting objects, detection, reconnaissance, exploration, design, maintenance, equipment use, device use, learning, teaching, shopping, office, social interaction, games, entertainment, film and television, computers, health testing , at least one of disease diagnosis, surgical treatment, virtual concert, and virtual reality technology.
  12. 根据权利要求1所述的基于听觉的感知系统,其特征在于:所述基于听觉的感知系统作为基于听觉的操作系统可单独或与其他系统结合,用于操作计算机、人工智能、智能设备、虚拟现实设备。The auditory-based perception system according to claim 1, wherein the auditory-based perception system can be used alone or in combination with other systems as an auditory-based operating system for operating computers, artificial intelligence, smart devices, virtual Realistic device.
  13. 一种如权利要求1-12中任一权利要求所述的基于听觉的感知系统的使用方法,其特征在于,所述方法包括:A method of using an auditory-based perception system according to any one of claims 1-12, wherein the method comprises:
    接收用户指令;receive user instructions;
    获取用于结合指令进行分析处理的信息;Obtain information for analysis and processing combined with instructions;
    根据所述指令和所述信息进行分析计算,执行所述指令和/或得出反馈的信息;Perform analysis and calculation according to the instruction and the information, execute the instruction and/or obtain feedback information;
    如果有反馈的信息,将反馈的信息转化为听觉感知信号反馈给用户。If there is feedback information, the feedback information is converted into auditory perception signals and fed back to the user.
PCT/CN2021/079689 2021-03-09 2021-03-09 Hearing-based perception system and method for using same WO2022188022A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2021/079689 WO2022188022A1 (en) 2021-03-09 2021-03-09 Hearing-based perception system and method for using same
CN202180000425.0A CN113196390B (en) 2021-03-09 2021-03-09 Auditory sense system and application method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/079689 WO2022188022A1 (en) 2021-03-09 2021-03-09 Hearing-based perception system and method for using same

Publications (1)

Publication Number Publication Date
WO2022188022A1 true WO2022188022A1 (en) 2022-09-15

Family

ID=76976987

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/079689 WO2022188022A1 (en) 2021-03-09 2021-03-09 Hearing-based perception system and method for using same

Country Status (2)

Country Link
CN (1) CN113196390B (en)
WO (1) WO2022188022A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113840588A (en) * 2021-08-15 2021-12-24 曹庆恒 Touch sensing system and use method thereof
CN113975585A (en) * 2021-09-10 2022-01-28 袁穗薇 Diversified training method for children

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN203825313U (en) * 2013-12-16 2014-09-10 智博锐视(北京)科技有限公司 Blind navigation glasses
CN104983511A (en) * 2015-05-18 2015-10-21 上海交通大学 Voice-helping intelligent glasses system aiming at totally-blind visual handicapped
CN106214436A (en) * 2016-07-22 2016-12-14 上海师范大学 A kind of intelligent blind guiding system based on mobile phone terminal and blind-guiding method thereof
US20170303052A1 (en) * 2016-04-18 2017-10-19 Olive Devices LLC Wearable auditory feedback device
EP3432606A1 (en) * 2018-03-09 2019-01-23 Oticon A/s Hearing aid system
CN109831631A (en) * 2019-01-04 2019-05-31 华南理工大学 A kind of view of view-based access control model attention characteristic-sense of hearing conversion blind-guiding method
CN110559127A (en) * 2019-08-27 2019-12-13 上海交通大学 intelligent blind assisting system and method based on auditory sense and tactile sense guide
CN111643324A (en) * 2020-07-13 2020-09-11 江苏中科智能制造研究院有限公司 Intelligent glasses for blind people

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN203825313U (en) * 2013-12-16 2014-09-10 智博锐视(北京)科技有限公司 Blind navigation glasses
CN104983511A (en) * 2015-05-18 2015-10-21 上海交通大学 Voice-helping intelligent glasses system aiming at totally-blind visual handicapped
US20170303052A1 (en) * 2016-04-18 2017-10-19 Olive Devices LLC Wearable auditory feedback device
CN106214436A (en) * 2016-07-22 2016-12-14 上海师范大学 A kind of intelligent blind guiding system based on mobile phone terminal and blind-guiding method thereof
EP3432606A1 (en) * 2018-03-09 2019-01-23 Oticon A/s Hearing aid system
CN109831631A (en) * 2019-01-04 2019-05-31 华南理工大学 A kind of view of view-based access control model attention characteristic-sense of hearing conversion blind-guiding method
CN110559127A (en) * 2019-08-27 2019-12-13 上海交通大学 intelligent blind assisting system and method based on auditory sense and tactile sense guide
CN111643324A (en) * 2020-07-13 2020-09-11 江苏中科智能制造研究院有限公司 Intelligent glasses for blind people

Also Published As

Publication number Publication date
CN113196390B (en) 2024-04-05
CN113196390A (en) 2021-07-30

Similar Documents

Publication Publication Date Title
AU2023200677B2 (en) System and method for augmented and virtual reality
Hu et al. An overview of assistive devices for blind and visually impaired people
Csapó et al. A survey of assistive technologies and applications for blind users on mobile platforms: a review and foundation for research
Geronazzo et al. Interactive spatial sonification for non-visual exploration of virtual maps
CN104011788A (en) System And Method For Augmented And Virtual Reality
WO2022188022A1 (en) Hearing-based perception system and method for using same
Schwarze et al. A camera-based mobility aid for visually impaired people
Hub et al. Interactive tracking of movable objects for the blind on the basis of environment models and perception-oriented object recognition methods
Giudice et al. Spatial learning and navigation using a virtual verbal display
WO2023019376A1 (en) Tactile sensing system and method for using same
May et al. Spotlights and soundscapes: On the design of mixed reality auditory environments for persons with visual impairment
Du et al. Human–robot collaborative control in a virtual-reality-based telepresence system
Wang et al. A survey of 17 indoor travel assistance systems for blind and visually impaired people
Mazuryk et al. History, applications, technology and future
D. Gomez et al. See ColOr: an extended sensory substitution device for the visually impaired
Mihelj et al. Introduction to virtual reality
Thalmann et al. Virtual reality software and technology
Röber et al. Interacting With Sound: An Interaction Paradigm for Virtual Auditory Worlds.
Sardana et al. Introducing locus: a nime for immersive exocentric aural environments
Olivetti Belardinelli et al. Sonification of spatial information: audio-tactile exploration strategies by normal and blind subjects
Zhang et al. A survey of immersive visualization: Focus on perception and interaction
Bellotto A multimodal smartphone interface for active perception by visually impaired
Jones et al. Use of Immersive Audio as an Assistive Technology for the Visually Impaired–A Systematic Review
Magnenat-Thalmann et al. Virtual reality software and technology
Luna Introduction to Virtual Reality

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21929504

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21929504

Country of ref document: EP

Kind code of ref document: A1