US12372981B2 - Method and system for rhythmic motion control of robot based on neural oscillator - Google Patents
Method and system for rhythmic motion control of robot based on neural oscillatorInfo
- Publication number
- US12372981B2 US12372981B2 US18/266,643 US202218266643A US12372981B2 US 12372981 B2 US12372981 B2 US 12372981B2 US 202218266643 A US202218266643 A US 202218266643A US 12372981 B2 US12372981 B2 US 12372981B2
- Authority
- US
- United States
- Prior art keywords
- phase
- robot
- joint position
- neural oscillator
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
- G05D1/08—Control of attitude, i.e. control of roll, pitch, or yaw
- G05D1/0891—Control of attitude, i.e. control of roll, pitch, or yaw specially adapted for land vehicles
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B62—LAND VEHICLES FOR TRAVELLING OTHERWISE THAN ON RAILS
- B62D—MOTOR VEHICLES; TRAILERS
- B62D57/00—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track
- B62D57/02—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members
- B62D57/032—Vehicles characterised by having other propulsion or other ground- engaging means than wheels or endless track, alone or in addition to wheels or endless track with ground-engaging propulsion means, e.g. walking members with alternately or sequentially lifted supporting base and legs; with alternately or sequentially lifted feet or skid
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
- G05D1/40—Control within particular dimensions
- G05D1/49—Control of attitude, i.e. control of roll, pitch or yaw
- G05D1/495—Control of attitude, i.e. control of roll, pitch or yaw to ensure stability
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D1/00—Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
- G05D1/60—Intended control result
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D2101/00—Details of software or hardware architectures used for the control of position
- G05D2101/10—Details of software or hardware architectures used for the control of position using artificial intelligence [AI] techniques
- G05D2101/15—Details of software or hardware architectures used for the control of position using artificial intelligence [AI] techniques using machine learning, e.g. neural networks
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05D—SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
- G05D2109/00—Types of controlled vehicles
- G05D2109/10—Land vehicles
- G05D2109/12—Land vehicles with legs
Definitions
- the present disclosure provides a method and a system for rhythmic motion control of a robot based on a neural oscillator.
- a control structure designed by the present disclosure which is composed of a neural oscillator and a pattern formation network, can ensure formation of an expected rhythmic motion behavior; and meanwhile, a designed action space of joint position increment can effectively accelerate the training process of rhythmic motion reinforcement learning.
- the present disclosure provides a method for rhythmic motion control of a robot based on a neural oscillator, including:
- rhythmic motion widely exists in human and animal behaviors, such as in walking, running and steering. It is very important to change a motion pattern flexibly for animals to pass smoothly in a harsh environment. Therefore, it is an important subject in biology and robotics to study mechanism of different biological driving rhythmic motions. It has been found in physiological researches that a central pattern generator, that is, a neural circuit of an organism in spinal cord, plays a key role in generation of rhythmic motion, which can produce appropriate rhythmic information to modulate output of motoneurons. Command information from a motion area of midbrain and sensory afferent information from proprioceptors and exteroceptors can change rhythm patterns to adapt to different motion scenes. Inspired by this, some researchers provide rhythmic information by designing a simple phase oscillator so as to obtain rhythmic motion behavior instructions.
- a bio-neural oscillator namely a rhythm generator (RG)
- RG rhythm generator
- this embodiment uses an RG network to adjust phase transition time of a robot sole trajectory between a swinging stage and a standing stage, and a pattern formation (PF) network to output 12 motor control commands of the robot.
- PF pattern formation
- the RG network determines durations of flexor and extensor phases, while the PF network is responsible for generating information that periodically activates flexor and extensor motoneurons.
- this embodiment instantiates a proposed bionic rhythmic motion control structure by encouraging the robot to keep its sole contact with ground when in a swinging phase and when in a supporting phase upon feet are lifted. Existence of periodic rhythm signals of legs ensures formation of animal-like rhythmic motion behaviors of the legged robot.
- a training process can focus on training the legged robot to complete a main motion task, such as forward motion, left-right motion and steering motion.
- phase estimation of the legs provided by the RG network can also improve accurate estimation of an ontology speed and state by a robot platform when a strategy is deployed on a real robot.
- state estimation technology of the quadruped robot requires contact phase information of the legs of the robot in contact with the ground to fuse measurement information from an inertial measurement unit (IMU) and joint state information so as to complete estimation of a state of a whole body, or use force sensors to realize detection of retrogression contact information.
- IMU inertial measurement unit
- addition of the sensors may increase overall cost and power consumption of the robot and reduce robustness of the system.
- a RL strategy in this embodiment outputs the joint position increment, which is added with a target joint position command at a previous moment to obtain a motor control command at a current moment.
- Design of this new action space can accelerate training speed of rhythmic motion, because an action range that can be explored with the RL strategy is limited near a joint position at the current moment.
- a maximum motor speed all of target joint position commands that can cause joints of the robot to exceed the maximum motor speed are not conducive to a training process, and design of this action space avoids exploration and selection of some invalid motor commands, thus greatly accelerating the training process.
- This embodiment provides a method for rhythmic motion control of a robot based on a neural oscillator, aiming at naturally stimulating rhythmic motion behaviors of a quadruped robot, which is inspired by biological motion mechanism, and accelerating a RL learning process.
- a proposed learning framework can be divided into two parts: a bionic control structure composed of RG and PF networks, and design of a new action space (joint position increment), as shown in FIG. 2 , which is a diagram of rhythmic motion mechanism of a spinal animal.
- a motion problem of the quadruped robot is regarded as a partially observable Markov decision processes (POMDP) S, A, R, P, ⁇ , where S and A represent a state and the action space, respectively; R(S t ,S t+1 ) ⁇ represents the reward function; P(s t+1
- the quadruped robot takes an action ⁇ in a current state s, gets a scalar reward r, and then moves to a next state s t+1 , which is determined by a probability distribution of state transition P(s t+1
- An overall goal of training of the quadruped robot is to find an optimal strategy ⁇ ⁇ * to maximize a discount reward in the future, ⁇ * is:
- an input state S t ⁇ 60 includes a three-dimensional control command ⁇ (including a forward velocity ⁇ circumflex over (v) ⁇ x , a lateral velocity ⁇ circumflex over (v) ⁇ y and a steering angular rate ⁇ circumflex over ( ⁇ ) ⁇ z ), a three-dimensional linear velocity of a base v, a three-dimensional angular rate of the base ⁇ , a three-dimensional rotation direction ⁇ g (expressed as a direction of a gravity vector in an IMU coordinate system), a 12-dimensional joint position q, a 12-dimensional joint velocity ⁇ dot over (q) ⁇ , a 12-dimensional joint position error q e (the joint position q minus the target joint position ⁇ circumflex over (q) ⁇ ), a 8-dimensional RG phase generated by the RG network and a 4-dimensional RG frequency f (where the RG phase ⁇ is represented by sine and cosine functions).
- a three-dimensional control command ⁇ including a forward
- the RL strategy in this embodiment outputs the joint position increment ⁇ q t , and the target joint position q t , at the current moment is defined as:
Landscapes
- Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Aviation & Aerospace Engineering (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Physics & Mathematics (AREA)
- Automation & Control Theory (AREA)
- Combustion & Propulsion (AREA)
- Transportation (AREA)
- Mechanical Engineering (AREA)
- Chemical & Material Sciences (AREA)
- Manipulator (AREA)
- Feedback Control In General (AREA)
Abstract
Description
-
- acquiring a current state of the robot, and a phase and a frequency generated by the neural oscillator; and
- obtaining a control instruction according to the acquired current state, phase and frequency and a preset reinforcement learning network so as to control the robot.
-
- where ϕt represents the phase at the current moment; ϕt−1 represents the frequency at the previous moment; f represents the frequency; and represents the time step.
-
- a data acquisition module configured to acquire a current state of the robot, and a phase and a frequency generated by the neural oscillator; and
- a control module configured to obtain a control instruction according to the acquired current state, phase and frequency and a preset reinforcement learning network so as to control the robot.
-
- where {circumflex over (q)}t−1 is a target joint position at a previous moment.
-
- where ϕ∈[0,2π) represents a current leg is in a supporting phase when ϕ∈[0,π), and in a swinging phase when ϕ∈[π,2π); ϕt−1 represents a frequency at the previous moment; f represents the frequency; T represents the time step; and % represents a remainder operation.
2. Lateral Velocity:
3. Angular Rate:
4. Balance:
5. Body Twist:
6. Sole Sideslip:
7. Sole Support:
8. Sole Empty:
9. z Axis Velocity of Sole:
10. Joint Restrictions:
11. Joint Torque:
12. Joint Velocity:
13. Smooth Strategy Output:
14. RG Frequency:
15. RG Phase:
| TABLE 1 |
| Upper and lower limits of randomized |
| physical parameters and sensor noise |
| Parameter name | Lower limit | Upper limit | |||
| Mass | 80% | 120% | |
| Inertial | 50% | 150% |
| Delay | 16 | ms | 28 | ms |
| Power ratio of motor | 80% | 120% | |
| Terrain friction | 0.2 | 1 |
| Linear velocity noise | −0.08 | m/s | 0.08 | m/s | ||
| Angular rate noise | −0.3 | rad/s | 0.3 | rad/s | ||
| Rotation angle noise | −0.08 | rad | 0.08 | rad | ||
| Joint position noise | −0.01 | rad | 0.01 | rad | ||
| Joint velocity noise | −1.2 | rad/s | 1.2 | rad/s | ||
| TABLE 2 |
| PPO hyperparameter settings |
| Hyperparameter | Value | ||
| Batch size | 32768(1024 × 32) | ||
| Mini-batch size | 8192(1024 × 8) | ||
| Clip range | 0.2 | ||
| Entropy efficient | 0.0025 | ||
| Learning rate | 3.5e−4 | ||
| Discount factor | 0.992 | ||
| GAE discount factor | 0.95 | ||
| Desired KL-divergence | 0.015 | ||
-
- a data acquisition module configured to acquire a current state of the robot, and a phase and a frequency generated by the neural oscillator; and
- a control module configured to obtain a control instruction according to the acquired current state, phase and frequency and a preset reinforcement learning network so as to control the robot.
Claims (8)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210334049.5 | 2022-03-31 | ||
| CN202210334049.5A CN114740875B (en) | 2022-03-31 | 2022-03-31 | Robot rhythmic motion control method and system based on neural oscillator |
| CN2022103340495 | 2022-03-31 | ||
| PCT/CN2022/125984 WO2023184933A1 (en) | 2022-03-31 | 2022-10-18 | Neural oscillator-based method and system for controlling rhythmic motion of robot |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20250021107A1 US20250021107A1 (en) | 2025-01-16 |
| US12372981B2 true US12372981B2 (en) | 2025-07-29 |
Family
ID=82279146
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US18/266,643 Active US12372981B2 (en) | 2022-03-31 | 2022-10-18 | Method and system for rhythmic motion control of robot based on neural oscillator |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US12372981B2 (en) |
| CN (1) | CN114740875B (en) |
| WO (1) | WO2023184933A1 (en) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114740875B (en) | 2022-03-31 | 2024-08-02 | 山东大学 | Robot rhythmic motion control method and system based on neural oscillator |
| CN114859737B (en) * | 2022-07-08 | 2022-09-27 | 中国科学院自动化研究所 | Quadruped robot gait transition method, device, equipment and medium |
| CN116788384A (en) * | 2023-05-08 | 2023-09-22 | 广西电网有限责任公司北海供电局 | Multimode operation foot type inspection robot and motion control method |
| CN117565037B (en) * | 2023-11-22 | 2025-09-23 | 西北工业大学 | Real-time motion control method for crawler robots based on phase domain synthesis |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2006289602A (en) | 2005-03-17 | 2006-10-26 | Sony Corp | Robot apparatus and control method thereof |
| KR101100657B1 (en) * | 2003-08-25 | 2012-01-03 | 가부시키가이샤 고쿠사이 덴키 츠신 기소 기주츠 겐큐쇼 | Robot device and attitude control method of robot |
| US20130138020A1 (en) | 2011-11-30 | 2013-05-30 | Honda Motor Co., Ltd | Walking assist device |
| CN108772836A (en) | 2018-05-08 | 2018-11-09 | 东南大学 | Backbone type multi-foot robot based on CPG and its bionic movement control method |
| CN109866904A (en) | 2019-04-09 | 2019-06-11 | 哈尔滨工程大学 | A kind of movement of bionical jellyfish class underwater robot and method for control speed |
| CN110286592A (en) | 2019-06-28 | 2019-09-27 | 山东建筑大学 | A method and system for multimodal motion of robotic fish based on BP neural network |
| CN113616193A (en) | 2021-07-07 | 2021-11-09 | 燕山大学 | Rhythm coordination characteristic characterization method based on self-adaptive CPG parameter identification |
| CN114740875A (en) | 2022-03-31 | 2022-07-12 | 山东大学 | Robot rhythmic motion control method and system based on neural oscillator |
| CN114047697B (en) * | 2021-11-05 | 2023-08-25 | 河南科技大学 | Four-foot robot balance inverted pendulum control method based on deep reinforcement learning |
| CN116795123A (en) * | 2022-03-12 | 2023-09-22 | 兰州大学 | A method for quadruped robots to adapt to complex terrain |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12172309B2 (en) * | 2018-04-22 | 2024-12-24 | Google Llc | Systems and methods for learning agile locomotion for multiped robots |
| CN111208822A (en) * | 2020-02-17 | 2020-05-29 | 清华大学深圳国际研究生院 | Quadruped robot gait control method based on reinforcement learning and CPG controller |
| CN113093779B (en) * | 2021-03-25 | 2022-06-07 | 山东大学 | Robot motion control method and system based on deep reinforcement learning |
-
2022
- 2022-03-31 CN CN202210334049.5A patent/CN114740875B/en active Active
- 2022-10-18 US US18/266,643 patent/US12372981B2/en active Active
- 2022-10-18 WO PCT/CN2022/125984 patent/WO2023184933A1/en not_active Ceased
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR101100657B1 (en) * | 2003-08-25 | 2012-01-03 | 가부시키가이샤 고쿠사이 덴키 츠신 기소 기주츠 겐큐쇼 | Robot device and attitude control method of robot |
| JP2006289602A (en) | 2005-03-17 | 2006-10-26 | Sony Corp | Robot apparatus and control method thereof |
| US20130138020A1 (en) | 2011-11-30 | 2013-05-30 | Honda Motor Co., Ltd | Walking assist device |
| CN108772836A (en) | 2018-05-08 | 2018-11-09 | 东南大学 | Backbone type multi-foot robot based on CPG and its bionic movement control method |
| CN109866904A (en) | 2019-04-09 | 2019-06-11 | 哈尔滨工程大学 | A kind of movement of bionical jellyfish class underwater robot and method for control speed |
| CN110286592A (en) | 2019-06-28 | 2019-09-27 | 山东建筑大学 | A method and system for multimodal motion of robotic fish based on BP neural network |
| CN113616193A (en) | 2021-07-07 | 2021-11-09 | 燕山大学 | Rhythm coordination characteristic characterization method based on self-adaptive CPG parameter identification |
| CN114047697B (en) * | 2021-11-05 | 2023-08-25 | 河南科技大学 | Four-foot robot balance inverted pendulum control method based on deep reinforcement learning |
| CN116795123A (en) * | 2022-03-12 | 2023-09-22 | 兰州大学 | A method for quadruped robots to adapt to complex terrain |
| CN114740875A (en) | 2022-03-31 | 2022-07-12 | 山东大学 | Robot rhythmic motion control method and system based on neural oscillator |
Non-Patent Citations (5)
| Title |
|---|
| Dec. 12, 2022 International Search Report issued in International Patent Application No. PCT/CN2022/125984. |
| Dec. 12, 2022 Written Opinion issued in International Patent Application No. PCT/CN2022/125984. |
| Merged Original and Translation of Publication CN 114047697 B (Year: 2021). * |
| Merged Original and Translation of Publication CN 116795123 A (Year: 2022). * |
| Merged Original and Translation of Publication KR 101100657 B1 (Year: 2004). * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2023184933A1 (en) | 2023-10-05 |
| CN114740875A (en) | 2022-07-12 |
| US20250021107A1 (en) | 2025-01-16 |
| CN114740875B (en) | 2024-08-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12372981B2 (en) | Method and system for rhythmic motion control of robot based on neural oscillator | |
| US7657345B2 (en) | Robot and attitude control method of robot | |
| Abreu et al. | Learning low level skills from scratch for humanoid robot soccer using deep reinforcement learning | |
| US20070220637A1 (en) | Robot apparatus and method of controlling the same | |
| CN113093779B (en) | Robot motion control method and system based on deep reinforcement learning | |
| Sheng et al. | Bio-inspired rhythmic locomotion for quadruped robots | |
| Li et al. | Ai-cpg: Adaptive imitated central pattern generators for bipedal locomotion learned through reinforced reflex neural networks | |
| CN108469732A (en) | Robot working space's adaptability traveling control system based on DMP and method | |
| Schumacher et al. | Natural and robust walking using reinforcement learning without demonstrations in high-dimensional musculoskeletal models | |
| Liu et al. | Rhythmic-reflex hybrid adaptive walking control of biped robot | |
| Chen et al. | Deep reinforcement learning based co-optimization of morphology and gait for small-scale legged robot | |
| Jiang et al. | Stable skill improvement of quadruped robot based on privileged information and curriculum guidance | |
| Bi et al. | Efficient Dynamic Locomotion of Quadruped Robot via Adaptive Diagonal Gait: J. Bi et al. | |
| Liu et al. | Workspace trajectory generation method for humanoid adaptive walking with dynamic motion primitives | |
| Hitomi et al. | Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot | |
| Bachar | Developing controllers for biped humanoid locomotion | |
| Geng et al. | Planar biped walking with an equilibrium point controller and state machines | |
| CN117311271A (en) | Deep reinforcement learning robot motion control method and system based on prior knowledge | |
| Li et al. | Terrain-guided symmetric locomotion generation for quadrupedal robots via reinforcement learning | |
| Huang et al. | Smooth stride length change of rat robot with a compliant actuated spine based on cpg controller | |
| Teixeira et al. | Biped locomotion-improvement and adaptation | |
| Berseth | Scalable deep reinforcement learning for physics-based motion control | |
| CN119002232B (en) | Humanoid robot gait control method based on double-stage training strategy | |
| Haserbek et al. | Model-free end-to-end learning of agile quadrupedal locomotion over challenging terrain | |
| Liu et al. | A Synthesized Neural Control System for Bioinspired Robots to Achieve Diverse Locomotion |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: SHANDONG UNIVERSITY, CHINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ZHANG, WEI;CHEN, YANYUN;SHENG, JIAPENG;AND OTHERS;SIGNING DATES FROM 20230601 TO 20230605;REEL/FRAME:063919/0765 |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |