Zhiyuan Xiao

| CV | Email | Github |

I completed my Bachelor's in Mathematics and Applied Mathematics at South China University of Technology (SCUT).

I have been a Research Assistant at School of Aeronautics and Astronautics, Sun Yat-Sen University (SYSU), supervised by Assoc Prof. Qingrui Zhang.

I am currently finishing my master degree in National University of Singapore (NUS).


  Publications

SYNLOCO: Synthesizing Central Pattern Generator and Reinforcement Learning for Quadruped Locomotion
Xinyu Zhang*, Zhiyuan Xiao*, Qingrui Zhang, Wei Pan
Accepted by CDC 2024

webpage | pdf | abstract | bibtex | arXiv | video

The Central Pattern Generator (CPG) is adept at generating rhythmic gait patterns characterized by consistent timing and adequate foot clearance. Yet, its open-loop configuration often compromises the system's control performance in response to environmental variations. On the other hand, Reinforcement Learning (RL), celebrated for its model-free properties, has gained significant traction in robotics due to its inherent adaptability and robustness. However, initiating traditional RL approaches from the ground up presents computational challenges and a heightened risk of converging to suboptimal local minima. In this paper, we propose an innovative quadruped locomotion framework, SYNLOCO, by synthesizing CPG and RL that can ingeniously integrate the strengths of both methods, enabling the development of a locomotion controller that is both stable and natural. Furthermore, we introduce a set of performance-driven reward metrics that augment the learning of locomotion control. To optimize the learning trajectory of SYNLOCO, a two-phased training strategy is presented. Our empirical evaluation, conducted on a Unitree GO1 robot under varied conditions -- including distinct velocities, terrains, and payload capacities -- showcases SYNLOCO's ability to produce consistent and clear-footed gaits across diverse scenarios. The developed controller exhibits resilience against substantial parameter variations, underscoring its potential for robust real-world applications.

  @misc{zhang_synloco_2023,
        title = {{SYNLOCO}: Synthesizing Central Pattern Generator and 
                 Reinforcement Learning for Quadruped Locomotion},
        url = {http://arxiv.org/abs/2310.06606},
        shorttitle = {{SYNLOCO}},
        publisher = {{arXiv}},
        author = {Zhang, Xinyu and Xiao, Zhiyuan and
                  Zhang, Qingrui and Pan, Wei},
        urldate = {2023-10-25},
        date = {2023-10-10},
        eprinttype = {arxiv},
        eprint = {2310.06606 [cs]},
        keywords = {Computer Science - Robotics}
  }

PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots
Zhiyuan Xiao, Xinyu Zhang, Xiang Zhou, Qingrui Zhang
Accepted by IROS 2024

webpage | pdf | abstract | bibtex | arXiv | video

Numerous locomotion controllers have been designed based on Reinforcement Learning (RL) to facilitate blind quadrupedal locomotion traversing challenging terrains. Nevertheless, locomotion control is still a challenging task for quadruped robots traversing diverse terrains amidst unforeseen disturbances. Recently, privileged learning has been employed to learn reliable and robust quadrupedal locomotion over various terrains based on a teacher-student architecture. However, its one-encoder structure is not adequate in addressing external force perturbations. The student policy would experience inevitable performance degradation due to the feature embedding discrepancy between the feature encoder of the teacher policy and the one of the student policy. Hence, this paper presents a privileged learning framework with multiple feature encoders and a residual policy network for robust and reliable quadruped locomotion subject to various external perturbations. The multi-encoder structure can decouple latent features from different privileged information, ultimately leading to enhanced performance of the learned policy in terms of robustness, stability, and reliability. The efficiency of the proposed feature encoding module is analyzed in depth using extensive simulation data. The introduction of the residual policy network helps mitigate the performance degradation experienced by the student policy that attempts to clone the behaviors of a teacher policy. The proposed framework is evaluated on a Unitree GO1 robot, showcasing its performance enhancement over the state-of-the-art privileged learning algorithm through extensive experiments conducted on diverse terrains. Ablation studies are conducted to illustrate the efficiency of the residual policy network.

  @INPROCEEDINGS{xiao_paloco_2024,
    author={Xiao, Zhiyuan and Zhang, Xinyu and Zhou, Xiang and Zhang, Qingrui},
    booktitle={2024 IEEE/RSJ International Conference on
               Intelligent Robots and Systems (IROS)}, 
    title={PA-LOCO: Learning Perturbation-Adaptive Locomotion for Quadruped Robots}, 
    year={2024},
    pages={9110-9115},
    doi={10.1109/IROS58592.2024.10801753}
  }

 Projects 

Learning Robust Perceptive Locomotion for Quadrupedal Robots
Zhiyuan Xiao
Reproduction
Last update: 2025-01-26

webpage | reference | abstract

Legged robots that can operate autonomously in remote and hazardous environments will greatly increase opportunities for exploration into underexplored areas. Exteroceptive perception is crucial for fast and energy-efficient locomotion: Perceiving the terrain before making contact with it enables planning and adaptation of the gait ahead of time to maintain speed and stability. However, using exteroceptive perception robustly for locomotion has remained a grand challenge in robotics. Snow, vegetation, and water visually appear as obstacles on which the robot cannot step or are missing altogether due to high reflectance. In addition, depth perception can degrade due to difficult lighting, dust, fog, reflective or transparent surfaces, sensor occlusion, and more. For this reason, the most robust and general solutions to legged locomotion to date rely solely on proprioception. This severely limits locomotion speed because the robot has to physically feel out the terrain before adapting its gait accordingly. Here, we present a robust and general solution to integrating exteroceptive and proprioceptive perception for legged locomotion. We leverage an attention-based recurrent encoder that integrates proprioceptive and exteroceptive input. The encoder is trained end to end and learns to seamlessly combine the different perception modalities without resorting to heuristics. The result is a legged locomotion controller with high robustness and speed. The controller was tested in a variety of challenging natural and urban environments over multiple seasons and completed an hour-long hike in the Alps in the time recommended for human hikers.


      

 Bibliography Library 
sym

Bibliography Library for Quadruped Locomotion
Zhiyuan Xiao, Xinyu Zhang
Last update: 2024-03-15

pdf | abstract

Quadruped bibliography library


  Nostalgia
sym

Trio
Jan. 2023. Shot in Nanao Island

origin | fold

Waste of time. Nothing folded here.

sym

Mask
Oct. 2022. Shot at Healthy Code Bridge

origin | fold

Photographor: Zhamao the Great.

sym

Design of Class Uniform
Oct. 2020.

origin | fold

Don't click the 'fold' button again.



Website template from here and here