Research

Learning to Control for Capturing the Space Target

Time : Sept. 2021 - Present
Tsinghua Navigation and Control Lab (as researcher, advisor: prof. Tao Zhang)
Team members: Shengjie Wang, Yuxue Cao, Xiang Zheng

Abstract

* Developed a hierachical decoupling optimization algorithm to realize 6D-pose multi-target trajectory planning for the free-floating space robot.
* In order to reduce the complexity of exploration, the whole system consists of two layers : the high-level policy completes the collision-free trajectory planning of the end-effector pose; the low-level policy divides the pose reaching problem into two sub-problems (position and orientation). 
* Thanks to introducing the expert policy and hindlight experice replay, the policies reach the optimal quickly optimized by reinforcement learning.
* Our approach generalizes to capture a target satellite under different poses, despite being trained without the gripper.

Time : Mar. 2018 - Present
Beijing Advanced Innovation Center for Intelligent Robots and Systems (BAICIRS) (as researcher, advisor: prof. Qing Shi)
Team members: Shengjie Wang, Junhui Gao, Yuxuan Wang, Fansheng Meng, Yi Hao

Abstract

* Developed a miniature rat-inspired robot integrated with $12$ actuated DOFs and a size of $202\times55\times75 mm^3$ . The morphology and weight of the robotic rat is the same as a mature rat. The mechanical structure of the robotic rat makes full use of the connecting rod mechanism to transfer the movement. 
* Designed an optimal Central Pattern Generator (CPG) network through Particle Swarm Optimization (PSO) algorithm which guaranteed adjustable frequency and amplitude to determine gait pattern and stride frequency. This method solves how to choose the optimal parameters of the CPG network based on Kimura model.
* Designed a hierarchical multi-modal locomotion controller for a rat-inspired quadruped robot based on optimization with bionic constraints. Consisting of three layers, our method can execute multiple modes including crouch-standing action, walking forward, turning around, crawling and recovery from a fall.
* Compared with the quadruped robots in similar size, the motion performance of our robot has some certain advantages. The maximum speed of the robotic rat reaches 15 cm/s, and the radius of turning is less than 6 cm. Except for them, it can carry loads that are 60% of its weight.

Multi-Target Trajectory Planning Strategy of a 6-DoF Free-Floating Space Robot

Time : Sept. 2019 - Mar. 2021
Tsinghua Navigation and Control Lab (as researcher, advisor: prof. Tao Zhang)
Team members: Shengjie Wang, Yuxue Cao, Xiang Zheng

Abstract

* Developed an identification algorithm of the target position and estimated the relative pose between the space robot and target satellite using visual simultaneous localization and mapping (SLAM).
* Designed a robust multi-target capture strategy of a 6-DoF free-floating space robot within a large initial space; Proposed the Action Ensembles based on Poisson distribution (AEP) method to improve the Proximal Policy Optimization (PPO) algorithm, boosting the terminal capture precision under a certain low threshold of error.
* Evaluated our approach on three tasks: grasping objects in the case of different masses of the base satellite, external disturbances at joints, and even single joint failure, without any further fine-tuning.

An End-To-End Trajectory Planning for a 6-DoF Free-floating Space Robot

Time : Sept. 2020 - Present
Tsinghua Navigation and Control Lab (as researcher, advisor: prof. Tao Zhang)
Team members: Shengjie Wang, Yuxue Cao, Xiang Zheng

Abstract

* Designed an end-to-end planning system of perception, planning and control, which is divided into three layers of convolution network and three layers of perceptron network in policy layer, and velocity tracking PD controller in driver layer.
* Implemented a deep reinforcement learning algorithm based on maximum entropy for the learning of parameters in the network, and had better performance comparing with other baseline algorithms.
* Evaluated the kinematic and dynamic characteristics of the trained strategy and carried out antidisturbance experiments to test its robustness.

Development of a Detection and Pose Estimation System for Free-floating Space Robots Based on Deep Learning

Time : Mar. 2019 – Present
Tsinghua Navigation and Control Lab (as researcher, advisor: prof. Tao Zhang)
Team members: Shaopeng Li, Shengjie Wang, Wenxuan Fan

Abstract

* Proposed a dataset construction system of spatial target perception tasks based on virtual environments, constructs dataset of spatial target feature detection and pose estimation suitable for this research task.
* Designed a method based on the Mask R-CNN to the feature detection of spatial non-cooperative target, which can complete the features’ pixel-level segmentation at the same time of detection.
* Designed a supervised learning method to estimate the pose of target satellite, and a semi-supervised learning method, which uses pose data for supervised learning and the reconstruction error between images to learn the relationship and internal relationship between images, so as to enhance the universality of the algorithm.

Real-Time Mask Face Detection On the CPU Platform

Time : Mar. 2020 – Jun. 2020
Course Project (Teacher: Prof. Changshui Zhang)
Team members: Shengjie Wang

Abstract

* Designed two fast mask face detection methods based on the MobileNetV2-SSD model and the YOLOv3-tiny model.
* Evaluated two methods on accuracy and detecting speed, and studied the effects of network super parameters, data enhancement, optimizer, number of anchor boxes, NMS type and picture size on the performance of the model.
* Considering the robustness of algorithm, we introduced some noises on images to enhance data and achieve the accuracy-robustness balance.

code

An Effective Rain Prediction Method based on Seq2Seq Model with Attention Mechanism

Time : Mar. 2020 – Jun. 2020
Course Project (Teacher: Prof. Changshui Zhang)
Team members: Shengjie Wang, Yuguang Dong

Abstract

* Designed a rain prediction method based on Seq2Seq model with attention mechanism 
* Designed other machine learning methods to predict the volume of rain, such as SVR, MLP, LSTM and XGBoost.
* Evaluated our method on the testing dataset and visualized the attention map to illustrate the different importance of various factors.

code

An Intelligent Player in 1v1 Holden Game based on Deep Q-learning

Time : Oct. 2019 – Mar. 2020
Personal Business
Team members: Shengjie Wang, Tianyu Wang

Abstract

* Developed a platform based on python for 1V1 holdem game.
* Designed an agent based on Double Deep Q-learning to compete another agent based on Monte Carlo method. 
* Testified our method on the environment where the opponent knows the fully observation, and results show the winning percentage of our method is approximately the opponent.

code

A Passively-Stable Four-legged Robot Capable of Leaping Obstacle

Time : Oct. 2017 – Oct. 2018
Beijing Advanced Innovation Center for Intelligent Robots and Systems (BAICIRS) (as researcher, advisor: prof. Qing Shi)
Outstanding course design thesis in Beijing Institute of Technology
Team members: Shengjie Wang, Haoxiang Qi, Cheng Chen

Abstract

* Designed a special foot structure, which is composed of three semicircular arcs of carbon fiber. 
* Designed an ejection mechanism based on spring energy storage, in which the energy of spring is stored by rack and pinion mechanism.
* Developed a control system using WiFi communication, integrated PID control based on position tracking, and realized the modes including walking, turning and leaping obstacle.

A Two-wheel Robot with Adjustable Wheelbase Used for Pipeline Detection

Time : Sept. 2016 – Sept. 2017
Beijing Advanced Innovation Center for Intelligent Robots and Systems (BAICIRS) (as researcher, advisor: prof. Qing Shi)
Outstanding course design thesis in Beijing Institute of Technology
Team members: Shengjie Wang, Zhihong Xia, Daotong Zhang

Abstract

* Designed a two-wheel robot, and implemented a method of changing the wheelbase based on rack and pinion mechanism. 
* Developed a control system based on WiFi communication, integrated the function of remote image transmission, and testified some experiments in narrow pipelines.

Shengjie Wang

Research

Learning to Control for Capturing the Space Target

Development of a Miniature Quadruped Robotic Rat Capable of Multi-Modal Motions

Multi-Target Trajectory Planning Strategy of a 6-DoF Free-Floating Space Robot

An End-To-End Trajectory Planning for a 6-DoF Free-floating Space Robot

Development of a Detection and Pose Estimation System for Free-floating Space Robots Based on Deep Learning

Real-Time Mask Face Detection On the CPU Platform

An Effective Rain Prediction Method based on Seq2Seq Model with Attention Mechanism

An Intelligent Player in 1v1 Holden Game based on Deep Q-learning

A Passively-Stable Four-legged Robot Capable of Leaping Obstacle

A Two-wheel Robot with Adjustable Wheelbase Used for Pipeline Detection