Our MI-Motion dataset comprises approximately 167k frames of motion sequences, each involving 3 to 6 subjects. Each subject is represented by 20 3D body keypoints and interacts with other individuals at varying interaction levels. To simulate various activity scenes, we divide the dataset into five subsets: park, street, indoor, special locations, and complex crowd.
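To make the layout described above concrete, here is a minimal sketch of how one sequence could be held in memory. Only the 20 keypoints, the 3 to 6 subjects, and the five subset names come from the description; the `Sequence` class, the `make_dummy_sequence` helper, and the assumption that the number of subjects stays constant within a sequence are hypothetical and do not reflect the dataset's actual file format. Please see the README for the real format.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Values taken from the dataset description.
NUM_KEYPOINTS = 20
MIN_SUBJECTS, MAX_SUBJECTS = 3, 6
SUBSETS = ("park", "street", "indoor", "special locations", "complex crowd")

Point3D = Tuple[float, float, float]
Pose = List[Point3D]   # one subject in one frame: 20 keypoints
Frame = List[Pose]     # one frame: all subjects in the scene


@dataclass
class Sequence:
    """Hypothetical container, not the dataset's actual file format."""
    subset: str
    frames: List[Frame]

    def validate(self) -> None:
        """Raise ValueError if the sequence does not match the description."""
        if self.subset not in SUBSETS:
            raise ValueError(f"unknown subset: {self.subset!r}")
        if not self.frames:
            raise ValueError("sequence has no frames")
        # Assumption: the subject count is fixed within one sequence.
        num_subjects = len(self.frames[0])
        if not MIN_SUBJECTS <= num_subjects <= MAX_SUBJECTS:
            raise ValueError(f"expected 3 to 6 subjects, got {num_subjects}")
        for t, frame in enumerate(self.frames):
            if len(frame) != num_subjects:
                raise ValueError(f"frame {t}: subject count changed")
            for pose in frame:
                if len(pose) != NUM_KEYPOINTS:
                    raise ValueError(f"frame {t}: expected 20 keypoints")
                if any(len(point) != 3 for point in pose):
                    raise ValueError(f"frame {t}: keypoints must be 3D")

    @property
    def shape(self) -> Tuple[int, int, int, int]:
        """(frames, subjects, keypoints, coordinates)."""
        return (len(self.frames), len(self.frames[0]), NUM_KEYPOINTS, 3)


def make_dummy_sequence(subset: str, num_frames: int, num_subjects: int) -> Sequence:
    """Build an all-zero placeholder sequence for illustration only."""
    pose = [(0.0, 0.0, 0.0)] * NUM_KEYPOINTS
    frames = [[list(pose) for _ in range(num_subjects)] for _ in range(num_frames)]
    return Sequence(subset, frames)


if __name__ == "__main__":
    seq = make_dummy_sequence("park", num_frames=5, num_subjects=4)
    seq.validate()
    print(seq.shape)
```

Under these assumptions, a sequence maps naturally onto a 4D array of shape (frames, subjects, 20, 3), which is the form most prediction baselines would likely consume after loading.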
The MI-Motion dataset can be downloaded from Google Drive and Baidu Disk. You can also download the pretrained models of all the baselines from Google Drive. More details can be found on the Project Page. For further details on the data and for training instructions, please see the README.
Please see our benchmark results on short-term and long-term prediction in the paper. We provide our benchmark code and the other baseline code in the supplementary material. For ultra-long-term prediction, we attach the visualization results below.
Method | Park | Indoor | Street | Special Locations | Complex Crowd
---|---|---|---|---|---
HRI | | | | |
MRT | | | | |
TBIFormer | | | | |
SocialTGCN | | | | |
GT | | | | |