Inverse Reinforcement Learning
A variant of Imitation Learning, also known as Inverse Optimal Control, in which a reward function is inferred from expert demonstrations rather than specified in advance.
It can also be thought of as a form of Preference-based RL, in which the demonstrated trajectories are assumed to be preferred to unseen ones, defining implicit preferences.
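The preference-based view can be made concrete with a small sketch. It assumes a reward that is linear in per-step features and a Bradley-Terry (logistic) preference model; neither choice comes from the text above, and the helper names (`feature_sum`, `preference_loss`, `fit_reward`) and the toy trajectories are hypothetical. Every demonstrated trajectory is treated as preferred to every non-demonstrated one, and the reward weights are fit by gradient descent on the resulting preference loss.

```python
import math

def feature_sum(trajectory):
    """Sum the per-step feature vectors of one trajectory."""
    return [sum(column) for column in zip(*trajectory)]

def pair_delta(demo, other):
    """Feature-sum difference between a demonstration and another trajectory."""
    return [a - b for a, b in zip(feature_sum(demo), feature_sum(other))]

def preference_loss(weights, demos, others):
    """Mean negative log-likelihood that each demo is preferred to each other trajectory."""
    total = 0.0
    for demo in demos:
        for other in others:
            diff = sum(w * x for w, x in zip(weights, pair_delta(demo, other)))
            total += math.log1p(math.exp(-diff))  # -log sigmoid(diff)
    return total / (len(demos) * len(others))

def fit_reward(demos, others, lr=0.1, steps=200):
    """Fit linear reward weights by plain gradient descent (illustrative, not tuned)."""
    dim = len(demos[0][0])
    weights = [0.0] * dim
    n_pairs = len(demos) * len(others)
    for _ in range(steps):
        grad = [0.0] * dim
        for demo in demos:
            for other in others:
                delta = pair_delta(demo, other)
                diff = sum(w * x for w, x in zip(weights, delta))
                scale = -1.0 / (1.0 + math.exp(diff))  # d/d(diff) of -log sigmoid(diff)
                grad = [g + scale * x for g, x in zip(grad, delta)]
        weights = [w - lr * g / n_pairs for w, g in zip(weights, grad)]
    return weights

# Toy data (assumed): each trajectory is a list of 2-dimensional step features.
demos = [[[1.0, 0.0]] * 3]                           # demonstrated behaviour
others = [[[0.0, 1.0]] * 3, [[0.5, 0.5]] * 3]        # stand-ins for unseen trajectories
weights = fit_reward(demos, others)
```

After fitting, the learned weights should assign a higher return to the demonstrated trajectory than to the alternatives. In practice the comparison trajectories would have to be sampled or generated somehow, since "unseen" trajectories are not available as data; the fixed list here is only a placeholder for that step.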