@KingsmanVince
@kbin.socialindeed it would be great if the authors did so. I personally found some non-official implementations:
IIRC DeTr generate a sequence to predict boxes of objects. I think this paradigm can be applied to such models. "Think before you locate" could be a new path to explore.