Amazing work. How do you implement the multimodality in your decoder?
Amazing work. How do you implement the multimodality in your decoder?