Picture it in your mind: generating high level visual representations from textual descriptions