Pure Language Processing and Pc Imaginative and prescient was once two utterly totally different fields. Effectively, at the very least again after I began to be taught machine studying and deep studying, I really feel like there are a number of paths to comply with, and every of them, together with NLP and Pc Imaginative and prescient, directs me to a totally totally different world. Over time, we will now observe that AI turns into increasingly superior, with the intersection between a number of fields of research getting extra frequent, together with the 2 I simply talked about.
As we speak, many language fashions have functionality to generate photos based mostly on the given immediate. That’s one instance of the bridge between NLP and Pc Imaginative and prescient. However I suppose I’ll reserve it for my upcoming article because it is a little more complicated. As an alternative, on this article I’m going to debate the easier one: picture captioning. Because the identify suggests, that is basically a method the place a selected mannequin accepts a picture and returns a textual content that describes the enter picture.
One of many earliest papers on this matter is the one titled “Present and Inform: A Neural Picture Caption Generator” written by Vinyals et al. again in 2015 [1]. On this article, I’ll concentrate on implementing the deep studying mannequin proposed within the…