Try it yourself! This is a ~980K parameter image captioning transformer running entirely in your browser using pure JavaScript.
The model was trained on Flickr8k dataset and learned to generate captions for images. It works best with:
Read the blog post to learn how this model was built.
Drag & drop an image here, or click to select
Or try a sample image: