Image Captioning Demo

Try it yourself! This is a ~980K-parameter image captioning transformer that runs entirely in your browser in pure JavaScript.

The model was trained on the Flickr8k dataset to generate captions for images. It works best with:

Read the blog post to learn how this model was built.


Drag & drop an image here, or click to select

Or try a sample image:

Dog running

Model: ~980K parameters | Image: 128x128 | Character-level tokenization | Pure JavaScript
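
To illustrate what character-level tokenization means in practice, here is a minimal sketch in plain JavaScript. It is not the demo's actual code: the special tokens (`<pad>`, `<start>`, `<end>`), the helper names (`buildVocab`, `encode`, `decode`), and the two sample captions are all assumptions made for illustration. The idea is simply that each character of a caption maps to one integer id, so the vocabulary stays tiny compared with a word-level one.

```javascript
// Hypothetical special tokens; the demo's real vocabulary is not shown in this page.
const SPECIALS = ["<pad>", "<start>", "<end>"];

// Build a vocabulary from every distinct character seen in the captions.
function buildVocab(captions) {
  // Sort the characters so id assignment is stable across runs.
  const chars = [...new Set(captions.join(""))].sort();
  const itos = [...SPECIALS, ...chars];                       // id -> token
  const stoi = new Map(itos.map((token, id) => [token, id])); // token -> id
  return { itos, stoi };
}

// Turn a caption into ids, wrapped in <start> and <end>.
function encode(text, vocab) {
  const ids = [vocab.stoi.get("<start>")];
  for (const ch of text) {
    if (!vocab.stoi.has(ch)) throw new Error(`Unknown character: ${ch}`);
    ids.push(vocab.stoi.get(ch));
  }
  ids.push(vocab.stoi.get("<end>"));
  return ids;
}

// Turn ids back into text, stopping at <end> and skipping <start>/<pad>.
function decode(ids, vocab) {
  let text = "";
  for (const id of ids) {
    const token = vocab.itos[id];
    if (token === "<end>") break;
    if (token === "<start>" || token === "<pad>") continue;
    text += token;
  }
  return text;
}

// Made-up captions, used only to build a small example vocabulary.
const vocab = buildVocab(["a dog runs", "a man sits"]);
const ids = encode("a dog", vocab);
console.log(ids.length);         // 7: <start> + 5 characters + <end>
console.log(decode(ids, vocab)); // "a dog"
```

At generation time a captioning model of this kind would typically emit one character id per step until it produces the end token, and a function like `decode` would turn the ids back into the caption string.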