Vision-language model for generating Arabic image captions using Bidirectional Transformers (BiT) and advanced feature fusion.
-
Updated
Jul 17, 2025 - Jupyter Notebook
Vision-language model for generating Arabic image captions using Bidirectional Transformers (BiT) and advanced feature fusion.
Add a description, image, and links to the bertforimagecaptioning topic page so that developers can more easily learn about it.
To associate your repository with the bertforimagecaptioning topic, visit your repo's landing page and select "manage topics."