Object detection Using Detection Transformer (Detr) on custom dataset

English Channel Video titled: "Object detection Using Detection Transformer (Detr) on custom dataset" 📌 **What You’ll Learn:** ✅ Step-by-step implementation of **DETR on a custom dataset** ✅ How CNN features are fed into a **Transformer encoder-decoder architecture** ✅ Understanding **self-attention, cross-attention, and positional encoding** ✅ Using **bipartite graph matching** for ground truth assignment ✅ Practical example using a **bone fracture dataset** from Roboflow 🔗 **Dataset Used:** https://universe.roboflow.com/roboflow-100/bone-fracture-7fylg

Image Classification Using Swin Transformer

English Channel Video titled : "Image Classification Using Swin Transformer" 📌 **What You’ll Learn:** ✅ Step-by-step implementation of a **custom Swin Transformer model** ✅ How Swin Transformers combine **global attention with local feature extraction** ✅ Understanding **shifted windows** for efficient image processing ✅ How to handle **large-scale image datasets** ✅ Applications in **image classification, object detection, and segmentation**

YouTube (EN)📥 Download ZIP

Vision Transformer for Image Classification Using transfer learning

English Channel Video titled: "Vision Transformer for Image Classification Using transfer learning" 📌 **What You’ll Learn:** ✅ Step-by-step **implementation of ViT using transfer learning** ✅ How Vision Transformers apply **Transformer architecture from NLP to computer vision** ✅ Understanding **patch embeddings, positional encoding, and self-attention** ✅ How ViT compares with conventional CNNs on image classification tasks ✅ Tips for using **transfer learning** to improve model performance

YouTube (EN)📥 Download ZIP

Image Classification Using Vision Transformer

English Channel Video titled : "Image Classification Using Vision Transformer | ViTs" In this tutorial, we explore how **Vision Transformers**, introduced by the Google Brain team in 2020, can achieve **competitive performance on image classification tasks**, often rivaling traditional CNNs. 📌 **What You’ll Learn:** ✅ How Vision Transformers apply **Transformer architecture from NLP to computer vision** ✅ Step-by-step **implementation for image classification** ✅ Understanding **patch embeddings, positional encoding, and self-attention** ✅ How ViT compares with traditional CNNs on benchmark datasets

YouTube (EN)📥 Download ZIP

Image Classification Using Vision Transformer | ViTs on Google Colab

📌 **What You’ll Learn:** ✅ Setting up a **Google Colab environment** for ViT training ✅ Preparing and loading your **custom image dataset** ✅ Training a **Vision Transformer (ViT) model** step by step ✅ Evaluating model performance and metrics ✅ Tips for improving accuracy and efficiency

YouTube (EN)YouTube (HI)📥 Download ZIP

Transformers for Computer Vision