Skip to content

Commit 960111b

Browse files
committed
feat: add Vision Transformer (ViT) implementation for image classification
- Implement complete ViT architecture with patch embedding - Add positional encoding with learnable CLS token - Include scaled dot-product attention mechanism - Implement transformer encoder blocks with layer normalization - Add feed-forward network with GELU activation - Include comprehensive docstrings and type hints - Add doctests for all functions - Provide example usage demonstrating the complete pipeline Fixes #13326
1 parent 96aa436 commit 960111b

1 file changed

Lines changed: 425 additions & 0 deletions

File tree

0 commit comments

Comments
 (0)