Commit 960111b

committed

feat: add Vision Transformer (ViT) implementation for image classification

- Implement complete ViT architecture with patch embedding - Add positional encoding with learnable CLS token - Include scaled dot-product attention mechanism - Implement transformer encoder blocks with layer normalization - Add feed-forward network with GELU activation - Include comprehensive docstrings and type hints - Add doctests for all functions - Provide example usage demonstrating the complete pipeline Fixes #13326

1 parent 96aa436 commit 960111bCopy full SHA for 960111b

1 file changed

computer_vision
- vision_transformer.py

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit 960111b

Uh oh!

File tree

0 commit comments