cs336
Here are 20 public repositories matching this topic...
Notebooks and assignment implementations from studying CS336, Spring 2026 😛
Updated May 16, 2026 - Jupyter Notebook
Implements CS336 Assignment 1 and builds a transformer model from scratch.
Updated Aug 1, 2025 - Python
CS336 Final Project - Vietnamese Legal Text Retrieval
Updated Feb 4, 2022 - Python
Rutgers CS:336 | Principles of Info and Data Management Spring 2022
Updated May 10, 2022 - Java
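CS336 Assignment 5: LLM alignment and reasoning reinforcement learning based on the Qwen2.5 model. Fully implements supervised fine-tuning (SFT) and Group Relative Policy Optimization (GRPO), and compares zero-shot, on-policy, and off-policy training and evaluation on the GSM8K dataset.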
Updated Apr 1, 2026 - Python
Updated Jan 17, 2021 - Java
Project for Information Retrieval Courses
Updated Jan 26, 2025 - Jupyter Notebook
Implementation of a decoder-only Transformer language model from scratch for CS336, featuring a byte-level BPE tokenizer, RoPE, multi-head self-attention, and a SwiGLU FFN. Trained on TinyStories to a validation loss of 1.39.
Updated Feb 25, 2026 - Python
Living comparison table of LLM architectural choices (norm, attention, MoE, positional encoding, and more) from the Original Transformer (2017) to frontier models (2026). Based on Harm de Vries's figure, Sebastian Raschka's Big LLM Architecture Comparison, and Tatsunori Hashimoto's Stanford CS 336 lecture.
Updated Mar 12, 2026
Stanford University's CS336 module - Assignment 1. Core language-modeling systems including BPE tokenization, transformer layers, optimization, data loading, checkpointing, and training utilities.
Updated May 26, 2026 - Jupyter Notebook
Naive-to-optimized BPE tokenizer — profiling-driven tweaks, parallel pre-tokenization, and smart merging for 7× faster training.
Updated Jul 9, 2025 - Python
Updated Jan 29, 2021
Updated Apr 10, 2026 - Python