Aerospace Engineer and Machine Learning Researcher, PhD student at KTH Royal Institute of Technology
Highlights
- Pro
Pinned Loading
-
NousResearch/hermes-agent
NousResearch/hermes-agent PublicThe agent that grows with you
-
openai/baselines
openai/baselines PublicOpenAI Baselines: high-quality implementations of reinforcement learning algorithms
-
lasgroup/SDPO
lasgroup/SDPO PublicReinforcement Learning via Self-Distillation (SDPO)
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




