Research Engineer / ML Engineer focused on LLM post-training workflows, model evaluation, and reliable production ML systems.
I build reproducible Python/PyTorch experiments, Hugging Face-based training and evaluation pipelines, retrieval systems, and production-oriented ML services. My portfolio is organized around the kind of work I want to do more of: post-training, evaluation, robust ML infrastructure, and practical systems that can be inspected, rerun, and monitored.
| Project | Why it is relevant |
|---|---|
| LLM Evaluation Harness | Lightweight evaluation harness for behavior regression checks, prompt/output datasets, rubric-style metrics, error taxonomy, and generated evaluation reports. |
| Russian LLM Pretraining and SFT | Experiment report and reproducible scaffold for tokenizer training, causal LM pretraining, LoRA SFT, model cards, and evaluation reporting. |
| Semantic Retrieval for arXiv Papers | Dense retrieval pipeline with BGE embeddings, FAISS indexing, MRR@5 evaluation, latency profiling, and error analysis. |
| Multi-Task Information Extraction on NEREL | PyTorch/Transformers multi-task model for NER plus document-level multi-label classification with threshold tuning and test analysis. |
| Text-to-Image Product Search with Fine-Tuned CLIP | CLIP fine-tuning and embedding-index workflow for text-to-image catalog retrieval. |
| ML Engineering Projects | Production ML portfolio covering Airflow, DVC, MLflow, FastAPI serving, monitoring, recommendation systems, and reproducible pipelines. |
- LLM post-training: SFT workflows, LoRA adapters, tokenizer/data preparation, experiment reporting.
- Model evaluation: task metrics, regression checks, error taxonomies, qualitative review loops.
- Reliable ML systems: reproducible training, artifact hygiene, CI, typed Python modules, testable pipelines.
- Production ML/MLOps: model serving, monitoring, orchestration, experiment tracking, and data validation.
Python, PyTorch, Hugging Face Transformers/Datasets, TRL, PEFT/LoRA, scikit-learn, pandas, NumPy, FAISS, MLflow, DVC, FastAPI, Docker, Airflow, SQL, GitHub Actions.
- Email: sorocawrk@outlook.com
- Kaggle: kaggle.com/trupologhelper
- CV: available on request