Aleksandr Sorokin TrupologDS

Aleksandr Sorokin

Research Engineer / ML Engineer focused on LLM post-training workflows, model evaluation, and reliable production ML systems.

I build reproducible Python/PyTorch experiments, Hugging Face-based training and evaluation pipelines, retrieval systems, and production-oriented ML services. My portfolio is organized around the kind of work I want to do more of: post-training, evaluation, robust ML infrastructure, and practical systems that can be inspected, rerun, and monitored.

Featured Work

Project	Why it is relevant
LLM Evaluation Harness	Lightweight evaluation harness for behavior regression checks, prompt/output datasets, rubric-style metrics, error taxonomy, and generated evaluation reports.
Russian LLM Pretraining and SFT	Experiment report and reproducible scaffold for tokenizer training, causal LM pretraining, LoRA SFT, model cards, and evaluation reporting.
Semantic Retrieval for arXiv Papers	Dense retrieval pipeline with BGE embeddings, FAISS indexing, MRR@5 evaluation, latency profiling, and error analysis.
Multi-Task Information Extraction on NEREL	PyTorch/Transformers multi-task model for NER plus document-level multi-label classification with threshold tuning and test analysis.
Text-to-Image Product Search with Fine-Tuned CLIP	CLIP fine-tuning and embedding-index workflow for text-to-image catalog retrieval.
ML Engineering Projects	Production ML portfolio covering Airflow, DVC, MLflow, FastAPI serving, monitoring, recommendation systems, and reproducible pipelines.

Focus Areas

LLM post-training: SFT workflows, LoRA adapters, tokenizer/data preparation, experiment reporting.
Model evaluation: task metrics, regression checks, error taxonomies, qualitative review loops.
Reliable ML systems: reproducible training, artifact hygiene, CI, typed Python modules, testable pipelines.
Production ML/MLOps: model serving, monitoring, orchestration, experiment tracking, and data validation.

Core Stack

Python, PyTorch, Hugging Face Transformers/Datasets, TRL, PEFT/LoRA, scikit-learn, pandas, NumPy, FAISS, MLflow, DVC, FastAPI, Docker, Airflow, SQL, GitHub Actions.

Contact

Email: sorocawrk@outlook.com
Kaggle: kaggle.com/trupologhelper
CV: available on request

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aleksandr Sorokin TrupologDS

Block or report TrupologDS

Aleksandr Sorokin

Featured Work

Focus Areas

Core Stack

Contact

Pinned Loading

Uh oh!