💼 Responsibilities:
— Fine-tune & optimize LLMs
— Full pipeline: dataset → training → deployment
— Model serving & inference optimization
— Monitoring (metrics, performance)
— Collaboration with the team & PM
🧩 Requirements:
— 3+ years of hands-on LLM/NLP experience
— End-to-end training pipeline experience
— Model optimization & quantization
— Grafana, Prometheus
— Python, PyTorch
— Git, RunPod, AWS (SageMaker + core services)
— English B2+ (communication with the client)
— Ukrainian or Russian C1+ / Native
— Residence outside Russia & Belarus
⭐️ Nice to have:
— Hugging Face (Transformers, Datasets), LoRA/QLoRA, quantization (bitsandbytes / AutoGPTQ), CUDA 12.x, FastAPI, vLLM / Triton, Docker / K8s, vector DBs (FAISS / Milvus), RAG pipelines
🎁 We offer:
— Fully remote, full-time
— Salary: $3000–$4000 (depends on experience)
— 20 paid vacation days + 5 sick days + additional days off
📩 Send your resume to @idigorice — replies within 1–3 days.