Dmytro Staroselskyi
Offer Dmytro work on your next project.
Rating
Language proficiency level
Skills and abilities
Programming
Services
Portfolio
-
Eval-Lab — Regression of prompts and models
AI & Machine LearningWeb dashboard for regression testing of prompts and models. Running a test set through two models/prompts — comparison based on 4 sub-scores.
Technically interesting aspects:
— LLM-as-judge through 5 providers (OpenRouter, Anthropic via tool-use, Gemini, Groq, mock)
… — 4 sub-scores for each case: correctness, relevance, completeness, prompt_quality
— Cap on final score for poor prompt — prevents strong model from masking poor prompt
— Per-provider throttle and retry with backoff + Retry-After
— Mock mode for running without API keys (CI-friendly, $0)
— Editing secrets in logs
Stack: FastAPI, async SQLAlchemy, Alembic, httpx, Pydantic, vanilla JS, Docker.
-
AI Research Agent on LangGraph
AI & Machine LearningAn agent on LangGraph that autonomously plans searches, simultaneously browsing the web and arXiv, synthesizes responses with sources.
Technically interesting aspects:
— Parallel fan-out/fan-in graph with a reducer on tool_errors (the graph does not crash on provider failure)
… — 4-provider search fallback: Tavily → SearchAPI → DuckDuckGo → Wikipedia
— Source-grounded synthesis: refuses to respond without sources (anti-hallucination)
— PostgreSQL persistence with SQLAlchemy + Alembic (4 migrations)
— Robust JSON parsing with code-fence stripping and regex fallback
Stack: LangGraph, FastAPI, PostgreSQL, SQLAlchemy, Alembic, OpenRouter, Tavily, BeautifulSoup, Docker.
-
AI Radar — RAG Research Assistant
Bot DevelopmentRAG-bot that collects AI news from 7 sources and answers questions with links to the sources.
What is interesting technically:
— Hybrid search: BM25 + vector search Qdrant, merged through Reciprocal Rank Fusion, plus cross-encoder reranker
… — RAGAS evaluation of answer quality (faithfulness, answer_relevancy)
— Auto-indexing via APScheduler every 6 hours
— Telegram bot as a thin httpx client to FastAPI
Stack: Python, FastAPI, LangChain, Qdrant, rank_bm25, sentence-transformers, aiogram, APScheduler, Docker, GitHub Actions.