Alexey Zaytsev
AI/ML Engineer & Systems Architect
Full-stack ML engineer specializing in generative AI, distributed inference, and model fine-tuning. Combines first-principles deep learning with low-level systems and hardware engineering.
//Relevant Skills
//Education
Université Denis Diderot
LETI University
//Languages
//Featured Projects
Tensors for human consumption. Widely adopted by AI researchers.
From-scratch autograd engine, enough to train a tiny GPT-2 style model.
Open-source multi-provider chat interface, similar to OpenWebUI and t3chat.
AI-native documentation dehydrator for agentic access. CLI, web UI, and MCP server.
LLM-driven SVG/ASCII art generation gallery with iterative visual refinement.
Custom ComfyUI nodes for fine-grained diffusion latent manipulation.
GPU power and fan control utility built on NVIDIA NVML.
//Experience (ML Systems)
AI/ML Engineer (Consulting)
July 2024 - PresentSelect client engagements: • AI Storyboarding Platform (Python backend): Integrated new image generation providers and refactored the generation abstraction layer across use cases (assets, scenes, inpainting). • Earnings Call Analysis & CEO Training: Engineered a low-latency transcription + LLM pipeline (AWS Transcribe / Deepgram) with speaker diarization and financial metrics extraction. Built a companion CEO training module that simulates analyst Q&A via TTS, transcribes spoken responses, and scores both content quality and delivery (WPM, filler words). • Open-source AI projects: congusto-chat, lovely-docs, pelican, latent-tools, nvml-tool.
Generative AI Lead
Nov 2023 - July 2024Led development of the generative AI platform supporting image, video, and 3D model generation. Served tens of thousands of users, generating hundreds of thousands of images: • Architecture & Inference: Built an end-to-end generation pipeline. The frontend talked to the backend that talked to Supabase/PostgreSQL, which queued tasks via RabbitMQ. Distributed GPU workers picked up the jobs, generated the media, uploaded to S3 (served via CloudFront+imgproxy), and pinged a backend webhook, which finally streamed updates to the client via Server-Sent Events. • Model Fine-tuning: Extensive work with Stable Diffusion inference, custom model fine-tuning, and LoRAs to optimize generation quality and capabilities. • Custom NSFW Detector: Trained and deployed an EfficientNet classifier to replace off-the-shelf filters with high false-positive rates, reaching production-grade precision directly in the distributed GPU worker pipeline. • MLOps & DevOps: Orchestrated and scaled the distributed inference infrastructure.
AI/ML Engineer (Consulting)
Nov 2022 - Nov 2023Independent consulting focused on AI applications and prototypes. Select engagements: • Dog Finder: An experimental missing-dog app exploring both facial recognition and noseprint uniqueness. Trained EfficientNet with triplet loss. • SAT Practice Generator: A dynamically updating test prep system seeded by real data. Built with Streamlit, Airtable, and OpenAI LLMs-migrating from GPT-3.5 to GPT-4 largely solved our initial correctness and consistency struggles. • GPTEditor: An AI-assisted copywriting tool-like Cursor, but for prose. Users highlight a paragraph and instruct the LLM on how to rewrite it. Built with SvelteKit, Tiptap, and Supabase.
Electronics Engineering & Hardware Leadership
Sept 2015 - May 2020Led hardware product development from prototype to manufacturing across power electronics and embedded systems, while managing teams and production workflows in Shenzhen.
Research Engineering (Biology & Instrumentation)
Jan 2014 - Sept 2015Built experimental tools and conducted wet-lab research in microfluidics and molecular biology, contributing to published scientific work.
Low-level Systems Software Engineer
Jan 2005 - July 2012Developed kernel-level software (drivers, bootloaders, BSPs) for telecom and storage systems across ARM, MIPS, and x86 platforms.