I lead end-to-end delivery of large-scale Generative AI solutions for financial institutions — from multi-agent architectures and patent-pending hallucination detection to production agentic systems serving millions.
Designing production multi-agent platforms, RAG architectures, and LLM-powered automation — including a patent-pending hallucination detection framework commercially deployed across banking clients.
Architecting scalable, secure cloud infrastructure across AWS, Azure, and GCP — from serverless microservices to GPU clusters for model training and inference at enterprise scale.
Automating RCSA workflows, fraud detection, credit risk modeling, and regulatory compliance — deploying AI-powered risk intelligence across major financial institutions in LATAM and North America.
Building high-performance data pipelines, model training infrastructure, and real-time analytics systems — from Snowflake ETL to LoRA fine-tuning and Monte Carlo scenario analysis.
AWS Strands + Bedrock agentic platform enabling automated SQL querying, model training, and real-time Monte Carlo & Bayesian scenario analysis for 30,000+ employees at a major regional bank.
Full-stack from on-premises infrastructure setup through agent development, testing, and production rollout — delivered end-to-end in 6 months.
AI-powered Risk & Control Self-Assessment solution deployed across 6 different regional banks, automating complex compliance workflows with multi-agent orchestration.
Real-time agentic global legal search engine for a global ride-sharing company, performing automated compliance analysis across international regulatory frameworks.
Led full lifecycle of a Knowledge Retrieval platform — process mapping, business case, requirements, cross-regional build, and CoE launch — leveraging Azure and advanced RAG for 10,000+ documents across EMEA, Americas, and India.
Trained a Small Language Model for a Central Asian government using LoRA fine-tuning, DPO, and knowledge distillation, deployed within an Agentic RAG architecture.
Locally-hosted, AI-powered tax prep assistant mirroring the TurboTax guided experience. Combines Mistral OCR, dual-LLM analysis (Claude + OpenAI with RAG), confidence scoring, IRS Form 1040 generation, and autonomous agentic orchestration.
View on GitHub →End-to-end pipeline for fine-tuning HunyuanVideo (13B params) with LoRA to generate personalized videos from text prompts. Uses a trigger-token approach to bind specific subjects during training, enabling placement into novel scenes during inference.
View on GitHub →
AI-powered legacy code transformation — converting enterprise COBOL to modern Python with full logic preservation and test coverage.
Watch on YouTube →
Multi-agent platform leveraging AWS Strands and Bedrock for automated SQL querying, model training, and real-time scenario analysis.
Watch on YouTube →
AI-powered Risk & Control Self-Assessment automation — streamlining compliance workflows with multi-agent orchestration.
Watch on YouTube →
Full walkthrough of the AI-powered tax preparation assistant — from document upload and OCR to dual-LLM analysis, confidence scoring, and IRS Form 1040 generation.
Watch on Vimeo →
End-to-end LoRA fine-tuning pipeline — from video normalization and Gemini-powered captioning through multi-GPU training to personalized video generation from text prompts.
Watch on YouTube →The gap between a working prototype and a production agentic system is not incremental — it is architectural. Exploring seven critical failure modes and the infrastructure to survive them.
Read Deep Dive → LinkedInBefore I ever wrote a line of code for McKinsey, I was running 18-wheelers across 48 states. I founded Fast River Logistics at 22 and spent seven years learning that the hardest engineering problems aren't technical — they're about people, systems, and relentless execution under pressure.
That operator's mindset followed me through a Computer Engineering Master's at Duke, DARPA-funded research in adversarial AI at the Applied Machine Learning Lab, and into McKinsey — where I now lead the end-to-end delivery of enterprise Generative AI solutions for financial institutions worldwide. I've shipped agentic systems serving tens of thousands of users, hold a patent pending on LLM hallucination detection, and was promoted three times faster than the standard timeline.
Fluent in English and Spanish, conversational in Russian, and learning Arabic — I bring a global perspective and a builder's intensity to every system I architect.
Download RésuméSpecialist → Engagement Manager in under 1 year at McKinsey. Standard timeline is 3+ years.
Novel hallucination detection methodology for LLMs and agentic systems, commercially deployed across banking clients.
MS Computer Engineering (3.8 GPA). Developed adversarial detection models for the Department of Defense.
Built Fast River Logistics from zero to 48-state operations with 6 years of consistent profit growth.
English & Spanish (native/bilingual), Russian (elementary) — effective across global teams.
Graduate TA at Duke Fuqua (MBA) and Pratt (Engineering). Student mentor for Duke Athletics and Pratt's DEI Committee Subcommittee Chairman.
NCSA Research Fellow (UIUC), XSEDE EMPOWER Apprentice, PEARC'19 conference publication, nominated for Best Oral Presentation at ISRS'19.
Whether you're exploring AI transformation, scaling agentic systems, or modernizing financial infrastructure — I'd love to hear about your challenge.