Francisco Reveriano — Expert Generative AI Engagement Manager @ McKinsey

// What I Do

Deep expertise across
the AI & cloud stack

🧠

GenAI & Agentic Systems

Designing production multi-agent platforms, RAG architectures, and LLM-powered automation — including a patent-pending hallucination detection framework commercially deployed across banking clients.

AWS Strands LangGraph Bedrock Agents Claude API OpenAI Agents RAG

Deep Dive

Patent-pending hallucination detection for LLM and agentic responses — deployed commercially across multiple banking clients
Trained a Small Language Model using LoRA fine-tuning, DPO, and knowledge distillation for a Central Asian government tax platform
Built a GraphRAG Compliance Engine performing real-time agentic global legal search for a global ride-sharing company
Multi-agent orchestration with function calling, vector databases, and prompt engineering at enterprise scale

Production Agent Systems

Patent Pending

☁️

Cloud & Infrastructure

Architecting scalable, secure cloud infrastructure across AWS, Azure, and GCP — from serverless microservices to GPU clusters for model training and inference at enterprise scale.

AWS Azure GCP Lambda Labs Docker Kubernetes Terraform

Deep Dive

Designed best-in-class AWS architecture for agent orchestration using Strands + Bedrock
Deployed SLM on Lambda Lab + Azure hybrid GPU server stack for government production workloads
Built full-stack on-premises infrastructure for real-time agentic call center operations
MCP / A2A protocol integration, OpenTelemetry observability, and Terraform-based IaC

Cloud Platforms

E2E

Infra to Production

🏦

Banking & Risk Automation

Automating RCSA workflows, fraud detection, credit risk modeling, and regulatory compliance — deploying AI-powered risk intelligence across major financial institutions in LATAM and North America.

RCSA Fraud Detection Credit Risk CCAR Compliance

Deep Dive

RCSA agentic solution deployed across 6+ regional banks with multi-agent compliance orchestration
AI-driven reconciliation algorithms for syndicated lending at a top-3 Japanese bank — 92% backlog reduction
Validated fraud, optimization, and credit risk models for multiple LATAM and North American banks
Led C&I CCAR modeling and automated validation suites at U.S. Bancorp

92%

Backlog Reduction

16+

Banks Engaged

⚡

Data Engineering & ML Ops

Building high-performance data pipelines, model training infrastructure, and real-time analytics systems — from Snowflake ETL to LoRA fine-tuning and Monte Carlo scenario analysis.

Python PyTorch Snowflake CUDA CI/CD FastAPI

Deep Dive

Automated SQL querying + model training with real-time Monte Carlo & Bayesian scenario analysis for 30k+ employees
Developed McKinsey's Model Bias Testing Framework — identified critical biases in 3 production healthcare models
Built object detection neural networks and adversarial detection models for DARPA / Department of Defense
Researched neural network video compression on IBM HAL cluster at NCSA

30k+

Employees Analyzed

DARPA

Research Partner

// Selected Work

Impact-driven
projects at scale

Multi-Agent Workforce Planning Platform

AWS Strands + Bedrock agentic platform enabling automated SQL querying, model training, and real-time Monte Carlo & Bayesian scenario analysis for 30,000+ employees at a major regional bank.

30k+

Employees Covered

Technical Stack

AWS Strands agent orchestration
Amazon Bedrock (Anthropic Claude)
Automated SQL generation & execution
Real-time Monte Carlo simulation
Bayesian scenario modeling

Impact & Outcomes

Covers 30,000+ employees in real-time
Automated model training pipeline
Best-in-class AWS agent architecture
Executive-ready reporting dashboards

Real-Time Agentic Call Center Solution

Full-stack from on-premises infrastructure setup through agent development, testing, and production rollout — delivered end-to-end in 6 months.

57%

Efficiency Gain

What Was Built

On-premises infrastructure design & setup
Real-time agentic response system
End-to-end testing & QA framework
Production deployment & monitoring

Results

57% efficiency improvement in response handling
6-month concept-to-production timeline
Full-stack ownership: infra → agents → deploy

RCSA Agentic Automation

AI-powered Risk & Control Self-Assessment solution deployed across 6 different regional banks, automating complex compliance workflows with multi-agent orchestration.

RCSA Deployments

Architecture

Multi-agent compliance orchestration
Automated risk assessment workflows
Cross-bank deployment framework
Regulatory document analysis

Scale

6 regional banks in production
Complex compliance workflow automation
Standardized RCSA across institutions

GraphRAG Compliance Engine

Real-time agentic global legal search engine for a global ride-sharing company, performing automated compliance analysis across international regulatory frameworks.

Global

Legal Coverage

How It Works

Knowledge graph of global regulations
Agentic retrieval-augmented generation
Real-time legal search across jurisdictions
Automated compliance gap analysis

Capabilities

Multi-jurisdictional regulatory coverage
Natural language legal querying
Continuous regulatory update ingestion

GenAI Knowledge Retrieval — Commercial Bank

Led full lifecycle of a Knowledge Retrieval platform — process mapping, business case, requirements, cross-regional build, and CoE launch — leveraging Azure and advanced RAG for 10,000+ documents across EMEA, Americas, and India.

85%

Adoption Rate

Leadership Scope

10+ cross-functional technical teams
McKinsey, client, and vendor coordination
Global delivery: Europe, India, U.S.

Outcomes

85% user adoption rate
Enterprise-wide knowledge retrieval
Commercial Investment Bank deployment

SLM Fine-Tuning — Government Tax Platform

Trained a Small Language Model for a Central Asian government using LoRA fine-tuning, DPO, and knowledge distillation, deployed within an Agentic RAG architecture.

SLM

Custom Model

ML Techniques

LoRA fine-tuning for domain adaptation
Direct Preference Optimization (DPO)
Knowledge distillation from larger models
Agentic RAG deployment architecture

Infrastructure

Lambda Labs GPU training cluster
Azure production serving stack
Government-grade security compliance

Taxy.AI — Agentic Tax Preparation Assistant

Locally-hosted, AI-powered tax prep assistant mirroring the TurboTax guided experience. Combines Mistral OCR, dual-LLM analysis (Claude + OpenAI with RAG), confidence scoring, IRS Form 1040 generation, and autonomous agentic orchestration.

View on GitHub →

9-Step

Wizard UI

Technical Stack

Autonomous n0 agent loop with TodoWrite planning
Dual-LLM analysis (Anthropic Claude + OpenAI Assistants/RAG)
Mistral OCR 3 for document extraction
React + Vite frontend, FastAPI backend
OpenTelemetry tracing & JSONL audit trail

Impact & Outcomes

IRS Form 1040 AcroForm PDF generation (23 fields)
GREEN/AMBER/RED/YELLOW confidence scoring engine
97 automated tests with digital twin framework
Real-time SSE streaming with human-in-the-loop

LoRA MultiModal Fine-Tuning

End-to-end pipeline for fine-tuning HunyuanVideo (13B params) with LoRA to generate personalized videos from text prompts. Uses a trigger-token approach to bind specific subjects during training, enabling placement into novel scenes during inference.

View on GitHub →

13B

Parameter Model

Key Technical Features

HunyuanVideo 13B base model with ~100MB LoRA adapter
Trigger-token subject binding for personalized generation
Gemini 2.5 Flash automated captioning pipeline
6x data augmentation (temporal crops + horizontal flips)
FP8 quantization & bfloat16 mixed-precision training
FFmpeg/OpenCV normalization (768×512, 24fps)

// Career

From founder to
enterprise AI leader

2024 – Present

Expert Engagement Manager — AI

McKinsey & Company, Austin, TX

Leading end-to-end delivery of large-scale GenAI solutions for financial institutions — Knowledge Retrieval platforms, Data Quality GenAI systems, Banking Control AI, and Knowledge Graph RAG frameworks. Patent pending on hallucination detection. Promoted 3× faster than standard timeline.

2023 – 2024

Specialist — Data Science & Analytics

McKinsey & Company, Boston, MA

Patented hallucination detection framework. Designed AI reconciliation algorithms reducing backlog by 92% at a top-3 Japanese bank. Built McKinsey's Model Bias Testing Framework.

2021 – 2023

Senior Analyst

McKinsey & Company

Leading expert in Banking Fraud/AML analytics and transformations. Developed best-in-class Fairness and Bias modeling standards. Expertise in Loan Operations and Model Risk Management across international clients.

2020 – 2021

Lead Quantitative Risk Model Developer

U.S. Bank, Minneapolis, MN

Advanced from intern to Lead in 6 months. Led C&I CCAR modeling, developed first full Python model development pipeline, and converted CCAR/CECL SAS code to Python. Built Wholesale hazard and failure time models.

2020

AI/FinTech Machine Learning Engineer

Neocova, St. Louis, MO

Managed four teams (24 interns) in an Agile environment. Developed ML models for Community Bank valuation and deployed FinTech valuation tools on Azure/R/Python in record six weeks.

2020

ML & AI Summer Consultant

Retinal Care Inc., Durham, NC

Developed and deployed a Deep Learning Rank Model for Diabetic Retinopathy identification, outperforming current industry models. Led a cross-discipline team of four through the full development cycle.

2019 – 2021

Graduate Research Assistant

Duke Applied Machine Learning Lab

Worked on classified Department of Defense Machine Learning projects. Implemented YOLOv3 and CenterNet models in PyTorch. Multiple publications and conference presentations.

2019

Research Fellow

National Center for Supercomputing Applications (NCSA), UIUC

Researched statistical learning for graphene nanomanufacturing and designed a deep learning framework for near-duplicate image detection that outperformed all prior work. Nominated for Best Oral Presentation at ISRS'19.

2013 – 2020

Founder & CEO

Fast River Logistics Inc., Houston, TX

Founded and scaled an interstate freight trucking company from 0 to 14 vehicles and 18 employees. Expanded to all 48 states and Mexico with 6 consecutive years of profit growth.

// About

The person behind
the architecture

Before I ever wrote a line of code for McKinsey, I was running 18-wheelers across 48 states. I founded Fast River Logistics at 22 and spent seven years learning that the hardest engineering problems aren't technical — they're about people, systems, and relentless execution under pressure.

That operator's mindset followed me through a Computer Engineering Master's at Duke, DARPA-funded research in adversarial AI at the Applied Machine Learning Lab, and into McKinsey — where I now lead the end-to-end delivery of enterprise Generative AI solutions for financial institutions worldwide. I've shipped agentic systems serving tens of thousands of users, hold a patent pending on LLM hallucination detection, and was promoted three times faster than the standard timeline.

Fluent in English and Spanish, conversational in Russian, and learning Arabic — I bring a global perspective and a builder's intensity to every system I architect.

Download Résumé Download Cover Letter

🚀

3× Faster Promotion

Specialist → Engagement Manager in under 1 year at McKinsey. Standard timeline is 3+ years.

📜

Patent Pending

Novel hallucination detection methodology for LLMs and agentic systems, commercially deployed across banking clients.

🎓

Duke + DARPA Research

MS Computer Engineering (3.8 GPA). Developed adversarial detection models for the Department of Defense.

🏗️

Founder at 22

Built Fast River Logistics from zero to 48-state operations with 6 years of consistent profit growth.

🌍

Multilingual

English & Spanish (native/bilingual), Russian (elementary), Arabic (beginner) — effective across global teams.

📚

Teaching & Mentorship

Graduate TA at Duke Fuqua (MBA) and Pratt (Engineering). Student mentor for Duke Athletics and Pratt's DEI Committee Subcommittee Chairman.

🔬

Published Researcher

NCSA Research Fellow (UIUC), XSEDE EMPOWER Apprentice, PEARC'19 conference publication, nominated for Best Oral Presentation at ISRS'19.

Building the future of intelligent systems

Deep expertise acrossthe AI & cloud stack

GenAI & Agentic Systems

Deep Dive

Cloud & Infrastructure

Deep Dive

Banking & Risk Automation

Deep Dive

Data Engineering & ML Ops

Deep Dive

Impact-drivenprojects at scale

Multi-Agent Workforce Planning Platform

Technical Stack

Impact & Outcomes

Real-Time Agentic Call Center Solution

What Was Built

Results

RCSA Agentic Automation

Architecture

Scale

GraphRAG Compliance Engine

How It Works

Capabilities

GenAI Knowledge Retrieval — Commercial Bank

Leadership Scope

Outcomes

SLM Fine-Tuning — Government Tax Platform

ML Techniques

Infrastructure

Taxy.AI — Agentic Tax Preparation Assistant

Technical Stack

Impact & Outcomes

LoRA MultiModal Fine-Tuning

Key Technical Features

See it in action

Tools & technologiesI work with daily

Thoughts on AI,engineering & strategy

Vibe Coding vs. Production Agentic AI: What the Demos Won't Show You

Agentic RAG in 2026: Why the Name You Bought Last Year Isn't the Architecture You Need This Year

The Quiet Revolution of Small Language Models — Why Bonsai Caught My Attention

AI Code Generation Is Barely Touching 30% of Software

Revolutionizing KYC with Agentic AI and Semantic Search

5 Reasons Agentic AI Fails — and How to Avoid Them

From Art to Engineering: A Practical Rubric for GPT-4.1 Prompt Design

Enhancing Entity Resolution Using Generative AI — Part 1

GenAI Defensive Data Poisoning

Knowledge Graphs vs. Agentic RAG — Part 1

Reviewing YOLOv4

YOLOv3 PyTorch Video & Image Model

What Is ShuffleNet?

From founder toenterprise AI leader

The person behindthe architecture

3× Faster Promotion

Patent Pending

Duke + DARPA Research

Founder at 22

Multilingual

Teaching & Mentorship

Published Researcher

Academic foundations

Continuous learning

Ready to buildsomething intelligent?

Building the future
of intelligent systems

Deep expertise across
the AI & cloud stack

Impact-driven
projects at scale

Tools & technologies
I work with daily

Thoughts on AI,
engineering & strategy

From founder to
enterprise AI leader

The person behind
the architecture

Ready to build
something intelligent?