Alok Upadhyay Building intelligence at scale.
Hands-on, research-focused ML Engineering Leader with 12+ years at Amazon & AWS. Leading 20+ person orgs building Multimodal AI products, large-scale Recommendation Systems, and Generative AI pipelines serving millions of users. 4 US Patents. Published at ICLR. IEEE Senior Member.
Blog
Thoughts on building Multimodal AI, ML systems at scale, and lessons from the trenches.
Loading posts...
ML Engineering Leadership Experience
From founding engineer to Manager of Managers -- building hands-on, research-driven ML products at massive scale.
- Lead the Outbound Message Generation & Engagement organization (~20 engineers, SDMs, and TPMs), owning customer enrollment, engagement, and retention through personalized touchpoints
- Architected a self-hosted GenAI Auto-QA system on AWS Bedrock using LLMs to validate 500M+ assets/month across Email, Push, and WhatsApp, eliminating manual vendor review
- Direct a massive-scale content generation pipeline delivering 3B+ personalized messages monthly, leveraging transformer recommendation architectures for retrieval and ranking
- Established the technical roadmap transitioning the org from legacy systems to a GenAI-first architecture, reducing inference cost by 17% through model optimizations
- Led a cross-functional team of ~14-18 SDEs, SDMs, Applied Scientists, and TPMs; managed the Manager of Managers layer for Authentication/Authorization/Presence teams
- Launched the Multimodal Recognition Engine fusing VoiceID, VisualID, PhoneID, and usage patterns to infer user identity, with cloud-edge synchronization in under 8ms
- Delivered VisualID (Facial Recognition) and PhoneID (Bluetooth Proximity) for Alexa personalization, coordinating across 23+ external teams
- Built the ML training pipeline for Continuous Improving Multimodal Recognition -- an online learning system integrated with LLM-powered Alexa+ experiences
- Devised the Authentication Confidence Levels (ACL) security standard -- a 6-tier scoring scheme adopted company-wide
- Founding Engineer; architected high-stakes identity and security features resulting in multiple granted US Patents
- Architected "Limit Access" on Alexa -- a multi-factor authentication system (Voice PIN + VoiceID) achieving HIPAA compliance for Amazon Pharmacy
- Designed the Person Recognition Identity API for 3rd-party developers, enabling secure personalization of Alexa interactions
- Invented the Cross-Modal Automated Ground Truth scheme (patented) using BLE proximity to ground-truth voice prints, bringing in 100M+ monthly labels
- Founding engineer for Alexa Identity; built foundational identity services, human-in-the-loop ground truthing, offline/online ML pipelines, launched Alexa Voice Training & Recognition
- Established operational excellence mechanisms ensuring low-latency, high-availability voice biometric services at scale
- Built the Resource Groups Tagging API enabling tag-based access control (TBAC) at 40K+ TPS
- Redesigned Tagging Discovery Services eliminating single-point-of-failure; deployed to AWS GovCloud with air-gapped security protocols
- Founding engineer for Amazon Home Services marketplace; designed seller onboarding flows, ASIN search, and AWS Step Functions workflows
- Built a hyperlocal seller notification mechanism reducing customer order claim time by 95% (~12hrs to ~30min)
Featured In
Alok's work on Alexa identity, voice recognition, and ambient AI has been covered by 10+ major outlets.
Multimodal AI, RecSys & GenAI Skills
Leadership & Strategy
Machine Learning & AI
Engineering & Infrastructure
US Patents in AI & Identity
4 granted US patents and 1 pending -- spanning multimodal identity recognition, biometric authentication, and person detection.
Peer-Reviewed ML Research Publications
Published research at ICLR, MathAI, and IEEE on multimodal AI reasoning, LLM recommendation systems, and biometric geometry.
ORCID: 0009-0003-6892-4875Are VLM Identity Judgments Logically Consistent? Evaluating Symmetry, CoT, and Transitivity in Person Re-ID
Do LLM Recommenders Obey Preference Axioms? Testing Logical Rationality in LLM-Based Recommendation
A Novel Architecture for Secure Communications in Mobile Systems
Riemannian Geometry of Multimodal Biometric Embedding Spaces
Additional papers under peer review
Conference Reviewer & Program Committee
Program Committee member and peer reviewer for top-tier ML and AI conferences including ICLR, CVPR, and ACM ICMR.
Academic Background
Birla Institute of Technology and Science (BITS), Pilani
Certifications & Memberships
FAA Certificated Private Pilot
Single Engine Land (ASEL)
Since Jan 2022