Hi π, I'm Ben Boby
π San Francisco, California Β | Β π +1 (510) 461-6737 Β | Β π§ benbobyabraham@gmail.com
Portfolio β’ LinkedIn β’ Twitter / X
Machine Learning Engineer specializing in production ML systems, large-scale data platforms, LLM applications, and MLOps.
I build end-to-end AI solutions β from data pipelines β model training β deployment β monitoring β real-time inference β with a focus on scalability, reliability, and measurable impact.
- Machine Learning & Deep Learning
- Large Language Models (LLMs), RAG, Agentic AI
- Production ML & MLOps (CI/CD, model monitoring, automated retraining)
- Real-time & batch data processing (Spark, Kafka, Airflow)
- Vector Search & semantic retrieval (Weaviate, Qdrant, FAISS)
- Cloud-native AI systems (AWS, Azure, GCP)
- Edge AI & optimized inference
- Built an open-source edge AI platform for deploying Small Language Models (SLMs)
- Designed real-time inference pipelines with containerized deployment
- Developed ETL & streaming pipelines processing 20M+ IoT events/day
- Implemented data quality frameworks, drift detection, and ML-ready data infrastructure
- Delivered real-time analytics & observability systems
- Created hands-on NLP & Generative AI labs
- Built evaluation frameworks for applied LLM use cases
Languages: Python β’ Go β’ Java/Scala β’ SQL
ML/AI: PyTorch β’ TensorFlow β’ Transformers β’ PEFT β’ LangChain
Data Engineering: Spark β’ Kafka β’ Airflow β’ Hive β’ HBase
MLOps: Docker β’ Kubernetes β’ MLflow β’ Prometheus β’ Grafana β’ GitHub Actions
Databases: PostgreSQL β’ MongoDB β’ Redis β’ Vector DBs
Cloud: AWS β’ Azure β’ GCP
- SF Python β’ PyBay Volunteer
- AWS GenAI Loft β’ GitHub β’ Cloudflare β’ n8n Hackathons
- Active contributor to the Bay Area AI & Python ecosystem
Iβm open to collaborating on:
- Production ML & LLM systems
- AI infrastructure & data platforms
- Applied GenAI products
- Open-source AI tools
