
Open to ML / AI Engineer roles • Production ML • LLM systems • MLOps

benbobyabraham/README.md

Hi 👋, I'm Ben Boby

AI/ML Engineer • Machine Learning Systems • Data Platforms • LLM & GenAI

📍 San Francisco, California | 📞 +1 (510) 461-6737 | 📧 benbobyabraham@gmail.com

Portfolio • LinkedIn • Twitter / X


🚀 About Me

Machine Learning Engineer specializing in production ML systems, large-scale data platforms, LLM applications, and MLOps.

I build end-to-end AI solutions, from data pipelines → model training → deployment → monitoring → real-time inference, with a focus on scalability, reliability, and measurable impact.


🧠 Core Expertise

  • Machine Learning & Deep Learning
  • Large Language Models (LLMs), RAG, Agentic AI
  • Production ML & MLOps (CI/CD, model monitoring, automated retraining)
  • Real-time & batch data processing (Spark, Kafka, Airflow)
  • Vector Search & semantic retrieval (Weaviate, Qdrant, FAISS)
  • Cloud-native AI systems (AWS, Azure, GCP)
  • Edge AI & optimized inference

🏗️ Professional Experience

🔹 AI/ML Systems – UmbrellaAI

  • Built an open-source edge AI platform for deploying Small Language Models (SLMs)
  • Designed real-time inference pipelines with containerized deployment

🔹 Senior Software Engineer – NTT DATA

  • Developed ETL & streaming pipelines processing 20M+ IoT events/day
  • Implemented data quality frameworks, drift detection, and ML-ready data infrastructure
  • Delivered real-time analytics & observability systems

🔹 Data Analyst – Golden Gate University

  • Created hands-on NLP & Generative AI labs
  • Built evaluation frameworks for applied LLM use cases

🧰 Tech Stack

Languages: Python • Go • Java/Scala • SQL
ML/AI: PyTorch • TensorFlow • Transformers • PEFT • LangChain
Data Engineering: Spark • Kafka • Airflow • Hive • HBase
MLOps: Docker • Kubernetes • MLflow • Prometheus • Grafana • GitHub Actions
Databases: PostgreSQL • MongoDB • Redis • Vector DBs
Cloud: AWS • Azure • GCP


📊 GitHub Analytics


🌍 Community & Open Source

  • SF Python • PyBay Volunteer
  • AWS GenAI Loft • GitHub • Cloudflare • n8n Hackathons
  • Active contributor to the Bay Area AI & Python ecosystem

🤝 Let’s Collaborate

I’m open to collaborating on:

  • Production ML & LLM systems
  • AI infrastructure & data platforms
  • Applied GenAI products
  • Open-source AI tools

Popular repositories

  1. 100DaysofML Public

    Forked from kabirnagpal/100DaysofML

    This Repository provides resources for the 100 Days of ML initiative.

    3

  2. flask_site Public

    Python 2

  3. streamlit-app Public

    Python 2

  4. opencv-heroku Public

    Python 2 1

  5. benbobyabraham Public

    My info

    2

  6. Andrew-NG-Notes Public

    Forked from edwin-das/Andrew-NG-Notes

    This is Andrew NG Coursera Handwritten Notes.

    Jupyter Notebook 2
