MS Bioinformatics candidate at Northeastern University (GPA 3.94) with 3+ years of industry experience spanning pharmaceutical LIMS platforms, spatial transcriptomics, and de novo protein design. Currently building agentic multi-omics AI systems at Solid Biosciences and supporting 10,000+ researchers on HPC infrastructure at Northeastern's Research Computing Team.
I bridge the gap between wet-lab biology and production-grade AI/ML — from designing AAV capsid proteins with RFdiffusion3 to deploying full-stack bioinformatics platforms on AWS.
| Company | Role | Impact |
|---|---|---|
| Solid Biosciences | Computational Biology Co-op | Agentic multi-omics AI · 50,000+ AAV capsid variants · 80% manual analysis reduction |
| Boehringer Ingelheim | Digital Transformation Intern | LLM-powered insight engine · 1M+ cells processed · 40% efficiency gain |
| Northeastern University | Graduate Research Assistant | HPC support for 10,000+ researchers across SLURM, GPU, MPI clusters |
| Lee Makowski Lab | Programming Assistant | 92% protein gel classification accuracy · AlphaFold3 on HPC |
| Zifo Technologies | Scientific Application Analyst | 4+ GxP-compliant LIMS/ELN platforms · 3,500+ pharma users |
Protein Design & Structure Prediction
AlphaFold3 RFdiffusion3 RoseTTAFold3 Chai-1 Boltz-2 ProteinMPNN LigandMPNN ThermoMPNN ColabFold PyMOL ChimeraX
Genomics & Single-Cell Analysis
Scanpy Squidpy Seurat CellRanger Harmony Scanorama CellTypist GATK DESeq2 STAR BWA Nextflow Snakemake
AI/ML & LLM Engineering
LangChain RAG FAISS Graph Neural Networks Vision Transformers PyTorch TensorFlow scikit-learn XGBoost Apache Spark
Cloud & MLOps
AWS (EC2, S3, Batch, ECR, EFS, SageMaker, Bedrock, RDS, CloudWatch) Azure GCP Docker Singularity SLURM HPC Airflow CI/CD
Full-Stack Development
Django React Node.js TypeScript PostgreSQL MongoDB REST APIs
🧬 Agentic Multi-Omics AI System (Solid Biosciences) LangChain + GPT-4 + AWS SageMaker pipeline processing 500,000+ biomarker measurements for precision medicine target identification.
💊 AAV Capsid Protein Design Platform Cloud-native platform containerizing 11 protein design models (RFdiffusion3, Chai-1, Boltz-2, etc.) with auto-scaling GPU orchestration across 50,000+ de novo DMD variants.
🔍 Single-Cell & Spatial Transcriptomics Platform (Boehringer Ingelheim) End-to-end platform processing 1M+ cells with Scanpy, Squidpy, Harmony, and LLM-powered natural language querying via AWS Bedrock RAG.
✂️ CRISPR-Cas9 ML Optimization Random Forest + Gradient Boosting + Neural Network pipeline predicting sgRNA efficiency from 100,000+ sequences with 92% accuracy, reducing screening by 90%.
- Emberling et al. (2025) — Metabolic enzymes moonlighting as RNA binding proteins. Biophysical Journal. DOI
- Apoorvha, Brindha, Ganesan et al. (2025) — Weighted gene co-expression network analysis reveals hub genes of quiescence. 3 Biotech. DOI