
Exploring advanced deep learning architectures, hardware-aware optimizations, and performant AI deployments.

An implementation of parameter-efficient fine-tuning for large language models, focusing on methods such as LoRA and QLoRA.

PyTorch, Transformers, bitsandbytes, PEFT
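
To make the idea behind LoRA concrete, here is a minimal sketch in plain Python rather than the project's actual PyTorch/PEFT code. It assumes the standard LoRA formulation: a frozen weight matrix W is adapted as W + (alpha / r) * B @ A, where only the small matrices A and B are trained. QLoRA uses the same adapters but additionally keeps the frozen W in 4-bit precision, which this sketch does not model. The helper names (`matmul`, `lora_effective_weight`, `lora_param_counts`) and the toy dimensions are illustrative assumptions.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, the weight a LoRA-adapted layer applies."""
    r = len(a)                    # rank: A has shape (r, d_in)
    scale = alpha / r
    delta = matmul(b, a)          # (d_out, r) @ (r, d_in) -> (d_out, d_in)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_param_counts(d_out, d_in, r):
    """Return (parameters in the full matrix, trainable LoRA parameters)."""
    return d_out * d_in, r * (d_out + d_in)


# Toy dimensions, chosen only for illustration.
random.seed(0)
d_out, d_in, r = 4, 6, 2
w = [[random.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]   # frozen
a = [[random.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]    # trainable
b = [[0.0] * r for _ in range(d_out)]                                   # trainable, zero-initialized

# With B initialized to zero, the adapted layer starts out identical to the base layer.
w_eff = lora_effective_weight(w, a, b, alpha=16)

full, trainable = lora_param_counts(4096, 4096, 8)
print(f"full: {full:,}  LoRA: {trainable:,}  ratio: {trainable / full:.4%}")
```

For a 4096 x 4096 projection at rank 8, the adapters hold 65,536 trainable values against roughly 16.8 million in the full matrix, which is where the memory savings come from.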

A retrieval-augmented generation (RAG) pipeline that combines dense retrieval with generative language models.

PyTorch, Hugging Face, FAISS, LangChain
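
The retrieve-then-generate flow can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the project's code: the three-dimensional embeddings are hand-written stand-ins for what an embedding model would produce, the brute-force cosine search stands in for a FAISS index, and the assembled prompt would be passed to a generative model in a real pipeline. The names `cosine`, `retrieve`, and `build_prompt` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def retrieve(query_vec, index, k=2):
    """Return the k passages whose embeddings are most similar to the query."""
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(question, passages):
    """Assemble the retrieved passages and the question into a single prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical passages with made-up 3-d embeddings.
index = [
    ("LoRA adds low-rank adapters to frozen weights.", [0.9, 0.1, 0.0]),
    ("FAISS performs nearest-neighbour search over vectors.", [0.1, 0.9, 0.1]),
    ("Kubernetes schedules containers across a cluster.", [0.0, 0.1, 0.9]),
]

query_vec = [0.2, 0.8, 0.0]   # stand-in for the embedded question
passages = retrieve(query_vec, index, k=2)
prompt = build_prompt("What does FAISS do?", passages)
print(prompt)
```

Grounding the prompt in retrieved passages is the design choice that lets the generator answer from the indexed corpus instead of relying only on what it memorized during training.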

An image classification implementation built on Vision Transformers (ViT).

PyTorch, Hugging Face, TensorFlow

High-performance Stable Diffusion implementation with custom attention mechanisms.

PyTorch, diffusers, xFormers, CUDA

Production-grade MLOps infrastructure with automated training and deployment pipelines.

Kubernetes, MLflow, Kubeflow

Optimized serving system for large language models with dynamic Kubernetes scaling.

vLLM, Triton, CUDA Graphs, Ray

AI Systems Development