A goal-conditioned reinforcement learning framework with DDPG, SAC, TD3 and TQC for the Panda-Gym environment. Trains robotic arm for reaching, pushing, sliding and pick-place tasks.
Autonomous drone navigation framework with real-time object detection and spatial tagging. Uses YoloV8 and SORT for tracking with 3D spatial mapping capabilities.
Flexible framework for generating high-quality Image-Text pairs for fine-tuning generative models like Stable Diffusion and DALL-E using open-source VLLMs.
Implementation of TD3 and SAC reinforcement learning algorithms for MuJoCo environments including Ant, HalfCheetah, Hopper, and Walker for robotic control tasks.
Resource-conscious DQN framework for Atari games optimized for consumer hardware. Achieves 57% better performance than standard DQN with efficient scaling.
6-DoF camera pose estimation using deep learning models on 7-Scenes dataset. Compares different CNN backbones for performance tradeoffs in pose estimation.
Semantic segmentation of aerial images using UNet to classify land, water, buildings, roads, and vegetation in Dubai imagery.
Educational repository for U-Net image denoising with support for consistent and varying noise levels. Performs general image denoising with room for detail recovery improvements.
Educational implementation of NeRF for novel view synthesis. Renders objects at unseen angles with mask losses for improved foreground/background rendering.
Enhanced Super-Resolution Generative Adversarial Network for high-fidelity image super-resolution. Trained on HD images for generalization across various contexts.
Python-based chess engine with Alpha-beta pruning. Estimated ELO rating of 1550-1750, performing at 4-7 depth evaluations due to computational constraints.
Complete implementation of Canny Edge Detection algorithm with enhancements for accuracy and speed. Includes FFT-based convolution and configurable parameters.
Deep reinforcement learning for traffic signal optimization using SUMO simulations. Modified DuelDQN with weight transfer for multi-intersection scenarios.
Reinforcement learning on LunarLander-v2 using DQN and Actor-Critic methods. Actor-Critic converges 4x faster than DQN in continuous action spaces.
Educational framework for fine-tuning Stable Diffusion models via HuggingFace. Supports conditional and unconditional training with lightweight implementation.
Fast Fourier Transform for audio signal decomposition and visualization. Converts MP3 audio to frequency-based visualizations compiled into matching MP4 videos.