π§ Low-Resource Cross-Architecture Knowledge Transfer
Vision-Language Models work done during internship at SMU & M3S. Focus on transferring knowledge between architectures without paired supervision.
Final-year Electronic & Telecommunication Engineering undergraduate at University of Moratuwa.
Focusing on Spatio-Temporal 3D Visual Grounding on Dynamic Point Clouds. Passionate about 3D Computer Vision, Multimodal Learning, and Computational Photography.
Technical Guide β’ 5 min read
Career & Mentorship β’ 8 min read
Computer Vision β’ 12 min read
Image Processing β’ 6 min read
Vision-Language Models work done during internship at SMU & M3S. Focus on transferring knowledge between architectures without paired supervision.
Computer Vision System for Bin Picking Task using image Segmentation models such as SAM, DeepLab, Unet, and Segnet on an industrial robot arm.
PyTorch Implementation of paper titled D2BGAN: A Dark to Bright Image Conversion Model for Quality Enhancement without Paired Supervision.
Converts any image dataset into 2D t-SNE visualization for interactive 3D mapping of photography based on visual features.
Audio Signal Processing Challenge 2024 on Robovox Dataset for far-field speaker recognition by a mobile robot.
A tool to evaluate different explainability methods for Vision Transformer Architectures.
Extended WACV 2024 implementation for real-time 4K video exposure correction pipeline.
A distributed system integrating NVIDIA Jetson for edge computing with Microsoft Hololens for AR visualization.
Implementation of Kalman Filter for tracking humans in 3D point cloud data without pre-installed infrastructure.
A wearable clicker for public speakers and other professionals. Custom PCB and enclosure design.
Automated bots for Instagram and Twitter using Selenium and Python for engagement automation.
A comprehensive web application for managing power transformer inspections with AI-powered analysis.