Hi, it's Sayem, welcome to my Homepage!
Passionate about Vision-Language Models and AI research!
I am a Master's student in Computer Science and Engineering at Ulsan National Institute of Science and Technology (UNIST). Currently, I am working as a Research Assistant at Vision and Learning Lab, UNIST under the supervision of Prof. Seungryul Baek and Prof. Binod Bhattarai. My research focuses on Vision-Language Models (VLMs) for Hand-Object Mesh Reconstruction and Hand-Object Interaction Understanding.
Previously, I completed my Bachelor's in Industrial Engineering at UNIST. During my undergraduate years, I worked on optimizing machine learning models for human detection using thermal images, improving image segmentation with RGB and thermal fusion, and parallelizing bandit algorithms for computational efficiency in recommendation systems.
My broader research interests span Vision-Language Models (VLMs), 3D Hand-Object Interaction, Multimodal Learning and Scene Understanding. My current research explores text-guided dynamic contact modeling and fine-grained relationship understanding in vision-language models for hands object interaction scenario.
I am always open to collaborations and discussions. If you are interested in my research or have any inquiries, feel free to reach out to me at khalequzzamansayem@unist.ac.kr.