Hi, it's Sayem. Welcome to my homepage!
Passionate about Vision-Language Models and AI research!
I am a researcher in the Vision and Learning Lab at the Ulsan National Institute of Science and Technology (UNIST), where I also completed my Master's degree in Computer Science and Engineering. I work under the supervision of Prof. Seungryul Baek and Prof. Binod Bhattarai. My research focuses on Vision-Language Models (VLMs) for Hand-Object Mesh Reconstruction and Hand-Object Interaction Understanding.
Previously, I completed my Bachelor's in Industrial Engineering at UNIST. During my undergraduate years, I worked on optimizing machine learning models for human detection using thermal images, improving image segmentation with RGB and thermal fusion, and parallelizing bandit algorithms for computational efficiency in recommendation systems.
My broader research interests span Vision-Language Models (VLMs), 3D Hand-Object Interaction, Multimodal Learning, and Scene Understanding.
I am always open to collaborations and discussions. If you are interested in my research or have any inquiries, feel free to reach out to me at khalequzzamansayem@unist.ac.kr.