About me

I am a final year PhD candidate at NTU, Singapore, under the SINGA scholarship by A*STAR. My PhD work is on Towards Semantic, Debiased and Moment Video Retrieval with Multi-modal Features under Prof. Joo Hwee Lim and Dr Hongyuan Zhu from I2R, A*STAR and Prof. Hanwang Zhang from SCSE, NTU.

During my PhD, I was also a visiting researcher at the University of Bristol, UK, under Prof. Michael Wray in Dima Damen’s group. I did my MSc under Prof. Ahmet Emir Dirik on vehicle detection. I have various working experiences, from a start-up in Istanbul to the industry (Turkish Airlines Technology) and a university internship during my MSc (Univeristy of Valencia). I also advised two award-winning start-ups. You can view my resume here.

Recent News

  • 07/2024: Expected date to submit the PhD Thesis.
  • 07/2024: Our recent work, Enhancing Video Corpus Moment Retrieval in Long Ego-centric Videos with LLM and Audio Fusion, is under review.
  • 10/2023: Visited MaVi Research Group, University of Bristol, for three months under the supervision of Dr Michael Wray on video corpus moment retrieval.

Publications

Enhancing Video Corpus Moment Retrieval in Long Ego-centric Videos with LLM and Audio Fusion
Burak Satar, Joo Hwee Lim, Hanwang Zhang, M Furkan Ilaslan, Hongyuan Zhu, Michael Wray
Under review

————————————————————————————————–

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
M Furkan Ilaslan, Ali Koksal, Kevin Qinghong Lin, Burak Satar, Mike Zheng Shou, Qianli Xu
Under review

————————————————————————————————–

Structural Causal Model
Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Burak Satar, Zhu Hongyuan, Hanwang Zhang, Joo-Hwee Lim
BMVC 2023 Full Paper, (Poster presentation)
[arXiv] [YouTube Ppt] [Poster] [Project Page]

————————————————————————————————–

An Overview of Challenges
An Overview of Challenges in Egocentric Text-Video Retrieval
Burak Satar, Zhu Hongyuan, Hanwang Zhang, Joo-Hwee Lim
CVPR Workshop 2023, Joint Ego4d/EPIC Workshop (Oral presentation)
[Extended Abstract] [YouTube Ppt]

————————————————————————————————–

Architecture
Exploiting Semantic Role Contextualized Video Features
for Multi-Instance Video Retrieval (3rd Place Award)
Burak Satar, Zhu Hongyuan, Hanwang Zhang, Joo-Hwee Lim
CVPR Workshop 2022, Epic-Kitchens-100 MIR Challenge under Joint Ego4d/EPIC Workshop
[Technical Report] [(pseudo)Code]

————————————————————————————————–

RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar, Zhu Hongyuan, Hanwang Zhang, Joo-Hwee Lim
[arXiv 2022 Preprint] [(pseudo)Code]

Overview of our model on text-to-video retrieval
Semantic Role Aware Correlation Transformer for Text to Video Retrieval
Burak Satar, Zhu Hongyuan, Xavier Bresson, Joo-Hwee Lim
ICIP 2021 Full Paper (Oral presentation) and ICCV Workshop 2021 (Oral presentation)
[arXiv] [(pseudo)Code] [YouTube Ppt]

————————————————————————————————–

Detection and classification method
Deep Learning Based Vehicle Make-Model Classification
Burak Satar, Ahmet Emir Dirik
ICANN 2018 Full Paper (Oral presentation)
[arXiv] [Code]

Previous News