Dongwon Kim

POSTECH Computer vision lab. kdwon@postech.ac.kr

prof_pic.jpeg

I am a Ph.D. student at POSTECH Computer Vision Lab, supervised by Prof. Suha Kwak. My research centers on whether machines can learn representations with the right abstraction and hierarchy, spanning work in compositional representation (SelfMod), multi-modalities (DivE, SaG, MaskGen), and model-based RL (WIP).

News

Jul 2025 A paper on 1-dimensinal tokenization and text-to-image generation is accepted at ICCV 2025.
Jul 2025 I have completed my defense - now I’m officially a Ph.D. (thesis title: Learning Compositional Visual Representations for Vision-Language Understanding and Generation)
Sep 2024 A paper about object-centric learning is accepted at NeurIPS 2024.
Jun 2024 One paper is accepted at ECCV 2024.
Jun 2024 Joined Bytedance (San Jose, CA) as a research intern

Education

Mar, 2015 - Aug, 2019 B.S. in Computer Science & Engineering
POSTECH, Pohang, South Korea
Sep, 2019 - Present Integrated M.S & Ph.D in Computer Science & Engineering
POSTECH, Pohang, South Korea
Advisor: Prof. Suha Kwak


Experience

Jun, 2024 - Nov, 2024 Research Intern
Fundamental Research Team, ByteDance SEED, San Jose, US
  • Developed efficient text-to-image generative model using compact text-aware 1D tokens
  • First-authored paper “Democratizing Text-to-Image Masked Generative Models…” (accepted to ICCV 2025)


Publications

  1. maskgen.png
    Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
    Dongwon Kim*Ju He*Qihang Yu*Chenglin Yang, Xiaohui Shen, Suha Kwak, and Liang-Chieh Chen
    In IEEE/CVF International Conference on Computer Vision (ICCV), Jan 2025
  2. 158flux.png
    1.58-bit FLUX
    Chenglin Yang, Celong Liu, Xueqing DengDongwon Kim, Xing Mei, Xiaohui Shen, and Liang-Chieh Chen
    In arXiv preprint, Dec 2024
  3. selfmod.png
    Bootstrapping Top-down Information for Self-modulating Slot Attention
    Dongwon KimSeoyeon Kim, and Suha Kwak
    In Neural Information Processing Systems (NeurIPS), Dec 2024
  4. plot.png
    PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery
    Jicheol Park, Dongwon Kim, Boseung Jeong, and Suha Kwak
    In European Conference on Computer Vision (ECCV), Oct 2024
  5. risclip.jpg
    Extending CLIP’s Image-Text Alignment to Referring Image Segmentation
    Seoyeon KimMinguk KangDongwon KimJaesik Park, and Suha Kwak
    In Annual Conference on the North American Chapter of the Association for Computational Linguistics (NAACL), Jun 2024
  6. sag.png
    Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
    Dongwon Kim*Namyup Kim*Cuiling Lan, and Suha Kwak
    In IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023
  7. dive.png
    Improving Cross-Modal Retrieval With Set of Diverse Embeddings
    Dongwon KimNamyup Kim, and Suha Kwak
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2023
    (Highlight, 235/9155 = 2.5%)
  8. restr.png
    ReSTR: Convolution-Free Referring Image Segmentation Using Transformers
    Namyup KimDongwon KimCuiling LanWenjun Zeng, and Suha Kwak
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2022
  9. stml.png
    Self-Taught Metric Learning Without Labels
    Sungyeon KimDongwon KimMinsu Cho, and Suha Kwak
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2022
  10. et.png
    Embedding Transfer With Label Relaxation for Improved Metric Learning
    Sungyeon KimDongwon KimMinsu Cho, and Suha Kwak
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2021
  11. pa.gif
    Proxy Anchor Loss for Deep Metric Learning
    Sungyeon KimDongwon KimMinsu Cho, and Suha Kwak
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2020

Professional Services

Reviewer
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • IEEE/CVF International Conference on Computer Vision (ICCV), 2023
  • IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022-2023
  • European Conference on Computer Vision (ECCV), 2022
  • Winter Conference on Applications of Computer Vision (WACV), 2023
  • Asian Conference on Computer Vision (ACCV), 2022

Honors and Awards

POSTECHIAN Fellowship, POSTECH, 2023
  • $5,000 grant
BK21 Best Paper Award, POSTECH GSAI, 2023
  • Self-Taught Metric Learning without Labels (CVPR 2022)
Qualcomm Innovation Fellowship Winner, Qualcomm Korea Corp., 2022
  • Self-Taught Metric Learning without Labels (CVPR 2022)
  • ReSTR: Convolution?free Referring Image Segmentation Using Transformers (CVPR 2022)
NAVER x POSTECH AI DAY The 2nd and 3rd Prize, 2022
  • ReSTR: Convolution-free Referring Image Segmentation Using Transformers (CVPR 2022)
Qualcomm Innovation Fellowship Winner, Qualcomm Korea Corp., 2021
  • Embedding Transfer with Label Relaxation for Improved Metric Learning (CVPR 2021)
IPIU Best Paper Award, 2021
  • Embedding Transfer with Label Relaxation for Improved Metric Learning (CVPR 2021)