Blueberry Lab
Senior Research Scientist — Efficient & Scalable Vision

Seungwoo Yoo

Current focus: Sub-10 ms BEV perception backbone on an Orin-class SoC with INT4 weights / FP8 activations.

Background

PhD from UC Berkeley (BAIR), 2021, advised by Trevor Darrell and Kurt Keutzer, with a thesis on mixed-precision quantization for vision transformers on automotive SoCs. Member of the MLPerf Inference working group (2022–present). Staff Engineer at Tesla Autopilot (2021–2024), where he ported HydraNets to FP8 on HW4 and authored the open-source `torch-quant-vit` library.

Education

  • PhD · EECS, UC Berkeley (2021), advisor: Trevor Darrell
  • BS · Computer Science, Carnegie Mellon University (2015)

Selected publications

  • Yoo et al., 'Q-ViT: Fully Quantized Vision Transformers for Edge Inference', ICCV 2021
  • Yoo & Darrell, 'Hardware-Aware Neural Architecture Search for Driving Perception', ECCV 2022
  • Yoo et al., 'INT4 Multi-Task Driving Networks on Embedded SoCs', MLSys 2024

Reading list

  • Howard et al., 'MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications', arXiv 2017
  • Tan & Le, 'EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks', ICML 2019
  • Touvron et al., 'Training data-efficient image transformers & distillation through attention' (DeiT), ICML 2021
  • Frantar et al., 'GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers', ICLR 2023
  • Liu et al., 'EfficientViT: Memory-Efficient Vision Transformer with Cascaded Group Attention', CVPR 2023

Lead experiments

No experiments led by this member yet.