Hyun-Min Chang Mocchibird

Hi, I'm Hyun-Min Chang 🦉

MSc EE/IT at ETH Zürich · AI Research Intern at Huawei Research Center Switzerland

I work on low-level ML systems, focusing on kernel development, benchmarking, and hardware-aware performance optimization for specialized accelerators.

Focus

Custom kernel development for Ascend NPUs
Benchmarking and performance analysis for ML workloads
Quantization, fused operators, and efficient inference
Embedded and resource-constrained ML systems

Selected work

Upstream contributions

Public contributions to huawei-csl/pto-kernels:

PR #62 — Fast Hadamard fused with dynamic quantization to int4
PR #49 — Fast Hadamard fused with fp16 → int8 dynamic quantization
PR #26 — PTO-ISA matmul with L2 cache locality optimization

Highlighted repositories

pto-kernels
Active development fork for Ascend NPU kernel work, experiments, benchmarking, and upstream contribution preparation.
pto-kernels-plots
Benchmark plots and performance analysis for kernel development and PR evaluation.
health-metrics
Self-hostable Streamlit app for tracking personal health metrics with local SQLite storage and authenticated editing.
MLonMCU
Coursework and project work on embedded ML for microcontrollers.

Interests

ML systems · performance engineering · kernel optimization · compilers · hardware-aware ML

Contact

GitHub: @Mocchibird
